Learn R Programming

GSE (version 4.2-1)

horse: Horse-colic data

Description

This is a modified version of the original data set (taken from UCI repository, see reference), where only quantitative variables are considered. This data set is about horse diseases where the task is to determine if the lesion of the horse was surgical or not. It contains rows with completely missing values except for ID and must be removed by the users. They are kept mainly for pedagogical purposes.

Usage

data(horse)

Arguments

Format

A data frame with 368 observations on the following 7 variables are quantitative and 1 categorical. The first variable is a numeric id.

Hospital_Numbernumeric id, i.e. the case number assigned to the horse (may not be unique if the horse is treated > 1 time)
Rectal_temperaturerectal temperature in degree celcius
Pulsethe heart rate in beats per minute; normal rate is 30-40 for adults
Respiratory_raterespiratory rate; normal rate is 8 to 10
Nasogastric_reflux_PHscale is from 0 to 14 with 7 being neutral; normal values are in the 3 to 4 range
Packed_cell_volumethe number of red cells by volume in the blood; normal range is 30 to 50
Total_proteinnormal values lie in the 6-7.5 (gms/dL) range
Abdomcentesis_total_proteinValues are in gms/dL
surgical_leisonwas the problem (lesion) surgical?; 1 = yes, 2 = no

References

Frank, A. & Asuncion, A. (2010). UCI Machine Learning Repository [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science.

Examples

Run this code
if (FALSE) {
data(horse)
horse.cts <- horse[,-c(1,9)] ## remove the id and categorical variable
res <- GSE(horse.cts)
plot(res, which="dd", xlog10=TRUE, ylog10=TRUE)
getOutliers(res)
}

Run the code above in your browser using DataLab