Learn R Programming

ipred (version 0.9-15)

dystrophy: Detection of muscular dystrophy carriers.

Description

The dystrophy data frame has 209 rows and 10 columns.

Usage

data(dystrophy)

Arguments

Format

This data frame contains the following columns:

OBS

numeric. Observation number.

HospID

numeric. Hospital ID number.

AGE

numeric, age in years.

M

numeric. Month of examination.

Y

numeric. Year of examination.

CK

numeric. Serum marker creatine kinase.

H

numeric. Serum marker hemopexin.

PK

numeric. Serum marker pyruvate kinase.

LD

numeric. Serum marker lactate dehydroginase.

Class

factor with levels, carrier and normal.

Details

Duchenne Muscular Dystrophy (DMD) is a genetically transmitted disease, passed from a mother to her children. Affected female offspring usually suffer no apparent symptoms, male offspring with the disease die at young age. Although female carriers have no physical symptoms they tend to exhibit elevated levels of certain serum enzymes or proteins.
The dystrophy dataset contains 209 observations of 75 female DMD carriers and 134 female DMD non-carrier. It includes 6 variables describing age of the female and the serum parameters serum marker creatine kinase (CK), serum marker hemopexin (H), serum marker pyruvate kinase (PK) and serum marker lactate dehydroginase (LD). The serum markers CK and H may be measured rather inexpensive from frozen serum, PK and LD requires fresh serum.

References

Robert Tibshirani and Geoffry Hinton (1998), Coaching variables for regression and classification. Statistics and Computing 8, 25-33.

Examples

Run this code
if (FALSE) {

data("dystrophy")
library("rpart")
errorest(Class~CK+H~AGE+PK+LD, data = dystrophy, model = inbagg, 
pFUN = list(list(model = lm, predict = mypredict.lm), list(model = rpart)), 
ns = 0.75, estimator = "cv")
}

Run the code above in your browser using DataLab