Learn R Programming

SVMMaj (version 0.2.9.3)

diabetes: Pima Indians Diabetes Data Set

Description

From National Institute of Diabetes and Digestive and Kidney Diseases.

Arguments

Format

X is a data frame of 768 female patients with 8 attributes.

no.pregnantnumber of pregnancies.
glucoseplasma glucose concentration in an oral glucose tolerance test
blood.pressdiastolic blood pressure (mm Hg)
triceps.thicktriceps skin fold thickness (mm)
insulin2-Hour serum insulin (mu U/ml)
BMIbody mass index (weight in kg/(height in m)**2)
pedigreediabetes pedigree function
ageage in years

y contains the class labels: Yes or No, for diabetic according to WHO criteria.

The training set diabetes.tr contains a randomly selected set of 600 subjects, and diabetes.te contains the remaining 168 subjects. diabetes contains all 768 objects.

Details

Several constraints were placed on the selection of these instances from a larger database. In particular, all patients here are females at least 21 years old of Pima Indian heritage.

References

Smith, J.W., Everhart, J.E., Dickson, W.C., Knowler, W.C., & Johannes, R.S. (1988). Using the ADAP learning algorithm to forecast the onset of diabetes mellitus. In Proceedings of the Symposium on Computer Applications and Medical Care (pp. 261--265). IEEE Computer Society Press.

Examples

Run this code

attach(diabetes)
summary(X)
summary(y)

Run the code above in your browser using DataLab