Details are given of the classifying factors for the 72 columns of the Golub data set.
data(golubInfo)
A data frame with 72 observations on the following 6 variables.
Each row describes one sample, that is, one column of the data set Golub.
Samples
a numeric vector: sample number
BM.PB
a factor with levels BM (from bone marrow) and PB (from peripheral blood)
Gender
a factor with levels F and M
Source
a factor with levels CALGB, CCG, DFCI and St-Jude. These are the hospitals from which the samples came.
tissue.mf
a factor with levels BM:NA, BM:f, BM:m, PB:NA, PB:f and PB:m. This factor identifies the combinations of tissue source (BM.PB) and Gender; NA marks samples whose gender is not recorded.
cancer
a factor with levels allB, allT and aml. There are two types of Acute Lymphoblastic Leukemia (allB and allT), plus Acute Myeloid Leukemia (aml).
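The level names of tissue.mf suggest that it pairs each sample's tissue source with a lower-cased gender code. How the data set's authors actually built the column is not stated here, so the following is only a minimal sketch of one plausible construction. It uses an invented toy data frame (`toy`) and a hypothetical helper vector (`gender_label`), neither of which is part of the package.

```r
# Toy data frame mimicking two of the documented columns.
# The values are invented for illustration; they are not rows of golubInfo.
toy <- data.frame(
  BM.PB  = factor(c("BM", "BM", "PB", "PB", "BM")),
  Gender = factor(c("F", "M", "F", NA, NA), levels = c("F", "M"))
)

# Lower-case the gender code and spell out missing values as the string "NA".
g <- as.character(toy$Gender)
gender_label <- ifelse(is.na(g), "NA", tolower(g))

# Join tissue source and gender label; fix the level order explicitly so it
# does not depend on the locale's sort order.
toy$tissue.mf <- factor(
  paste(toy$BM.PB, gender_label, sep = ":"),
  levels = c("BM:NA", "BM:f", "BM:m", "PB:NA", "PB:f", "PB:m")
)

table(toy$tissue.mf)
```

Setting the levels by hand keeps combinations that do not occur in the toy data (here PB:m) as empty levels, so tables always show all six categories.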
Golub, T. R., Slonim, D. K., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J. P., Coller, H., Loh, M. L., Downing, J. R., Caligiuri, M. A., Bloomfield, C. D. and Lander, E. S. (1999). Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring. Science, 531-537.
data(golubInfo)
str(golubInfo)
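Beyond `str()`, a few tabulations give a quick view of how the samples are spread across the documented factors. This is a sketch only: it assumes the data set is supplied by the hddplot package and that the package is installed, and the guard skips the code otherwise.

```r
# Assumption: golubInfo ships with the hddplot package.
if (requireNamespace("hddplot", quietly = TRUE)) {
  data(golubInfo, package = "hddplot")

  # Cancer type against tissue source (bone marrow vs peripheral blood).
  print(with(golubInfo, table(cancer, BM.PB)))

  # Number of samples contributed by each hospital.
  print(table(golubInfo$Source))

  # Gender, counting samples where it is not recorded.
  print(table(golubInfo$Gender, useNA = "ifany"))
}
```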