The khan data frame has 2308 rows and 65 columns. It is one of
the datasets used in the Tibshirani et al. paper in PNAS on nearest
shrunken centroids.
The first two columns contain gene ids and names, and the remaining
columns are gene expression values for 63 samples. An attribute
cancer_type contains the cancer type for each sample.