Format
The Arabidopsis data set contains a phenotype vector y
and a genotype matrix Z
:
- y
- Binary vector of phenotype AvrRpm1 for 84 inbred lines. See Atwell et al. 2010 Nature for details.
- Z
- A 84 times 216100 matrix of genotypes of the 84 inbred lines, with two different homozygotes coded as -1 and 1. 30 SNPs in the original data set that are fixed to only one homozygotic genotype were removed.
Source
Atwell, S., Y. S. Huang, B. J. Vilhjalmsson, G. Willems, M. Horton, et al., 2010. Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines. Nature 465: 627-631.