genki: SNP derived performance metrics for Illumina 450K DNA methylation arrays.
Description
A very simple genotype calling by one-dimensional K-means clustering is
performed on each SNP, and for those SNPs where there are three genotypes, the squared deviations are summed for each genotype (similar
to a standard deviation for each of allele A homozygote, heterozygote and
allele B homozygote). By default these are further divided by the square root
of the number of samples to get a standard error-like statistic.
Usage
genki(bn, g = getsnp(rownames(bn)), se = TRUE)
Arguments
bn
a matrix of beta values(default method), a MethyLumiSet object (methylumi package), a MethylSet or RGChannelSet object (minfi package) or a exprmethy450 object (IMA package).
g
vector of SNP names
se
TRUE or FALSE specifies whether to calculate the standard error-like statistic
Value
a vector of 3 values for the dispersion of the three genotype
peaks (AA, AB, BB : low, medium and high beta values)
Details
There are 65 well-behaved SNP genotyping probes included on the array.
These each produce a distribution of betas with tight peaks for the three
possible genotypes, which will be broadened by technical variation between
samples. The spread of the peaks is thus usable as a performance metric.
References
Pidsley R, Wong CCY, Volta M, Lunnon K, Mill J, Schalkwyk LC:
A data-driven approach to preprocessing Illumina 450K methylation
array data (submitted)