pcaSnpFilters: Regions of SNP-PC correlation to filter for Principal Component Analysis
Description
Base positions for the LCT (2q21), HLA (including MHC), and inversion (8p23, 17q21.31)
regions from the GRCh36/hg18, GRCh37/hg19 and GRCh38/hg38 genome genome builds.
These regions result in high SNP-PC
correlation if they are included in Principal Component Analysis
(PCA). The pcaSnpFilters datasets can be used to filter SNPs prior to running PCA
to avoid correlations.
References
Novembre, John et al. (2008), Genes mirror geography within Europe.
Nature, 456: 98-101. doi:10.1038/nature07331