snpgdsBED2GDS

the file name of the binary BED file, which holds the genotype information

bed.fn

the file name of the FAM file, which holds the first six columns of a <code>".ped"</code> file

fam.fn

the file name of the extended MAP file, which has
        two extra columns holding the allele names

bim.fn

the file name of the output GDS file

out.gdsfn

if <code>TRUE</code>, include family information in the
        sample annotation

family

if <code>TRUE</code>, genotypes are stored in
        individual-major mode (i.e., all SNPs are listed for the first
        individual, then all SNPs for the second individual, etc.);
        if <code>NA</code>, the dimension is determined by the BED file

snpfirstdim
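
The two layouts can be pictured with a small base-R matrix. This is only an illustrative sketch, not SNPRelate code: the toy genotype values and the names <code>geno</code>, <code>individual_major</code> and <code>snp_major</code> are made up here. Because R stores matrices column by column, a SNP-by-sample matrix places all SNPs of the first individual first, which corresponds to the individual-major mode described above.

```r
# Toy data (hypothetical): 3 SNPs genotyped in 2 individuals.
geno <- matrix(
  c(0L, 1L, 2L,    # individual 1: snp1, snp2, snp3
    2L, 2L, 0L),   # individual 2: snp1, snp2, snp3
  nrow = 3, ncol = 2,
  dimnames = list(paste0("snp", 1:3), paste0("ind", 1:2))
)

# Individual-major order: SNP is the first (fastest-varying) dimension,
# so all SNPs of individual 1 come before those of individual 2.
individual_major <- as.vector(geno)

# SNP-major order: individual is the first dimension,
# so all individuals for snp1 come before those for snp2.
snp_major <- as.vector(t(geno))

print(individual_major)
print(snp_major)
```

Both vectors hold the same six genotypes; only the storage order differs.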

the compression method for the GDS variables
        other than "genotype"; the allowed values are defined in the function
        <code>add.gdsn</code>

compress.annotation

the compression method for "genotype"; the allowed
        values are defined in the function <code>add.gdsn</code>

compress.geno

<code>NULL</code> or an object from <code><a rd-options="" href="/link/snpgdsOption?package=SNPRelate&version=1.6.4" data-mini-rdoc="SNPRelate::snpgdsOption">snpgdsOption</a></code>,
        see details

option

<code>"int"</code> -- chromosome code in the GDS file is integer;
        <code>"char"</code> -- chromosome code in the GDS file is character

cvt.chr

<code>"int"</code> -- create an integer <code>snp.id</code>
        starting from 1; <code>"auto"</code> -- if the SNP IDs in the PLINK file are not
        unique, create an integer <code>snp.id</code>, otherwise use the SNP
        IDs as <code>snp.id</code>

cvt.snpid
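
The <code>"auto"</code> rule described above can be sketched in base R. The helper <code>make_snp_id</code> is hypothetical and does not show how SNPRelate performs the conversion internally; it only mirrors the documented behaviour of the two options.

```r
# Hypothetical helper (not part of SNPRelate): choose snp.id values
# following the documented meaning of cvt.snpid.
make_snp_id <- function(plink_ids, cvt.snpid = c("auto", "int")) {
  cvt.snpid <- match.arg(cvt.snpid)
  if (cvt.snpid == "int" || anyDuplicated(plink_ids) > 0) {
    # "int", or "auto" with duplicated IDs: integers starting from 1
    seq_along(plink_ids)
  } else {
    # "auto" with unique IDs: keep the PLINK SNP IDs
    plink_ids
  }
}

unique_ids    <- make_snp_id(c("rs1", "rs2", "rs3"))         # IDs kept
duplicate_ids <- make_snp_id(c("rs1", "rs2", "rs1"))         # integer IDs
forced_int    <- make_snp_id(c("rs1", "rs2", "rs3"), "int")  # integer IDs
```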

if <code>TRUE</code>, show information

verbose


    Convert a PLINK binary ped file to a GDS file.


GWAS

Genome-wide association studies (GWAS) are widely used to
investigate the genetic basis of diseases and traits, but they
pose many computational challenges. We developed an R package,
SNPRelate, to provide a binary format for single-nucleotide
polymorphism (SNP) data in GWAS using CoreArray Genomic
Data Structure (GDS) data files. The GDS format offers
efficient operations specifically designed for two-bit integers,
since a SNP genotype can be stored in only two bits. SNPRelate is
also designed to accelerate two key computations on SNP data
using parallel computing for multi-core symmetric
multiprocessing computer architectures: Principal Component
Analysis (PCA) and relatedness analysis using
Identity-By-Descent measures. The SNP GDS format is also used by the
GWASTools package with the support of S4 classes and generic functions.
The extended GDS format is implemented in the SeqArray package to
support the storage of single nucleotide variations (SNVs),
insertion/deletion polymorphisms (indels) and structural variation calls.

Xiuwen Zheng

SNPRelate

Parallel Computing Toolset for Relatedness and Principal Component
Analysis of SNP Data

snpgdsBED2GDS function

snpgdsBED2GDS: Conversion from PLINK BED to GDS

Description

Usage

Arguments

Value

Details

References

See Also

Examples
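
A minimal sketch of a conversion call, using only the arguments documented above. The wrapper <code>plink_to_gds</code> and the file prefix <code>"study"</code> are hypothetical and not part of SNPRelate; the wrapper assumes the three PLINK files share one prefix and checks that they exist before calling <code>snpgdsBED2GDS</code>.

```r
# Hypothetical wrapper (not part of SNPRelate): convert the PLINK fileset
# <prefix>.bed / <prefix>.fam / <prefix>.bim to a GDS file.
plink_to_gds <- function(prefix, out.gdsfn = paste0(prefix, ".gds")) {
  fns <- paste0(prefix, c(".bed", ".fam", ".bim"))
  missing_fns <- fns[!file.exists(fns)]
  if (length(missing_fns) > 0) {
    stop("missing PLINK file(s): ", paste(missing_fns, collapse = ", "))
  }
  SNPRelate::snpgdsBED2GDS(
    bed.fn = fns[1], fam.fn = fns[2], bim.fn = fns[3],
    out.gdsfn = out.gdsfn,
    cvt.chr = "int",      # integer chromosome codes
    cvt.snpid = "auto",   # keep SNP IDs when they are unique
    verbose = TRUE
  )
  invisible(out.gdsfn)
}

# Runs only if the assumed "study" fileset is present in the working directory.
if (all(file.exists(paste0("study", c(".bed", ".fam", ".bim"))))) {
  gds.fn <- plink_to_gds("study")
  genofile <- SNPRelate::snpgdsOpen(gds.fn)  # open the converted file
  print(genofile)                            # summary of the GDS contents
  SNPRelate::snpgdsClose(genofile)           # always close the file handle
}
```

Checking the input files first gives a clearer error message than letting the conversion fail partway through.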