Genotype and intensity data are stored in external files (GDS or NetCDF), so it is
possible to analyze data sets that are too large to be contained in
memory. The GenotypeReader
class and IntensityReader
class unions provide a common interface for GDS and NetCDF files.
Two sets of classes for annotation are provided.
SnpAnnotationDataFrame
and
ScanAnnotationDataFrame
extend
AnnotatedDataFrame
and provide in-memory containers for SNP and
scan annotation and metadata.
SnpAnnotationSQLite
and
ScanAnnotationSQLite
provide interfaces to SNP and scan
annotation and metadata stored in SQLite databases.
The GenotypeData
and IntensityData
classes
combine genotype or intensity data with SNP and scan annotation,
ensuring that the data in the NetCDF files is consistent with
annotation through unique SNP and scan IDs. A majority of the
functions in the GWASTools package take GenotypeData
and/or IntensityData
objects as arguments.