number of SNPs that should be read in as one chunk
tid
which chromosome ? (character)
frompos
start of the region
topos
end of the region
samplenames
a vector of individual names/IDs
gffpath
the corresponding GFF file
include.unknown
including positions with unknown nucleotides
Value
The function creates an object of class "GENOME"
---------------------------------------------------------
The following slots will be filled in the "GENOME" object
---------------------------------------------------------
Slot
Description
1.
n.sites
total number of sites
2.
n.biallelic.sites
number of biallelic sites
3.
region.data
some detailed information on the data read
4.
region.names
names of regions
Details
WhopGenome is required ! require(WhopGenome)
WhopGenome provides some powerful filter meachanisms which can be applied to
the VCF reading process. The filter rules can be set via WhopGenome functions.
Whop_readVCF expects a vcf_handle returned from vcf_open.
The Whop_readVCF function expects a tabixed VCF with a diploid GT-field.
In case of haploid data, the GT-field has to be transformed to a pseudo-
diploid field (0 -> 0|0 etc.). An alternative is to use readData(..., format="VCFhap")
which can read non-tabixed haploid VCFs directly.
The ff-package we use limits the data size to
individuals * (number of SNPs) <= .Machine$integer.max
In case of very large data sets, the bigmemory package will be used.
This may slow down calculations.
See also readData(..., format="VCF") !