VariantFilteringParam
is defined to ease configuring the call
to the functions that filter input genetic variants according to a desired
segregating inheritance model (xLinked()
, autosomalRecessiveHomozygous()
, etc).
VariantFilteringParam(vcfFilenames, pedFilename=NA_character_, bsgenome="BSgenome.Hsapiens.1000genomes.hs37d5", orgdb="org.Hs.eg.db", txdb="TxDb.Hsapiens.UCSC.hg19.knownGene", snpdb="SNPlocs.Hsapiens.dbSNP144.GRCh37", spliceSiteMatricesFilenames=NA, radicalAAchangeFilename=file.path(system.file("extdata", package="VariantFiltering"), "AA_chemical_properties_HanadaGojoboriLi2006.tsv"), codonusageFilename=file.path(system.file("extdata", package="VariantFiltering"), "humanCodonUsage.txt"), geneticCode=getGeneticCode("SGC0"), allTranscripts=FALSE, regionAnnotations=list(CodingVariants(), IntronVariants(), FiveSpliceSiteVariants(), ThreeSpliceSiteVariants(), PromoterVariants(), FiveUTRVariants(), ThreeUTRVariants()), otherAnnotations=c("MafDb.1Kgenomes.phase3.hs37d5", "MafDb.ExAC.r0.3.1.snvs.hs37d5", "PolyPhen.Hsapiens.dbSNP131", "SIFT.Hsapiens.dbSNP137", "phastCons100way.UCSC.hg19", "humanGenesPhylostrata"), geneKeytype=NA_character_, yieldSize=NA_integer_)
"show"(object)
"$"(x, name)
"names"(x)
BSgenome.Hsapiens.1000genomes.hs37d5
by defautl).
org.Hs.eg.db
by default).
TxDb.Hsapiens.UCSC.hg19.knownGene
by default).
The package GenomicFeatures
provides infraestructure to build such annotation packages from different sources such as online UCSC tracks, Biomart tables, or GFF
files.
SNPlocs.Hsapiens.dbSNP.20120608
by default).
NA
value indicates that splice sites will not be scored. To use this feature
assign to this argument the function spliceSiteMatricesHuman()
. See the files (hsap.donors.hcmc10_15_1.ibn
and hsap.acceptors.hcmc10_15_1.ibn
) returned by this function for details on their format.
AA_chemical_properties_HanadaGojoboriLi2006.tsv
) for details on its format.
getGeneticCode("SGC0")
, the standard genetic code. An alternative genetic code, for instance, is getGeneticCode("SGC1")
, the vertebrate mitochondrial genetic code. See getGeneticCode
in the Biostrings package for further details.
allTranscripts=TRUE
) or with only one transcript per variant.
VariantType-class
objects defining what regions to annotate.
NA_character_
indicates that it will be assumed to be an Entrez identifier unless the values in the GENEID
column returned by the TxDb package start with ENSG
and then it will be assumed that they are Ensembl gene identifiers, or with one of NM_, NP_, NR_, XM_, XP_, XR_ or YP_
and then it will be assumed that they are RefSeq gene identifiers.
TabixFile
function when opening the input VCF file and it allows to iterate through the variants in chunks of the given size to limit the memory requirements. Its default value (NA_integer_
) implies that the whole input VCF file will be read into main memory.
VariantFilteringParam()
.
VariantFilteringParam()
.
names()
to find out what these slots are.
VariantFilteringParam
object is returned by the method VariantFilteringParam
.
VariantFilteringParam
serves as a purpose of simplifying the call to the inheritance model function and its subsequent annotation and filtering steps. It also groups all the parameters that the user can customize (i.e newer versions of the annotation packages, when available). The method VariantFilteringParam()
creates an VariantFilteringParam
object used as an input argument to other functions such as autosomalRecessiveHomozygous()
, etc.
The method names()
allows one to see the names of the slots from a VariantFilteringParam
object. Using the $
operator, one can retrieve the values of these slots in an analogous way to a list
.
vfpar <- VariantFilteringParam(system.file("extdata", "CEUtrio.vcf.bgz", package="VariantFiltering"),
system.file("extdata", "CEUtrio.ped", package="VariantFiltering"),
snpdb=list(), otherAnnotations=character(0))
vfpar
names(vfpar)
vfpar$vcfFiles
Run the code above in your browser using DataLab