Learn R Programming

VariantFiltering (version 1.8.6)

VariantFilteringParam-class: VariantFiltering parameter class

Description

The class VariantFilteringParam is defined to ease configuring the call to the functions that filter input genetic variants according to a desired segregating inheritance model (xLinked(), autosomalRecessiveHomozygous(), etc).

Usage

VariantFilteringParam(vcfFilenames, pedFilename=NA_character_, bsgenome="BSgenome.Hsapiens.1000genomes.hs37d5", orgdb="org.Hs.eg.db", txdb="TxDb.Hsapiens.UCSC.hg19.knownGene", snpdb="SNPlocs.Hsapiens.dbSNP144.GRCh37", spliceSiteMatricesFilenames=NA, radicalAAchangeFilename=file.path(system.file("extdata", package="VariantFiltering"), "AA_chemical_properties_HanadaGojoboriLi2006.tsv"), codonusageFilename=file.path(system.file("extdata", package="VariantFiltering"), "humanCodonUsage.txt"), geneticCode=getGeneticCode("SGC0"), allTranscripts=FALSE, regionAnnotations=list(CodingVariants(), IntronVariants(), FiveSpliceSiteVariants(), ThreeSpliceSiteVariants(), PromoterVariants(), FiveUTRVariants(), ThreeUTRVariants()), otherAnnotations=c("MafDb.1Kgenomes.phase3.hs37d5", "MafDb.ExAC.r0.3.1.snvs.hs37d5", "PolyPhen.Hsapiens.dbSNP131", "SIFT.Hsapiens.dbSNP137", "phastCons100way.UCSC.hg19", "humanGenesPhylostrata"), geneKeytype=NA_character_, yieldSize=NA_integer_) "show"(object) "$"(x, name) "names"(x)

Arguments

vcfFilenames
Character string of the input VCF file name.
pedFilename
Character string of the pedigree file name in PED format.
bsgenome
Character string of a genome annotation package (BSgenome.Hsapiens.1000genomes.hs37d5 by defautl).
orgdb
Character string of a gene-centric annotation package (org.Hs.eg.db by default).
txdb
Character string of a transcript-centric annotation package (TxDb.Hsapiens.UCSC.hg19.knownGene by default). The package GenomicFeatures provides infraestructure to build such annotation packages from different sources such as online UCSC tracks, Biomart tables, or GFF files.
snpdb
Character string of a SNP-centric annotation package (SNPlocs.Hsapiens.dbSNP.20120608 by default).
spliceSiteMatricesFilenames
Names of text files containing position weight matrices for splice site recognition. The default NA value indicates that splice sites will not be scored. To use this feature assign to this argument the function spliceSiteMatricesHuman(). See the files (hsap.donors.hcmc10_15_1.ibn and hsap.acceptors.hcmc10_15_1.ibn) returned by this function for details on their format.
radicalAAchangeFilename
Name of a tab-separated text file containing chemical properties of amino acids. These properties are interpreted such that amino acid changes within a property are considered "conservative" and between properties are considered "radical". See the default file (AA_chemical_properties_HanadaGojoboriLi2006.tsv) for details on its format.
codonusageFilename
Name of a text file containing the codon usage.
geneticCode
Name character vector of length 64 describing the genetic code. The default value is getGeneticCode("SGC0"), the standard genetic code. An alternative genetic code, for instance, is getGeneticCode("SGC1"), the vertebrate mitochondrial genetic code. See getGeneticCode in the Biostrings package for further details.
allTranscripts
Logical. This option allows the user to choose between working with all the transcripts affected by the variant (allTranscripts=TRUE) or with only one transcript per variant.
regionAnnotations
List of VariantType-class objects defining what regions to annotate.
otherAnnotations
Character vector of names of annotation packages or annotation objects.
geneKeytype
Character vector of the type of key gene identifier provided by the transcript-centric (TxDb) annotation package to interrogate the organism-centric (OrgDb) annotation package. The default value (NA_character_ indicates that it will be assumed to be an Entrez identifier unless the values in the GENEID column returned by the TxDb package start with ENSG and then it will be assumed that they are Ensembl gene identifiers, or with one of NM_, NP_, NR_, XM_, XP_, XR_ or YP_ and then it will be assumed that they are RefSeq gene identifiers.
yieldSize
Number of variants to yield each time the input VCF file is read. This argument is passed to the TabixFile function when opening the input VCF file and it allows to iterate through the variants in chunks of the given size to limit the memory requirements. Its default value (NA_integer_) implies that the whole input VCF file will be read into main memory.
object
A VariantFilteringParam object created through VariantFilteringParam().
x
A VariantFilteringParam object created through VariantFilteringParam().
name
Slot name of a VariantFilteringParam object. Use names() to find out what these slots are.

Value

An VariantFilteringParam object is returned by the method VariantFilteringParam.

Details

The class VariantFilteringParam serves as a purpose of simplifying the call to the inheritance model function and its subsequent annotation and filtering steps. It also groups all the parameters that the user can customize (i.e newer versions of the annotation packages, when available).

The method VariantFilteringParam() creates an VariantFilteringParam object used as an input argument to other functions such as autosomalRecessiveHomozygous(), etc.

The method names() allows one to see the names of the slots from a VariantFilteringParam object. Using the $ operator, one can retrieve the values of these slots in an analogous way to a list.

Examples

Run this code
vfpar <- VariantFilteringParam(system.file("extdata", "CEUtrio.vcf.bgz", package="VariantFiltering"),
                               system.file("extdata", "CEUtrio.ped", package="VariantFiltering"),
                               snpdb=list(), otherAnnotations=character(0))
vfpar
names(vfpar)
vfpar$vcfFiles

Run the code above in your browser using DataLab