updog
Flexible Genotyping for PolyploidsImplements empirical Bayes approaches to genotype
polyploids from next generation sequencing data while
accounting for allelic bias, overdispersion, and sequencing
error. The main function is flexdog
, which allows
the specification of many different genotype distributions.
An experimental function that takes into account varying
levels of relatedness is implemented in mupdog
.
Also provided are functions to simulate genotypes
(rgeno
) and read-counts
(rflexdog
), as well as functions to calculate
oracle genotyping error rates (oracle_mis
) and
correlation with the true genotypes (oracle_cor
).
These latter two functions are useful for read depth calculations.
Run browseVignettes(package = "updog")
in R
for example usage. The methods are described in detail in
Gerard et. al. (2018) and Gerard and Ferr<U+00E3>o (2019).
flexdog
The main function that fits an empirical Bayes approach to genotype polyploids from next generation sequencing data.
multidog
A convenience function for running
flexdog
over many SNPs. This function provides
support for parallel computing.
mupdog
An experimental approach to genotype autopolyploids that accounts for varying levels of relatedness between the individuals in the sample.
rgeno
simulate the genotypes of a sample
from one of the models allowed in flexdog
.
rflexdog
Simulate read-counts from the
flexdog
model.
plot.flexdog
Plotting the output of
flexdog
.
plot.mupdog
Plotting the output of
mupdog
.
oracle_joint
The joint distribution of the true genotype and an oracle estimator.
oracle_plot
Visualize the output of oracle_joint
.
oracle_mis
The oracle misclassification error rate (Bayes rate).
oracle_cor
Correlation between the true genotype and the oracle estimated genotype.
snpdat
A small example dataset for using
flexdog
.
uitdewilligen
A small example dataset
for using mupdog
.
mupout
The output from fitting
mupdog
to uitdewilligen
.
The package is named updog
for "Using
Parental Data for Offspring Genotyping" because
we originally developed the
method for full-sib populations, but it works
now for more general populations.
Our best competitor is probably the fitPoly
package,
which you can check out at
https://cran.r-project.org/package=fitPoly. Though, we think
that updog
returns better calibrated measures of uncertainty
when you have next-generation sequencing data.
If you find a bug or want an enhancement, please submit an issue at http://github.com/dcgerard/updog/issues.
Gerard, D., Ferr<U+00E3>o, L. F. V., Garcia, A. A. F., & Stephens, M. (2018). Genotyping Polyploids from Messy Sequencing Data. Genetics, 210(3), 789-807. doi: 10.1534/genetics.118.301468.
Gerard, D. and Ferr<U+00E3>o, L. F. V. (2019). Priors for Genotyping Polyploids. Bioinformatics. doi: 10.1093/bioinformatics/btz852.