Define the Genomic Ranges and Genomic Positions covered by the AmpliSeq<U+2122> Panel to include in the study and define SNP positions to exclude from the study.
Trimming amplicon ends is performed if specified. This function is mostly useful if you want to add some SNP positions which are not existing in the
positions_ranges.rda file provided within the package. It is provided to be able to reconstruct positions_ranges.rda
data.
PrepareLibrary(info.dir = "Info/", bed.filename = "lungcolonV2.bed.txt",
snp.filename = "ExAC.r0.3.sites.vep.vcf.gz",
snp.extra = c("chr2:212812097", "chr4:1807909", "chr7:140481511",
"chr14:105246474", "chr18:48586344", "chr19:1223055"),
output.name = "positions_ranges.rda", output.dir = info.dir,
load.from.broad.insitute = FALSE)
char, name of the folder containing the library information files (default 'Info/')
char, name of a BED table (tab-delimited) describing the Panel (with first 3 columns: "chr" (ex:chr1), "start position" (ex:115252190), "end position" (ex:115252305), i.e. the Ion AmpliSeq<U+2122> Colon and Lung Cancer Research Panel v2 (default 'lungcolonV2.bed.txt' as provided in the inst/extdata/Info folder of the package).
char, name of the vcf file describing known SNP positions, obtained from ftp://ftp.broadinstitute.org/pub/ExAC_release/release0.3/ExAC.r0.3.sites.vep.vcf.gz (default 'ExAC.r0.3.sites.vep.vcf.gz'). It requires a corresponding TBI file to be in the same folder (obtained from ftp://ftp.broadinstitute.org/pub/ExAC_release/release0.3/ExAC.r0.3.sites.vep.vcf.gz.tbi)
a vector of char, a vector of extra known snp positions manually curated (ex:"chrN:XXXXXXXXX")
char, filename to save pos_ind
and pos_snp
(default 'positions_ranges.rda')
char, directory where to save pos_ind
and pos_snp
(default info.dir
)
boolean, if TRUE load snp.filename
from Broad Institute ftp server otherwise use the file positions_ranges_broad.rda (default FALSE)
Save the following variables in a .rda file defined by output.name
in the folder defined by output.dir
:
pos_ranges
, a GRanges descriptor of amplicon positions
pos_ind
, a vector of char "chrN:XXXXXXXXX", defining ALL index positions
pos_snp
, a vector of char "chrN:XXXXXXXXX", defining SNP positions
Analysis of base-position error rate of next-generation sequencing to detect tumor mutations in circulating DNA N. P<U+00E9>cuchet, Y. Rozenholc, E. Zonta, D. Pietraz, A. Didelot, P. Combe, L. Gibault, J-B. Bachet, V. Taly, E. Fabre, H. Blons, P. Laurent-Puig in Clinical Chemistry
positions_ranges,
# NOT RUN {
bad.pos = "chr7:15478"
PrepareLibrary(info.dir='./',snp.extra=bad.pos,output.dir=paste0(tempdir(),'/'))
# }
Run the code above in your browser using DataLab