TileSeqs(dbFile, tblName = "Seqs", identifier = "", minLength = 26, maxLength = 27, maxTilePermutations = 10, minCoverage = 0.9, add2tbl = FALSE, processors = 1, verbose = TRUE, ...)
maxLength
.
minCoverage
of 0.9 request that additional tiles are added until 90% of the group is represented by the tile permutations.
NULL
to automatically detect and use all available processors.
SearchDB
.
data.frame
with a row for each tile, and multiple columns of information. The row_names
column gives the row number. The start
, end
, start_aligned
, and end_aligned
columns provide positioning of the tile in a consensus sequence formed from the group. The column misprime
is a logical specifying whether the tile meets the specified constraints. The columns width
and id
indicate the tile's length and group of origin, respectively.The coverage
field gives the fraction of sequences containing the tile in the group that encompass the tile's start and end positions in the alignment, whereas groupCoverage
contains the fraction of all sequences in the group containing a tile at their respective target site. For example, if only a single sequence out of 10 has information (no gap) in the first alignment position, then coverage
would be 100% (1.0), while groupCoverage
would be 10% (0.1).The final column, target_site
, provides the sequence of the tile.
TileSeqs
will create a set of overlapping tiles representing each target site in an alignment of sequences. The most common tile permutations are added until the desired minimum group coverage is obtained. The dbFile
is assumed to contain DNAStringSet
sequences (any U's are converted to T's).Target sites with one more more tiles not meeting a set of requirements are marked with misprime
equals TRUE
. Requirements include minimum group coverage, minimum length, and maximum length. Additionally, tiles are required not to contain more than four runs of a single base or four di-nucleotide repeats.
DesignPrimers
db <- system.file("extdata", "Bacteria_175seqs.sqlite", package="DECIPHER")
tiles <- TileSeqs(db, identifier="Pseudomonadales")
Run the code above in your browser using DataLab