FindSynteny(dbFile, tblName = "Seqs", identifier = "", useFrames = FALSE, alphabet = c("MF", "ILV", "A", "C", "WYQHP", "G", "TSN", "RK", "DE"), geneticCode = GENETIC_CODE, sepCost = -0.01, gapCost = -0.2, shiftCost = -20, codingCost = -3, maxSep = 5000, maxGap = 5000, minScore = 200, dropScore = -100, maskRepeats = TRUE, storage = 0.5, processors = 1, verbose = TRUE)
FALSE
(the default) then faster but less sensitive to distant homology.
alphabet
equal to AA_STANDARD
.
storage
gigabytes may be required, but will not be stored for later reuse.
NULL
to automatically detect and use all available processors.
FindSynteny
searches for ``hits'' between sequences that can be chained into collinear ``blocks'' of synteny. Hits are defined as k-mer exact nucleotide matches or k-mer matches in a reduced amino acid alphabet (if useFrames
is TRUE
). Hits are chained into blocks as long as they are: (1) within the same sequence, (2) within maxSep
and maxGap
distance, and (3) help maintain the score above minScore
. Blocks are extended from their first and last hit until their score drops below dropScore
from the maximum that was reached. This process results in a set of hits and blocks stored in an object of class ``Synteny''.
AlignSynteny
, Synteny-class
db <- system.file("extdata", "Influenza.sqlite", package="DECIPHER")
synteny <- FindSynteny(db, useFrames=TRUE, minScore=50)
synteny
pairs(synteny) # scatterplot matrix
Run the code above in your browser using DataLab