recstat
in order
to determine if the regions located between two Stop codons correspond to putative CDSs.test.co.recstat(rec, fac = 1, length.min = 150, stop.max = 0.2, win.lim = 0.8,
direct = TRUE, level = 0.01)
recstat
function.fac
$\ge$ 1).Start
and End
give the location of starting and ending
positions of the region; and CDS
is a binary indicator equal to 1 if a putative CDS is
predicted, and to 0 if not.length.min
nucleotides. For each window inside a region considered, a Kruskal-Wallis test
is computed on the factor scores of the codons found in this window, this for the three possible
reading frames. If a proportion of at least win.lim
windows in the region reject the null
hypothesis of means equality between the reading frames, then, there is a good probability that
a CDS is located in the region.
Inside the first and the last windows of a region submitted to the test, the relative position of
the two Stop codons is used to determine if those windows can be used in the analysis. If the
first Stop is located within the stop.max
fraction of the 5' end of the window, then this
window is kept in the analysis. In the same way, if the second Stop is located within the
stop.max
fraction of the 3' end of the window, this window is also kept in the analysis.test.li.recstat
library(seqinr)
library(ade4)
ff <- system.file("sequences/ECOUNC.fsa", package = "seqinr2")
seq <- read.fasta(ff)
rec <- recstat(seq[[1]], seqname = getName(seq))
test.co.recstat(rec)
Run the code above in your browser using DataLab