msaMuscle(inputSeqs, cluster="default", gapOpening="default", gapExtension="default", maxiters="default", substitutionMatrix="default", type="default", order=c("aligned", "input"), verbose=FALSE, help=FALSE, ...)
msa
.
In the original MUSCLE implementation, this
parameter is called -in
."upgma"
, "upgmamax"
,
"upgmamin"
, "upgmb"
, and "neighborjoining"
.le=TRUE
, the default is 2.9, for
sp=TRUE
, the default is 1,439, and for sv=TRUE
,
the default is 300. Note that these defaults may not be suitable
if custom substitution matrices are being used. In such a case,
a sensible choice of gap penalties that fits well to the
substitution matrix must be made.maxiters
to 0 which leads to an (out of memory) error.
Therefore, maxiters=0
is not allowed in msaMuscle
.msaMuscle
.inputSeqs
;
see msa
.msa
for more details); the original MUSCLE
implementation does not allow for preserving the order of input
sequences. The msaMuscle
function realizes this functionality
by reverse matching of sequence names. Therefore, the sequences
need to have unique names. If the sequences do not have
names or if the names are not unique, the msaMuscle
function assignes generic unique names "Seq1"
-Seqn
to the sequences and issues a warning. The choice "input"
is not available at all for sequence data that is
read directly from a FASTA file.TRUE
, the algorithm displays detailed
information and progress messages.TRUE
, information about algorithm-specific
parameters is displayed. In this case, no multiple sequence
alignment is performed and the function quits after displaying
the additional help information.msaMuscle
with help=TRUE
.
For more details, see also the documentation of MUSCLE.msaMuscle
returns a MsaAAMultipleAlignment
,
MsaDNAMultipleAlignment
, or
MsaRNAMultipleAlignment
object.
If called with help=TRUE
, msaMuscle
returns
an invisible NULL
.
inputSeqs
argument above). Parameters that
are common to all multiple sequences alignments provided by the
msa package are explicitly provided by the function and named
in the same for all algorithms. Most other parameters that are
specific to MUSCLE can be passed to MUSCLE via additional
arguments (see argument help
above). For a note on the order of output sequences and direct reading from
FASTA files, see msa
.
http://www.drive5.com/muscle/muscle.html Edgar, R. C. (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32(5):1792-1797. DOI: 10.1093/nar/gkh340.
Edgar, R. C. (2004) MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics 5:113. DOI: 10.1186/1471-2105-5-113.
msa
, MsaAAMultipleAlignment
,
MsaDNAMultipleAlignment
,
MsaRNAMultipleAlignment
,
MsaMetaData
## read sequences
filepath <- system.file("examples", "exampleAA.fasta", package="msa")
mySeqs <- readAAStringSet(filepath)
## call msaMuscle with default values
msaMuscle(mySeqs)
## call msaMuscle with custom parameters
msaMuscle(mySeqs, gapOpening=12, gapExtension=3, maxiters=16,
cluster="upgmamax", SUEFF=0.4, brenner=FALSE,
order="input", verbose=FALSE)
## call msaMuscle with a custom substitution matrix
data(PAM120)
msaMuscle(mySeqs, substitutionMatrix=PAM120)
Run the code above in your browser using DataLab