This function is a wrapper for MAFFT and can be used for (profile) aligning of DNA and amino acid sequences.
mafft(x, y, add, method = "auto", maxiterate = 0, op = 1.53,
ep = 0, gt, options, thread = -1, exec, quiet, file)An object of class DNAbin or AAbin.
An object of class DNAbin or AAbin, if given both
x and y are preserved and aligned to each other ("profile
alignment").
A character string giving the method used for adding y to
x: "add", "addprofile" (default), or any unambiguous
abbreviation of these.
A character string giving the alignment method. Available
accuracy-oriented methods for less than 200 sequences are
"localpair", "globalpair", and "genafpair";
"retree 1" and "retree 2" are for speed-oriented alignment.
The default is "auto", which lets MAFFT choose an appropriate
alignment method.
An integer giving the number of cycles of iterative
refinement to perform. Possible choices are 0: progressive method,
no iterative refinement (default); 2: two cycles of iterative
refinement; 1000: at most 1000 cycles of iterative refinement.
A numeric giving the gap opening penalty at group-to-group
alignment; default 1.53.
A numeric giving the offset value, which works like gap
extension penalty, for group-to-group alignment; default 0.0, but 0.123 is
recommended if no long indels are expected.
An object of class phylo that is to be used as a
guide tree during alignment.
A vector of mode character specifying additional arguments to
MAFFT, that are not included in mafft such as, e.g.,
--adjustdirection.
Integer giving the number of physical cores MAFFT should use;
with thread = -1 the number of cores is determined automatically.
A character string giving the path to the MAFFT executable
including its name, e.g. something like /user/local/bin/mafft under
UNIX-alikes.
Logical, if set to TRUE, mafft progress is printed out on
the screen.
A character string indicating the filename of the output FASTA
file; if this is missing the the alignment will be returned as matrix of
class DNAbin or AAbin.
A matrix of class "DNAbin" or "AAbin".
"localpair" selects the L-INS-i algorithm, probably
most accurate; recommended for <200 sequences; iterative refinement method
incorporating local pairwise alignment information.
"globalpair" selects the G-INS-i algorithm suitable for
sequences of similar lengths; recommended for <200 sequences; iterative
refinement method incorporating global pairwise alignment information.
"genafpair" selects the E-INS-i algorithm suitable for
sequences containing large unalignable regions; recommended for <200
sequences.
"retree 1" selects the FFT-NS-1 algorithm, the simplest
progressive option in MAFFT; recommended for >200 sequences.
"retree 2" selects the FFT-NS-2 algorithm that uses a second
iteration of alignment based on a guide tree computed from an FFT-NS-1
alignment; this is the default in MAFFT; recommended for >200 sequences.
Katoh, K. and H. Toh. 2008. Recent developments in the MAFFT multiple sequence alignment program. Briefings in Bioinformatics 9: 286-298.
Katoh, K., K.-i. Kuma, H. Toh, and T. Miyata. 2005. Mafft version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Research 33: 511--518.
Katoh, K., K. Misawa, K.-i. Kuma, and T. Miyata. 2002. Mafft: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleid Acids Research 30: 3059--3066.
read.fas to import DNA sequences; prank
for another alignment algorithm; gblocks and
aliscore for alignment cleaning.