Question: Two questions about the MAFFT alignment tool.
gravatar for josh.singer
7 months ago by
United Kingdom / Glasgow / Centre for Virus Research
josh.singer0 wrote:

Two questions about MAFFT.

(a) Suppose I have an unaligned nucleotide multi-fasta file A, with no gap ('-') characters, but it might contain any of the other IUPAC nucleotide characters. I align it using MAFFT, default options, this gives multi-fasta file B, which obviously may contain gaps. Suppose I then strip out all the gap characters to give file C. Is it guaranteed that A and C will be the same? (allowing for trivial differences in sequence order and upper/lower case).

(b) Suppose I have two unaligned nucleotide files A1 and A2. I align A1 with MAFFT, default options to give B1. I then align B1 and A2 using MAFFT --add, with A2 passed in as the "new sequences". This gives B2. Separately, I concatenate A1 and A2 to give A3, then align A3 using MAFFT with default options to give B3. Are B2 and B3 "algorithmically" equivalent? i.e. the only differences would be down to things like arbitrary stochastic choices.


mafft alignment • 201 views
ADD COMMENTlink modified 7 months ago by Biostar ♦♦ 20 • written 7 months ago by josh.singer0
gravatar for Mensur Dlakic
7 months ago by
Mensur Dlakic7.2k
Mensur Dlakic7.2k wrote:

a) Yes

b) Not sure, but probably not. In your first example, B1 is used as a guide to align A2. In your second example, all sequences would be aligned together. As long as A1 and A2 are comparable I would not expect B2 and B3 to be wildly different, but I would not expect them to be identical either.

ADD COMMENTlink written 7 months ago by Mensur Dlakic7.2k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 988 users visited in the last hour