**0**wrote:

Two questions about MAFFT.

(a) Suppose I have an unaligned nucleotide multi-fasta file A, with no gap ('-') characters, but it might contain any of the other IUPAC nucleotide characters. I align it using MAFFT, default options, this gives multi-fasta file B, which obviously may contain gaps. Suppose I then strip out all the gap characters to give file C. Is it guaranteed that A and C will be the same? (allowing for trivial differences in sequence order and upper/lower case).

(b) Suppose I have two unaligned nucleotide files A1 and A2. I align A1 with MAFFT, default options to give B1. I then align B1 and A2 using MAFFT --add, with A2 passed in as the "new sequences". This gives B2. Separately, I concatenate A1 and A2 to give A3, then align A3 using MAFFT with default options to give B3. Are B2 and B3 "algorithmically" equivalent? i.e. the only differences would be down to things like arbitrary stochastic choices.

josh

**20**• written 7 months ago by josh.singer •

**0**