Question: Fasta Alignment Format - Sequences Must Be Same Length?
3
gravatar for User 9996
6.6 years ago by
User 9996760
User 9996760 wrote:

If I have a FASTA file that gives aligned protein sequences for genes, e.g. a sequence for paralog A and a sequence for paralog B, is there a requirement that the aligned sequences be the same length? Do most alignment programs yield same length sequences at the end?

thanks.

ADD COMMENTlink modified 6.2 years ago by Daniel Standage3.7k • written 6.6 years ago by User 9996760
5
gravatar for Peter
6.6 years ago by
Peter90
European Union
Peter90 wrote:

Yes, alignment programs usually output alignments with equal-length sequences. Of course, if you align two sequences with different lengths, gap characters will be introduced (global alignment)

ADD COMMENTlink written 6.6 years ago by Peter90
1

I agree with Peter and Daniel.

It can be a little bit confusing when you look at your multi fasta file because the gaps may be represented two symbols : - at the end and beginning of sequences they are represented by spaces - inside the sequences they are represented by '-'

I prefer to use the same symbol for all gaps.

ADD REPLYlink written 6.6 years ago by Bilouweb1.1k
5
gravatar for Daniel Standage
6.6 years ago by
Daniel Standage3.7k
Davis, California, USA
Daniel Standage3.7k wrote:

No matter what the length of A and B are, A' and B' (the aligned sequences) will be of the same length. This simply comes from the definition of an alignment. The characters (representing base pairs) from the two sequences are arranged as to minimize the differences between them, and then the empty spaces (if any) are filled in with gaps (dash characters). These gaps are typically interpreted as evolutionary events between two homologous sequences, i.e. an insertion of nucleotides to one sequence or a deletion of nucleotides from the other (indels).

ADD COMMENTlink written 6.6 years ago by Daniel Standage3.7k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1337 users visited in the last hour