Question: what is the problem with using clustal to do pairwise alignment?
1
gravatar for glarue
2.9 years ago by
glarue40
United States
glarue40 wrote:

I have a set of putatively-paralogous protein sequence pairs that I'm trying to align. I (naively) used Clustal Omega to generate pairwise alignments, but am getting some weird downstream results (abnormally high dS values between many pairs) so I went back through the documentation and noticed that the Clustal suite insists that it is only to be used for multi-sequence alignments, not pairwise ones.

I am looking for some clarity as to what problems might be expected in using Clustal to generate pairwise alignments, so that I can better troubleshoot my data.

alignment • 2.2k views
ADD COMMENTlink modified 2.9 years ago by Hussain Ather940 • written 2.9 years ago by glarue40

Did you eye-ball the alignment that you got? I also (naively perhaps) would have expected Clustal to be fine with pairwise too. You can check the alignment by eye though to see if it does look screwy.

ADD REPLYlink written 2.9 years ago by Joe18k

So, I ran a comparison between Clustal Omega and NEEDLE (EMBOSS's recommended pairwise alignment tool) and, disconcertingly, with default options they both definitely give different alignments: different alignments

By eye, I guess the Clustal ones do seem worse but I'm not confident in my ability to assess them manually.

ADD REPLYlink modified 2.9 years ago • written 2.9 years ago by glarue40

Were they using the same mismatch/gap/extend penalties?

ADD REPLYlink written 2.9 years ago by Joe18k

Good point - no, they were both set to use the default values. I can't figure out how to set the gap penalties for Clustal Omega, but using ClustalW I tried setting the values to be the same as NEEDLE and this resulted in yet another, different, alignment.

ADD REPLYlink written 2.9 years ago by glarue40
1
gravatar for Hussain Ather
2.9 years ago by
Hussain Ather940
National Institutes of Health, Bethesda, MD
Hussain Ather940 wrote:

There are just differences between the methods of multiple sequence alignments and pairwise sequence alignments. Multiple sequence alignments can also use pairwise within their own methods. Clustal performs multiple sequence alignment by using pairwise sequence alignments and also constructing a guide tree for multiple alignment. It actually performs a "series of pairwise alignments of clusters of sequences." (Higgins et al 1988). For the alignment programs that only perform pairwise sequence alignments, dot-matrix methods, dynamic programming, and word methods might be used used.

ADD COMMENTlink written 2.9 years ago by Hussain Ather940

Thanks for the response. What I'm after, though, is a better understanding of why EMBOSS recommends against using Clustal for pairwise alignments - presumably they're worse than those produced by, say, NEEDLE, but in what way? What sort of issues might I see in the alignments that would be ameliorated by using a dedicated pairwise alignment algorithm?

ADD REPLYlink written 2.8 years ago by glarue40
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2050 users visited in the last hour