Question: Multiple Sequence Alignment tools
gravatar for Rory Osborne
3.8 years ago by
University of Warwick
Rory Osborne0 wrote:

Does anyone happen to know of any software/ online tool which allows me to do the following:

I would like to align several protein sequences (over 100) from one organism with a similar number from a different organism, to find which protein(s) from organism X are most like those from organism Y, or even which domains. I am already aware of the many MSA tools available online, but none of them exclude intra-species homology (I am only interested in where similarities occur between the two organisms). Checking protein by protein is obviously doable but would require an exhaustible number of alignments to be made. Ideally I would give this tool all of the protein sequences (FASTA) from one organism, and 1 sequence from the second organism, and it would return to me a list of proteins with the highest scoring homology, and where in the sequence the similarities occur. Alternatively I would give it all from both organisms and it would do a similar, although I appreciate this is far less likely to work.

Any suggestions would be greatly appreciated,


protein alignment sequence • 1.2k views
ADD COMMENTlink modified 3.6 years ago by stellafrancis2550 • written 3.8 years ago by Rory Osborne0

Unless I'm missing some complexity here, you don't need an MSA to achieve this. A local blast library will do this far more simply.

ADD REPLYlink written 3.8 years ago by george.ry1.1k

I agree with @george.ry, first use blast to find the most similar sequences. Then you can use these 'hit' sequences for a multiple alignment with your query sequence to compare it "better".

ADD REPLYlink written 3.8 years ago by Benn8.0k

Right said george.ry and b.nota. Another way to look into this problem could be look for a Bidirectional Best Hits i.e. searching a sequence from orgY against orgX, if a hit is found, then take the same hit and search against orgY. If birectional hits are found, then it gives you better confidence of homology. The concept is similar to COG. Probably What Is The Best Method To Find Orthologous Genes Of A Species? can give even better information.



Persistent LABS

ADD REPLYlink written 3.8 years ago by Persistent LABS740

You can use mafft then process the result to get what you need

ADD REPLYlink written 3.8 years ago by Medhat8.7k
gravatar for 5heikki
3.8 years ago by
5heikki8.9k wrote:

You could concatenate the proteins into one file and then cluster (e.g. with cd-hit) said file with high similarity percentage. Just be sure that you can tell from sequence names from which proteome the cluster members come from. Then you can examine the cluster output file..

ADD COMMENTlink written 3.8 years ago by 5heikki8.9k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1227 users visited in the last hour