Recovering Positions Of Identical Matches From Multiple/ Pairwise Sequence Alignment
1
0
Entering edit mode
11.0 years ago
VS ▴ 730

I would like to do a pairwise/multiple sequence alignment for a gene from two/three species and then record in a separate file the exact positions of perfect identity between the sequences. Is there a tool that does this already? Or is there any option in blast/clustal that could help in recovering this information ?

Will be very thankful for your time and responses.

sequence analysis msa blast • 2.9k views
ADD COMMENT
0
Entering edit mode

Have you tried GBlocks? http://molevol.cmima.csic.es/castresana/Gblocks.html I know it can extract highly conserved regions from an alignment, but not sure if you can configure it to extract only perfect matches. Edit: here is the webserver: http://www.vardb.org/vardb/analysis/gblocks.html

ADD REPLY
2
Entering edit mode
11.0 years ago

BLAT can return exact matches, by filtering for results where the query and target sizes (sequence lengths) match. Perhaps you could do an EMBOSS Needle alignment with a very high gap penalty (to help push down gapped alignment results) and then use BLAT to do the sequence position lookup on aligned results.

ADD COMMENT
0
Entering edit mode

Thanks for the answer! Any particular reason you recommend EMBOSS Needle over blast with similarly high gap penalty?

ADD REPLY
0
Entering edit mode

I think a local alignment (e.g., BLAST) won't always give you the edges of the query sequence in a search match, but global (e.g., Needleman-Wunsch) will. Because you want an exact match, you probably want those edges to match, too.

ADD REPLY

Login before adding your answer.

Traffic: 1613 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6