Comparing two sets of 16s sequences to identify any identicle (within 97%) sequences
1
0
Entering edit mode
4.0 years ago

I have two sets of 16s sequences and want to see if there are any sequences within 97% similarity. Blastn uses local alignment and I am looking for global alignment (I think? please correct me on this). If I am indeed looking for global alignment what is an appropriate took to use that will return a % similarity score between the two sequeces?

alignment genome • 554 views
ADD COMMENT
0
Entering edit mode

Are you looking for similarities between the files or within the files too? My initial thought was simple to cluster everything at 97% identity with e.g. CD-HIT, and then investigate the clusters.

ADD REPLY
0
Entering edit mode

I am looking between files :) I will give CD-HIT a look.

ADD REPLY
0
Entering edit mode
4.0 years ago
Mensur Dlakic ★ 27k

At that level of sequence identity local and global alignments are likely to be the same. You can test it by aligning two sequence using BLASTn or Needleman-Wunsch's global algorithm.

ADD COMMENT

Login before adding your answer.

Traffic: 2456 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6