Question: Quantify similarity between multi-FASTA files
4.3 years ago by
United Kingdom
rowe wrote:

Hi there,

I want to compare the output of de novo assemblies of multiple samples. From this, I'd like to cluster the samples by (dis)similarity.

With BLAST or BLAT, I get per-sequence scores (which I could use to get the percentage of similar bases between the query and the database). With CD-HIT (EST), I do get clusters, but still no score or percentage.
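To make the BLAST route concrete, here is a minimal sketch of how I would turn per-sequence hits into one percentage per sample pair. It assumes BLAST tabular output (`-outfmt 6` with the default twelve columns) and keeps only the best-scoring hit per query, so a contig split across several alignments is undercounted. The function names and the toy data are made up for illustration.

```python
import io


def fasta_lengths(handle):
    """Return {sequence_id: length} for a multi-FASTA file handle."""
    lengths = {}
    current = None
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first word of the header as the sequence ID.
            current = line[1:].split()[0]
            lengths[current] = 0
        elif current is not None:
            lengths[current] += len(line)
    return lengths


def percent_similar(blast_tab, query_lengths):
    """Percentage of all query bases that are identical in the best hit.

    Assumes default outfmt 6 columns: qseqid, sseqid, pident, length, ...,
    bitscore (column 12). Queries without any hit contribute zero bases.
    """
    best = {}  # query ID -> (bitscore, identical bases in that hit)
    for line in blast_tab:
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        qid = fields[0]
        pident = float(fields[2])
        aln_length = int(fields[3])
        bitscore = float(fields[11])
        identical = pident / 100.0 * aln_length
        if qid not in best or bitscore > best[qid][0]:
            best[qid] = (bitscore, identical)
    total = sum(query_lengths.values())
    matched = sum(identical for _, identical in best.values())
    return 100.0 * matched / total if total else 0.0


# Toy data: two query contigs, only the first one has hits.
query_fasta = io.StringIO(">c1\nACGTACGTAC\n>c2\nGGGGGCCCCC\n")
blast_hits = io.StringIO(
    "c1\ts1\t100.0\t10\t0\t0\t1\t10\t1\t10\t1e-5\t20.0\n"
    "c1\ts2\t90.0\t10\t1\t0\t1\t10\t1\t10\t1e-3\t15.0\n"
)
lengths = fasta_lengths(query_fasta)
score = percent_similar(blast_hits, lengths)
print(score)
```

The value depends on which sample is the query, so for clustering I would presumably have to run every pair in both directions and combine the two numbers (for example by averaging) before converting to a dissimilarity.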

Does anybody have a more straightforward solution for this?

Season's greetings,


modified 4.3 years ago by learnBioinformatics • written 4.3 years ago by rowe
4.3 years ago by
United States
learnBioinformatics wrote:

If I remember right, ClustalW can output a pairwise similarity matrix for the sequences.
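To illustrate what such a matrix holds, here is a rough sketch that computes pairwise percent identity from sequences that have already been aligned (for example by ClustalW). This is not ClustalW's own scoring, just plain identity over ungapped columns, and the function names and toy alignment are made up.

```python
def percent_identity(a, b):
    """Percent identity over columns where neither aligned sequence has a gap."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == "-" or y == "-":
            continue  # skip gap columns
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def identity_matrix(aligned):
    """Return (names, matrix) with matrix[i][j] = percent identity of i vs j."""
    names = list(aligned)
    matrix = [
        [percent_identity(aligned[p], aligned[q]) for q in names]
        for p in names
    ]
    return names, matrix


# Toy alignment: three sequences of equal aligned length.
aligned = {
    "s1": "ACGTACGT",
    "s2": "ACGTACGA",
    "s3": "ACG-ACGA",
}
names, matrix = identity_matrix(aligned)
for name, row in zip(names, matrix):
    print(name, [round(value, 1) for value in row])
```

Subtracting each entry from 100 gives a dissimilarity that a clustering tool can work with.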


