Question

Quantify similarity between multi-fasta files

0

Entering edit mode

9.3 years ago

rowe • 0

Hi there,

I want to compare the output of de novo assemblies of multiple samples. From this, I'd like to cluster the samples on (dis)similarity.

With bla(s)t, I get per-sequences scores (which I could use to get a percentage of similar bases between the query and database). With CD-HIT (EST), I do get clusters, but still no score/percentage.

Does anybody have a more straightforward solution for this?

Seasons greetings,

Robin

clustering fasta denovo similarity • 2.4k views

ADD COMMENT • link updated 2.1 years ago by Ram 43k • written 9.3 years ago by rowe • 0

Ram · Answer 1 · 2014-12-28

0

Entering edit mode

9.3 years ago

learnBioinformatics ▴ 60

If I remember right, clustalW can give the similarity matrix between sequences.

ADD COMMENT • link updated 2.1 years ago by Ram 43k • written 9.3 years ago by learnBioinformatics ▴ 60