How to cluster (nearest neighbour method) amino acid peptide sequences based on sequence identity?
1
0
Entering edit mode
9.3 years ago

Hi,

I have a list of amino-acid peptides. For example:

ILM
FILM
FILVM
..
..

Now I need to cluster (nearest neighbour method) these peptides based on sequence identity

Please suggest any tool to do clustering

Thanks in advance

alignment • 4.7k views
ADD COMMENT
0
Entering edit mode
ADD REPLY
0
Entering edit mode
ADD REPLY
0
Entering edit mode
9.3 years ago
jockbanan ▴ 420

If it is just the matter of sequence identity, uclust/cd-hit would probably be the best. If you want to take into account biochemical properties of amino acids used, there are some more sophisticated tools out there, namely: MUSI and the Gibbs sampling tool (IMHO better than MUSI) and also Hammock (there is also a Galaxy version of it), which is suitable especially for very large datasets.

ADD COMMENT

Login before adding your answer.

Traffic: 2557 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6