Nucleotide Sequence-Clustering Tools
Entering edit mode
12.4 years ago
Raghul ▴ 200

Hi to all I have a transcriptome. I extracted CDS for all sequences both complete & partial. The amino acid usage results show bias towards particular amino acids. Few amino acids are much more than expected which clearly indicates that certain sequences or family of sequences are highly represented. Are there any tools to cluster sequences based on similarity (not duplicates) to avoid redundancy? I have registered for a tool called Usearch & waiting for a reply, still have no idea whether it could be useful!

I also want to know whether the term sequence clustering is appropriate to use here. Because there are different meanings for this word in bioinformatic analysis.

thank u raghul

dna sequence clustering • 4.8k views
Entering edit mode
12.4 years ago

As indicated in How to cluster 454 reads?, you can use CD-HIT. You can also read this question: What softwares can be used for clustering nucleic acid fragments??


Login before adding your answer.

Traffic: 1779 users visited in the last hour
Help About
Access RSS

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6