Question: Clustering of metabarcoding reads from many environmental samples
0
gravatar for lvogel
3.2 years ago by
lvogel30
Western Europe
lvogel30 wrote:

Hi,

I've been using CD-HIT EST in the past to cluster sequences of COI from environmental samples. For my current project, however, there are over 75 samples, and my supervisor wants to know if all of the sequences from all of the samples can be pooled, and then clustered, and then in the end we can know which sequences came from which sample. I tried searching, and I found that CD-HIT OTU does something like this with pooled sequences from multiple samples, but it's apparently only for 16s rRNA, and I have only eukaryotic organisms (COI). Could anyone recommend a clustering program for me?

Thank you for your time,

Laur

ADD COMMENTlink modified 21 months ago • written 3.2 years ago by lvogel30

Hello gb,

Actually, I did end up moving on to USEARCH for most of the steps of my metabarcoding pipeline. It worked well and required only minimal scripting knowledge. Recently, however, I've been tasked with even larger datasets, which exceeded the 32-bit memory of the free version of USEARCH. So I'm currently learning how to use VSEARCH. You're correct, though, and should get accepted answer.

ADD REPLYlink modified 21 months ago • written 21 months ago by lvogel30
1

Yes, I am in the same situation. With VSEARCH you can also make an otu table but with the lack of documentation it is difficult to find the right commands. Look at this page: https://github.com/torognes/vsearch/wiki/VSEARCH-pipeline

ADD REPLYlink written 20 months ago by gb1.2k
1
gravatar for gb
21 months ago by
gb1.2k
gb1.2k wrote:

Hello,

This answer comes a bit late but you can exactly do this with usearch.

https://www.drive5.com/usearch/manual/pipe_otus.html

https://www.drive5.com/usearch/manual/cmd_cluster_otus.html

If you prefer CD-HIT above uparse you need to do some extra steps for otu clustering like length trimming, find uniq read abundance and sort on the read abundance. Using CD-hit alone is not the proper way to do otu-clustering.

ADD COMMENTlink written 21 months ago by gb1.2k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1055 users visited in the last hour