Dear all,
I am trying to cluster my sequence dataset so that I can determine the number of OTU's at various percent identities. I have tried with QIIME and it worked out well. However I am trying to use the mothur workflow to perform clustering of my sequence dataset. I am able to see that mothur requires alignment of the dataset to be done before we cluster them.
Is there a way we can skip the alignment step in mothur and cluster the sequences like how QIIME does or is this a oversight from my side??
You might want to check out my ISME J paper on why an alignment (to a reference) is important for analyzing 16S rRNA gene sequences (http://www.ncbi.nlm.nih.gov/pubmed/23018771). So in short, the requirement for alignment is a feature, not a bug. Now if you have ITS sequences, that's another ball of wax and I frankly wonder how those folks can align their sequences to anything or even use OTUs to do community ecology analyses.
I am having the same problem.
I am using ITS on fungi, I find Mothur useful and it serves as a great tool to use instead of or with qiime.
However, there is HUGE world of research using the ITS regions for fungi, We don't align, and we get great and useful data. Not recognizing this, and ignoring the fungal world is, in my opinion, significantly restricting mothur's utility to the microbial community.
Can we use mothur to cluster, without requiring alignments or not?