Question: Which De Novo Assembler To Use For Meta-Transcriptomic Data
6.7 years ago by
bambus072550 wrote:


I have a couple of questions.

Firstly, I am looking for a de novo assembly tool that can assemble metatranscriptome data: paired-end reads (insert size = 300) consisting of mixed sequence reads from multiple species in a microbial community, sequenced on an Illumina Genome Analyzer. I have a small number of libraries, each holding from ~700,00 to 17 million reads.

While searching I came across MetaVelvet, a de novo metagenome assembler, and I am not sure whether it will work well with my data.

Secondly, I used the SortMeRNA tool to detect and remove rRNAs from the same data (the whole library) mentioned above, and the output was:

file1: non-rRNA reads

file2: rRNA reads

In total it classified 54.65% of the reads as rRNA and 45.35% as non-rRNA. But when I checked file1 (the one I need for my work), many rRNAs were still detected, which means the tool misclassified rRNAs as non-rRNA (I found this out by running blastx).

I then repeated the process, this time not on the whole library but only on the non-rRNA output (file1) generated by the first SortMeRNA run. This time it again split the data, classifying 51.3% as rRNA and 48.7% as non-rRNA, from the non-rRNA file alone.

Why couldn't the tool classify all the rRNAs present in the data in the first run, but then found them in the second run (still with a few misclassifications)?
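For reference, the two passes can be combined to see what share of the original library ended up flagged as rRNA. This is only a small arithmetic sketch; it assumes the second-pass percentages refer to file1 alone, as described above, and the function name is made up for illustration.

```python
# Percentages quoted in the question.
first_pass_rrna = 54.65      # % of the whole library called rRNA in run 1
first_pass_non_rrna = 45.35  # % of the whole library called non-rRNA in run 1 (file1)
second_pass_rrna = 51.3      # % of file1 called rRNA in run 2


def cumulative_rrna_percent(first_rrna, first_non_rrna, second_rrna):
    """Share of the original library flagged as rRNA after both passes."""
    # The second pass only saw file1, so its rRNA share is scaled by file1's size.
    return first_rrna + first_non_rrna * second_rrna / 100.0


total = cumulative_rrna_percent(first_pass_rrna, first_pass_non_rrna, second_pass_rrna)
print(f"rRNA after two passes: {total:.2f}% of the original library")
```

Under that assumption, roughly half of the reads left in file1 after the first run were called rRNA on the second run, which is a large enough jump that the inputs and settings of both runs are worth double-checking.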

It would be very helpful if someone could help me out.

Thanks in advance!!

bioinformatics tools assembly • 3.0k views
ADD COMMENTlink modified 6.7 years ago by Istvan Albert ♦♦ 84k • written 6.7 years ago by bambus072550

It is not a good idea to put two questions into a single post. It discourages contributions because no one likes to answer just half the question. You'll just end up with neither question being answered.

You should post each question separately.

ADD REPLYlink modified 6.7 years ago • written 6.7 years ago by Istvan Albert ♦♦ 84k

My gut feeling is that MetaVelvet is not the right tool. I think the uneven coverage from a transcriptome rather than a genome will confuse the k-mer binning step.

ADD REPLYlink written 6.7 years ago by c.v.oflynn90
6.7 years ago by
Istvan Albert ♦♦ 84k
University Park, USA
Istvan Albert ♦♦ 84k wrote:

metatranscriptome assembly is at a very incipient stage and whatever results you get will probably be wild guesstimates

as for the second question, I suspect that you are not running the tool on the correct files,

the problem intrigued me so we tried the same thing you describe, but we found exactly what we expected (0 rRNA in a non-rRNA file)
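One way to rule out a file mix-up is to check that the two output files really partition the input: no read ID in both outputs, and none missing. Below is a minimal sketch, assuming plain 4-line FASTQ records; the function names are hypothetical helpers, not part of SortMeRNA. With paired-end data the mates may share an ID or differ only in a /1 or /2 suffix, so this is best run per mate file.

```python
from itertools import islice


def fastq_ids(lines):
    """Collect read IDs from an iterable of FASTQ lines (4 lines per record)."""
    ids = set()
    # Every 4th line, starting with the first, is a header line.
    for header in islice(lines, 0, None, 4):
        fields = header[1:].split()
        if header.startswith("@") and fields:
            # Header looks like '@read_id optional description'.
            ids.add(fields[0])
    return ids


def check_partition(all_ids, rrna_ids, non_rrna_ids):
    """Report read IDs that break a clean rRNA / non-rRNA split of the input."""
    classified = rrna_ids | non_rrna_ids
    return {
        "in_both_outputs": rrna_ids & non_rrna_ids,
        "missing_from_outputs": all_ids - classified,
        "not_in_input": classified - all_ids,
    }
```

Each function takes any iterable of lines or set of IDs, so an open file handle can be passed straight to `fastq_ids`, for example `fastq_ids(open("library.fastq"))` where the file name is a placeholder. If all three sets in the report are empty, the files belong together.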

ADD COMMENTlink written 6.7 years ago by Istvan Albert ♦♦ 84k

Hi Istvan Albert,

Thanks for your suggestion; from now on I will post questions separately.

SortMeRNA: the files I ran SortMeRNA on were the right ones, and what you said is also true. For instance, I have 8 different large data files on which I applied the tool. For 4 of the 8 files the classification is perfect, i.e. they were well classified, whereas for the others I ran into this issue (rRNAs in the non-rRNA file). That is why I wonder how this could happen.

and thanks once again for your answer

ADD REPLYlink written 6.7 years ago by bambus072550

frankly my guess would be to try again, it is very easy to use the wrong file; otherwise post an example of a read that only gets found the second time around
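Pulling out such example reads could look roughly like the sketch below. It assumes 4-line FASTQ records and takes the three relevant files (rRNA and non-rRNA output of the first run, rRNA output of the second run) as iterables of lines; the function names are invented for illustration.

```python
def fastq_records(lines):
    """Yield (read_id, sequence) from an iterable of FASTQ lines (4 per record)."""
    it = iter(lines)
    for header in it:
        seq = next(it, "")
        next(it, "")  # '+' separator line
        next(it, "")  # quality line
        fields = header[1:].split()
        if header.startswith("@") and fields:
            yield fields[0], seq.strip()


def reads_found_only_second_time(run1_rrna, run1_non_rrna, run2_rrna, limit=5):
    """Return up to `limit` reads called non-rRNA in run 1 but rRNA in run 2."""
    first_hits = {rid for rid, _ in fastq_records(run1_rrna)}
    first_misses = {rid for rid, _ in fastq_records(run1_non_rrna)}
    examples = []
    for rid, seq in fastq_records(run2_rrna):
        # Keep only reads that run 1 put in the non-rRNA file and nowhere else.
        if rid in first_misses and rid not in first_hits:
            examples.append((rid, seq))
            if len(examples) == limit:
                break
    return examples
```

The returned sequences can be pasted into the thread or checked by hand. If the function returns nothing even though the second run reported rRNA hits, the second run was probably not given the first run's non-rRNA file.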

ADD REPLYlink written 6.7 years ago by Istvan Albert ♦♦ 84k

are few FP reads,



ADD REPLYlink written 6.7 years ago by bambus072550