Question: Merge PE reads for transcript de novo assembly?
5.2 years ago by
European Union
Paul1.4k wrote:

Dear all,

Do you recommend merging paired-end Illumina reads for transcript de novo assembly? Thank you for any advice and for sharing your experiences.


modified 2.9 years ago by Biostar ♦♦ 20 • written 5.2 years ago by Paul1.4k

Yes, I got better results after joining the reads, but some bioinformaticians would probably disagree. So it is better to run the de novo assembly both ways, then compare the average contig length, the total number of contigs, and the total assembly length. Then you can argue which one worked better for you.
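The comparison described above can be sketched in a few lines of Python. This is only a minimal illustration, assuming each assembly is a plain FASTA file of contigs; the function names `read_fasta_lengths` and `assembly_stats` are made up for this example, and N50 is an extra, commonly reported metric that was not mentioned in the reply.

```python
def read_fasta_lengths(lines):
    """Yield the length of each sequence found in FASTA-formatted lines."""
    length = None
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if length is not None:
                yield length
            length = 0
        elif line and length is not None:
            length += len(line)
    if length is not None:
        yield length


def assembly_stats(lengths):
    """Summarise contig lengths: count, total length, mean length, N50."""
    lengths = sorted(lengths, reverse=True)
    total = sum(lengths)
    if not lengths:
        return {"contigs": 0, "total_length": 0, "mean_length": 0.0, "n50": 0}
    # N50 (an addition, not from the thread): the length of the contig at
    # which the running sum, longest first, reaches half of the total.
    running = 0
    n50 = 0
    for contig_length in lengths:
        running += contig_length
        if running * 2 >= total:
            n50 = contig_length
            break
    return {
        "contigs": len(lengths),
        "total_length": total,
        "mean_length": total / len(lengths),
        "n50": n50,
    }


if __name__ == "__main__":
    # Tiny in-memory example; with real data, pass an open file handle
    # for each assembly (merged and unmerged) and compare the results.
    demo = [">contig1", "ACGTAC", ">contig2", "AAA"]
    print(assembly_stats(read_fasta_lengths(demo)))
```

Running it once on the assembly from merged reads and once on the assembly from unmerged pairs gives the numbers side by side. These metrics only describe contiguity, so they are a starting point for the argument rather than proof that one assembly is more correct.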



written 5.2 years ago by Manvendra Singh2.1k

Thank you for the comment. That would probably be the best solution.

written 5.2 years ago by Paul1.4k
5.2 years ago by
Istvan Albert ♦♦ 84k
University Park, USA
Istvan Albert ♦♦ 84k wrote:

It all depends on how many reads overlap. If the majority of the reads overlap, then there is little to be gained from treating them as paired-end. In fact, it should be counterproductive to do so, as the system has to deal with more, and redundant, data.

Logic dictates that providing more information to the system ought to make it perform better. In this case the extra information is that the reads are overlapping and an external tool solved that problem.

Now, in reality and in practice, given the way algorithms are built, tuned, and released, it may well be that, depending on the tool and version, you end up with unexpected performance when choosing one option over the other. Hence, as Manvendra Singh suggests, I think it is best to be cautious and evaluate both methods.





modified 5.2 years ago • written 5.2 years ago by Istvan Albert ♦♦ 84k

My PE reads overlap 90%; I merged them and got really good results with high k-mers. I only used merged PEs in the assembly, avoiding many chimeric scaffolds.

modified 5.2 years ago • written 5.2 years ago by apelin20470

Thank you guys for the comments and for sharing your experiences. Which tool is best for evaluating whether my reads overlap?

written 5.2 years ago by Paul1.4k

Try using MeFit. We have found it to work best for getting overlapping reads. Another option is FLASH.
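For a rough feel of what "overlapping" means before running a dedicated merger, here is a small Python sketch. It is not how MeFit or FLASH work internally; it is a naive illustration that assumes standard forward/reverse Illumina pairs, ignores base qualities, and does not handle fragments shorter than the read length. The names `find_overlap` and `fraction_overlapping`, and the default thresholds, are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_overlap(read1, read2, min_overlap=10, max_mismatch_frac=0.1):
    """Return the overlap length between read1 and the reverse complement
    of read2, or 0 if no acceptable overlap is found.

    Tries the longest candidate first: the end of read1 is compared with
    the start of the reverse-complemented mate, and a candidate is accepted
    when its mismatch fraction is within max_mismatch_frac.
    """
    mate = reverse_complement(read2)
    longest = min(len(read1), len(mate))
    for overlap in range(longest, min_overlap - 1, -1):
        tail = read1[-overlap:]
        head = mate[:overlap]
        mismatches = sum(a != b for a, b in zip(tail, head))
        if mismatches <= max_mismatch_frac * overlap:
            return overlap
    return 0


def fraction_overlapping(pairs, **kwargs):
    """Return the fraction of (read1, read2) pairs with a detectable overlap."""
    pairs = list(pairs)
    if not pairs:
        return 0.0
    hits = sum(1 for r1, r2 in pairs if find_overlap(r1, r2, **kwargs) > 0)
    return hits / len(pairs)


if __name__ == "__main__":
    # A 30 nt fragment sequenced with 20 nt reads from each end,
    # so the two reads share the middle 10 nt.
    fragment = "ACGTTGCAAGGCTTAACCGGATCGATTAGC"
    read1 = fragment[:20]
    read2 = reverse_complement(fragment[10:])
    print(find_overlap(read1, read2))
```

Applied to a sample of a few thousand pairs, `fraction_overlapping` gives a ballpark figure for how many pairs could be merged. A real merging tool should still be used for the actual merge, since it takes base qualities and read-through into account.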

written 4.9 years ago by vishal.koparde10
5.2 years ago by
Walnut Creek, USA
Brian Bushnell17k wrote:

This really depends on the assembler (as well as the merging program!).  I have found that merging reads improves Ray assemblies, and makes Soap assemblies dramatically worse.

written 5.2 years ago by Brian Bushnell17k