Question: Cap3 Integration Of Velvet And Newbler Assemblies
4
gravatar for tommivat
6.8 years ago by
tommivat240
Finland
tommivat240 wrote:

I am conducting de novo assembly of ~33Mb genome using 454 and Illumina reads. I cannot use MIRA, since I have ~80M Illumina reads (would require ~160Gb memory). So far I have found that it's usually most efficient to first assemble reads with Newbler and Velvet, respectively, and then combine the results using some third assembly program. I have been using CAP3 for the last step but I'm not satisfied with the results.

Statistics for the intermediate and final assemblies can be seen below. The problem is that CAP3 results are worse compared to the intermediate ones. It seems that CAP3 throws most of the contigs away. Two questions:

  • Should I use some specific options for CAP3 when conducting the final assembly
  • Are there any ready-made pipeline for doing this kind of 'integration' more effectively?

Statistics for the CAP3 output:

Number of contigs        826
Total size of contigs    5220088
Longest contig      37928
Mean contig size       6320
Median contig size       3734
N50 contig length      12593
L50 contig count        130

Statistics for Newbler output:

Number of contigs       1942
Total size of contigs   32110351
Longest contig     170575
Mean contig size      16535
Median contig size       8447
N50 contig length      37018
L50 contig count        272

Statistics for Velvet output:

Number of contigs       4939
Total size of contigs   34602711
Longest contig     134827
Mean contig size       7006
Median contig size       3463
N50 contig length      15446
L50 contig count        662
assembly velvet • 3.2k views
ADD COMMENTlink written 6.8 years ago by tommivat240
1
gravatar for avik
6.8 years ago by
avik60
avik60 wrote:

try Minimus2 , although it may not improve assembly statistics drastically

ADD COMMENTlink written 6.8 years ago by avik60
1
gravatar for SES
6.8 years ago by
SES8.2k
Vancouver, BC
SES8.2k wrote:

In addition to Minimus2, you may want to try Zorro, which is based on the same pipeline and uses MUMmer. I think CAP3 was designed for EST assembly and I have doubts about what it is doing with genomic contig assembly.

ADD COMMENTlink written 6.8 years ago by SES8.2k
1
gravatar for lexnederbragt
6.8 years ago by
lexnederbragt1.2k
Oslo, Norway
lexnederbragt1.2k wrote:

The newbler program from 454 can take in both 454 reads and illumina reads - have you tried that? See http://contig.wordpress.com/2011/01/21/newbler-input-ii-sequencing-reads-from-other-platforms/ (and maybe http://contig.wordpress.com/2011/09/01/newbler-input-iii-a-quick-fix-for-the-new-illumina-fastq-header/)

ADD COMMENTlink written 6.8 years ago by lexnederbragt1.2k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 831 users visited in the last hour