Question: Second assembly from existing contigs
gravatar for likoo27
13 days ago by
likoo270 wrote:

My promotor told to try and do the assembly from the first round of assemblies. What i have are contigs : 1. created using Nanopore reads and assemblied in Canu 2. contigs from Canu corrected with PE reads using Pilon 3. contigs from SPAdes assembly using PE reads 4. contigs from hybrid SPAdes using Nanopore and PE

But my question is how one could do that? And would this really give any better output?

assembly genome • 120 views
ADD COMMENTlink modified 12 days ago by colindaven830 • written 13 days ago by likoo270

which version of those has the best stats? (N50 etc) . Is it near to the expected?

Can you describe more elaborated what kind of data you all have? seq versions, #reads, coverage, ...

ADD REPLYlink written 13 days ago by lieven.sterck3.1k

I so sorry for not providing more information. Haploid geenome calculated with k=20 is 13,444,390 bp. As for the coverage I'm actually not sure how to check it especially after using canu and pilon. Should I map the reads back to the assembly? I just recently started with boinformatics so I still don't fully understand everything..

  1. #contigs 57, total length 15353986, %GC 48.87, N50 1113526, L50 5, complete BUSCO 73,79%
  2. #contigs 57, total length 15422215, %GC 48.94, N50 1118589, L50 5, complete BUSCO 94,83%
  3. #contigs 9297, total length 21499633, %GC 49.04, N50 3765, L50 1349, complete BUSCO 83.79%
  4. #contigs 3416, total length 21499633, %GC 48.98, N50 28419, L50 188, complete BUSCO 97.24%
ADD REPLYlink written 12 days ago by likoo270

As also pointed out by colindaven , I would go for #2, that one looks already more than decent.

You might indeed investigate though whether you might buff it up even more with the other three.

ADD REPLYlink written 12 days ago by lieven.sterck3.1k
gravatar for colindaven
12 days ago by
Hannover Medical School
colindaven830 wrote:

Theres a tool called metassembler which might help you out.

However, it's a difficult and inherently biased undertaking. I'd try to pick the two most complete assemblies for use. I.e. #2 and #4

Actually, #2 looks pretty decent by itself. Why not try to orientate this to a reference genome if one exists using a tool like Medusa ?

ADD COMMENTlink modified 12 days ago • written 12 days ago by colindaven830

Thank you, I will check out those tools

ADD REPLYlink written 12 days ago by likoo270
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1495 users visited in the last hour