De-Novo Genome Assembly Vs Comparative Genome Assembly
4
2
Entering edit mode
12.9 years ago
Saad Khan ▴ 440

Which of the two genome assembly methods is better (more accurate) in cases where there are many potential reference genome available for a particular newly sequenced genome. Also if all the reference genomes belong to same genus how many of them should be considered while carrying out genome assembly.

assembly genome • 6.9k views
ADD COMMENT
4
Entering edit mode
12.9 years ago
Michael Barton ★ 1.9k

The approach I used for our 454 sequence reads was de novo assembly followed mapping the resulting contigs against a reference genome using nucmer. I assemble de novo first to prevent any inversions or repeats in any reference genome making their way into my build.

One question is what effect do inversions present in a reference genome, but not in your genome, have on comparative assembly? I imagine it depends on the weighting comparative assembly software gives to read-overlap or paired-read data versus reference genome alignment. A software suite like AMOS does allow you to combine different genome builds together so it's worth experimenting perhaps?

ADD COMMENT
2
Entering edit mode
12.9 years ago

Both approaches have issues, and it is not possible to say if one is the better. De-novo assembly fails in repetitive regions. Mapping assembly miss segmental insertions and rearrangements. You can try both approaches and see how the results compare. To my knowledge there is no software that does this in one go - which is a shame.

ADD COMMENT
2
Entering edit mode

And, it depends on the organism! Some organisms are much easier to assemble than others, in one project, two bacteria in the same family had very different assembly results - one assembling into a couple of contigs, the other ending up very fragmented. I'm with the "try both" camp.

ADD REPLY
1
Entering edit mode
12.9 years ago
Felix ▴ 90

It also depends on the sequencing depth of the newly sequenced genome. If you only have a low coverage (maybe <6x) the de-novo assembly quality will be dubious. I believe combining the two approaches would be the better strategy in this case.

ADD COMMENT
0
Entering edit mode
10.7 years ago
Rohit ★ 1.5k

If you have time, try to run a de novo assembly and then try to map it to a Reference.

Segemehl is one tool that helps out in finding insertions and deletions.

http://www.bioinf.uni-leipzig.de/Software/segemehl/

ADD COMMENT

Login before adding your answer.

Traffic: 2655 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6