Question: efficient algorithm for de novo assembly using de bruijn graph
gravatar for StudentBio
2.5 years ago by
StudentBio0 wrote:

hello everebody please I am trying to assemble my reads using an algorithms for de novo assembly( De Bruijn Graph) and I don t know which algorith de novo more efficient in my case the optimal k using kmergenie is 17 Read Data : Ion Torrent, single end, percentage of GC 42, sequence length between 20 and 397

ADD COMMENTlink modified 2.5 years ago by colindaven2.6k • written 2.5 years ago by StudentBio0

What organism is it? How many reads do you have?

You can try SPAdes, which does it's own rounds of kmer optimisation.

ADD REPLYlink written 2.5 years ago by Joe18k

phoenix dactylifera genome length 22 953 390 bp ; het 4,25%

ADD REPLYlink written 2.5 years ago by StudentBio0

Ususally the is no "better" assembler, and each time you have a new dataset, a different assembler may turn out to be the best. In addition, more information would be useful in getting good suggestions about good assemblers for your case, such as expected genome size, expected sequencing coverage, ploidy and heterozigosity of the organism, among others. For example, some assemblers are optimized for small- or medium-sized genomes (such as SPAdes and MIRA), others are good for large genomes.

Are you assembling several genomes? On a different thread ( choice of k value for mapping my reads again a reference genome ), you found 81 as best kmer acording to KmerGenie.

ADD REPLYlink written 2.5 years ago by h.mon32k
gravatar for colindaven
2.5 years ago by
Hannover Medical School
colindaven2.6k wrote:

Thats a really poor dataset for de novo assembly as it contains only single end reads. Also, Ion torrent data contains lots of nasty indels which cause artificial frameshifts in predicted genes. Lastly, there is no long range information.

What coverage do you have ?

I would advise you to try soap2denovo and Spades. Potentially also Abyss.

By the way, shouldn't the genome be more like 600+ MBp ?

ADD COMMENTlink written 2.5 years ago by colindaven2.6k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1516 users visited in the last hour