Question

What is a "draft genome assembly"?

7.4 years ago

EVR ▴ 610

Hi,

Can someone explain what a draft genome assembly is? Does it mean that the genome was assembled de novo?

Genome_assembly de novo • 8.8k views

updated 7.4 years ago by Damian Kao 16k • written 7.4 years ago by EVR ▴ 610

score 2 · Answer 1 · 2016-11-25

I am sorry to be pedantic about this, but I think there is a deeper point to be made here about what "completeness" can mean in terms of genome assemblies.

All the genome assemblies we currently have for various organisms are really an amalgamation of the individuals, samples, or cells that were collected and sequenced. During assembly, the genotype most frequently observed in the collected sample is used, so the resulting assembly is really a chimeric genome. That is not to say this genome isn't useful: it is incredibly useful as a backbone for mapping reads for various purposes.
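To make the "chimeric" point concrete, here is a minimal sketch of a per-position majority vote. It is only an illustration, not how any particular assembler works: the function name `majority_consensus` and the three toy sequences are made up, and the input is assumed to be already aligned and of equal length.

```python
from collections import Counter


def majority_consensus(aligned_seqs):
    """Return the most frequent base at each column of equal-length, pre-aligned sequences."""
    if not aligned_seqs:
        return ""
    length = len(aligned_seqs[0])
    if any(len(seq) != length for seq in aligned_seqs):
        raise ValueError("all sequences must have the same length")

    consensus = []
    for column in zip(*aligned_seqs):
        # most_common(1) gives the (base, count) pair with the highest count;
        # ties go to the base seen first in the column.
        base, _count = Counter(column).most_common(1)[0]
        consensus.append(base)
    return "".join(consensus)


# Three hypothetical individuals (toy data, assumed already aligned).
samples = ["ACGTA", "ACCTG", "TCGTG"]
print(majority_consensus(samples))  # ACGTG
```

Note that the consensus `ACGTG` does not match any of the three input sequences, which is the sense in which the assembled genome is chimeric.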

So if we are referring to completeness of information about the organism, we would also need to know the population-level variants seen in the genomes of individuals. Plenty of research projects and consortia are working hard on gathering variant data (e.g. the 1000 Genomes Project). There are also proposals by well-regarded bioinformaticians to start using graph-based file formats to represent genomes so that we can take advantage of the extra variant information (e.g. the GFA format, the HISAT2 mapper).

If we are referring to completeness of the genome assembly itself, we are usually talking about metrics such as scaffold length, conservation of genes, and number of gaps, which measure how well the assembler performed on the given data. These are more a technical measure of algorithm performance.
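As a rough sketch of what such technical metrics look like, the snippet below summarises scaffold lengths and counts gaps. It rests on a few assumptions: scaffolds are given as plain strings, a gap is any run of `N` characters, and N50 is used as one common scaffold-length summary. The function name `assembly_metrics` and the toy scaffolds are hypothetical, and gene conservation is not covered because it needs an external gene set.

```python
import re


def assembly_metrics(scaffolds):
    """Summarise scaffold lengths and gap counts for a list of scaffold sequences."""
    lengths = sorted((len(s) for s in scaffolds), reverse=True)
    total = sum(lengths)

    # N50: walking from the longest scaffold down, the length of the scaffold
    # at which the running sum first reaches half of the total assembly length.
    n50 = 0
    running = 0
    for length in lengths:
        running += length
        if running * 2 >= total:
            n50 = length
            break

    # Assumption: each uninterrupted run of N/n counts as one gap.
    num_gaps = sum(len(re.findall(r"[Nn]+", s)) for s in scaffolds)

    return {
        "num_scaffolds": len(lengths),
        "total_length": total,
        "longest": lengths[0] if lengths else 0,
        "n50": n50,
        "num_gaps": num_gaps,
    }


# Toy scaffolds, made up for illustration.
toy = ["ACGTNNNNACGT", "ACGTAC", "ACNNGTNA", "ACG"]
print(assembly_metrics(toy))
# {'num_scaffolds': 4, 'total_length': 29, 'longest': 12, 'n50': 8, 'num_gaps': 3}
```

A draft assembly would typically show many short scaffolds and many gaps on numbers like these, but the figures only describe how the assembly run went, not how well the sequence represents the population.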

score 1 · Answer 2 · 2016-11-25

7.4 years ago

Matteo Schiavinato ★ 3.6k

You will find everything you need here: http://onlinelibrary.wiley.com/doi/10.1111/eva.12178/full

"every draft genome assembly constitutes merely a hypothesis of the true underlying genome sequence"

score 1 · Answer 3 · 2016-11-25

7.4 years ago

mastal511 ★ 2.1k

Essentially it means that the genome assembly is just a preliminary result, and more work would need to be done to generate a more complete and accurate version of the genome.
