What is a "draft genome assembly"?
7.4 years ago
EVR ▴ 610

Hi,

Can someone explain what a draft genome assembly is? Does it mean the genome was assembled de novo?

Genome_assembly de novo • 8.8k views
7.4 years ago

I am sorry to be pedantic about this, but I think there is a deeper point to be made here about what "completeness" can mean in terms of genome assemblies.

Every genome assembly we currently have, for any organism, is really an amalgamation of the individuals, samples, or cells that were collected and sequenced. During the assembly process, the genotype most frequently observed in the collected sample is used, so the resulting assembly is really a chimeric genome. That is not to say this genome isn't useful: it is incredibly useful as a backbone for mapping reads for various purposes.
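To make the "chimeric" point concrete, here is a toy sketch of a per-position majority vote. It assumes the sequences are already aligned and of equal length, and the function name `majority_consensus` is made up for this illustration. Real assemblers are far more involved, so treat this only as a picture of the idea.

```python
from collections import Counter


def majority_consensus(aligned):
    """Return the per-column majority base of equal-length aligned sequences.

    Gap characters ('-') are ignored; a column with no bases yields 'N'.
    Ties are resolved in favour of the base seen first in that column.
    """
    if not aligned:
        return ""
    width = len(aligned[0])
    if any(len(seq) != width for seq in aligned):
        raise ValueError("all aligned sequences must have the same length")

    consensus = []
    for column in zip(*aligned):
        bases = [base for base in column if base != "-"]
        if not bases:
            consensus.append("N")
            continue
        consensus.append(Counter(bases).most_common(1)[0][0])
    return "".join(consensus)


# Three hypothetical individuals, each differing at one position.
samples = ["AAGT", "ACGA", "TCGT"]
consensus = majority_consensus(samples)
print(consensus)
print(consensus in samples)
```

In this toy input the consensus is not identical to any of the three individuals, which is what I mean by the assembly being an amalgamation.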

So if we are referring to completeness of information about the organism, we would also need to know the population-level variants seen in the genomes of individuals. Plenty of research projects and consortia are working hard on gathering variant data (e.g. the 1000 Genomes Project). There are also proposals by well-regarded bioinformaticians to start using graph-based representations of genomes so that we can take advantage of the extra variant information (e.g. the GFA format, the HISAT2 mapper).

If we are referring to completeness of the genome assembly itself, we are usually talking about metrics such as scaffold length, conservation of genes, and number of gaps, which measure how well the assembler performed on the data it was given. These are more of a technical measure of algorithm performance.
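As a rough illustration of the length and gap metrics, the sketch below computes N50 (a common scaffold-length summary) and counts gaps as runs of N. The scaffold names and sequences are invented toy data, and the helper names `n50` and `count_gaps` are my own; dedicated assembly-evaluation tools report these numbers in practice.

```python
import re


def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    cover at least half of the total assembly length.
    """
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    return 0  # empty input


def count_gaps(sequence):
    """Count gaps, defined here as maximal runs of N (case-insensitive)."""
    return len(re.findall(r"[Nn]+", sequence))


# Toy scaffolds (hypothetical data, not from a real assembly).
scaffolds = {
    "scaffold_1": "ACGT" * 5 + "N" * 10 + "ACGT" * 5,
    "scaffold_2": "ACGT" * 5,
    "scaffold_3": "AC" + "NNN" + "GT" + "NN" + "ACGTA",
}

lengths = [len(seq) for seq in scaffolds.values()]
total_gaps = sum(count_gaps(seq) for seq in scaffolds.values())

print("total length:", sum(lengths))
print("N50:", n50(lengths))
print("gaps:", total_gaps)
```

A higher N50 and fewer gaps generally indicate a more contiguous assembly, but neither says anything about whether the sequence is actually correct, which is why gene-conservation checks are usually reported alongside them.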

7.4 years ago

You will find everything that you need in here: http://onlinelibrary.wiley.com/doi/10.1111/eva.12178/full

"every draft genome assembly constitutes merely a hypothesis of the true underlying genome sequence"

7.4 years ago
mastal511 ★ 2.1k

Essentially it means that the genome assembly is just a preliminary result, and more work would need to be done to generate a more complete and accurate version of the genome.
