Question: How are haplotypes/heterozygosity resolved in sequence assembly?
gravatar for DNAlias
13 months ago by
DNAlias10 wrote:

I am under the impression that many sequencing assemblers are unable to resolve heterozygosity, and account for it by either separating each variant into different contigs, or the two are fused into hybrid of the two variants.

1) Which of these outcomes is preferable and why?

2) I know that there are variant calling pipelines that require a reference genome, is there a way to recognize alleles during de novo assembly?

assembly • 267 views
ADD COMMENTlink modified 13 months ago by Vitis2.4k • written 13 months ago by DNAlias10
gravatar for Vitis
13 months ago by
New York
Vitis2.4k wrote:

I think the ultimate goal for assembling a heterozygous genome is to fully resolve the two haplotypes, essentially two genome assemblies. Platanus seems to be doing a fairly job dealing with heterozygous genomes. Also, long-read sequencing technologies like Nanopore and PacBio would enable variant phasing and resolution of alleles over long distance. Sometimes, the genetic trick of "trio binning" would also help. Basically, you sequence two parents plus the F1 offspring, so you are able to use the parental variant information to partition the offspring reads into haplotypes and do two assemblies simultaneously.

ADD COMMENTlink written 13 months ago by Vitis2.4k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1510 users visited in the last hour