Question: How are haplotypes/heterozygosity resolved in sequence assembly?
gravatar for DNAlias
5 months ago by
DNAlias0 wrote:

I am under the impression that many sequencing assemblers are unable to resolve heterozygosity, and account for it by either separating each variant into different contigs, or the two are fused into hybrid of the two variants.

1) Which of these outcomes is preferable and why?

2) I know that there are variant calling pipelines that require a reference genome, is there a way to recognize alleles during de novo assembly?

assembly • 176 views
ADD COMMENTlink modified 5 months ago by Vitis2.3k • written 5 months ago by DNAlias0
gravatar for Vitis
5 months ago by
New York
Vitis2.3k wrote:

I think the ultimate goal for assembling a heterozygous genome is to fully resolve the two haplotypes, essentially two genome assemblies. Platanus seems to be doing a fairly job dealing with heterozygous genomes. Also, long-read sequencing technologies like Nanopore and PacBio would enable variant phasing and resolution of alleles over long distance. Sometimes, the genetic trick of "trio binning" would also help. Basically, you sequence two parents plus the F1 offspring, so you are able to use the parental variant information to partition the offspring reads into haplotypes and do two assemblies simultaneously.

ADD COMMENTlink written 5 months ago by Vitis2.3k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1140 users visited in the last hour