How do variant callers get genotype info?
2
2
Entering edit mode
5.9 years ago
afzm ▴ 20

How are variant callers able to compute which copy of the chrs in case of a diploid species has the heterozygous detected variants? What information do they use, just paired end reads?

I could find some statistical data in GATK webpage, but I would like to understand if there is other information used, the rationale behind it the accuracy it would have and the factors that affect this process.

Thank you very much

variant calling SNP indel phasing • 1.2k views
ADD COMMENT
3
Entering edit mode
5.9 years ago

Usually the caller has no idea which chromosome homolog a variant is on. It can just see variants that are in the same read or read pair (unlikely for short reads) or it can try to infer which variants are on the same chromosome homolog (phased) using read-backed phasing (as part of the read assembly performed by the haplotype caller).

These in silico methods are spotty at best. Most people who need phasing just use a long-read technology, or they sequence the parents.

https://software.broadinstitute.org/gatk/documentation/tooldocs/3.8-0/org_broadinstitute_gatk_tools_walkers_phasing_ReadBackedPhasing.php

https://www.illumina.com/techniques/sequencing/dna-sequencing/whole-genome-sequencing/phased-sequencing.html

ADD COMMENT
1
Entering edit mode
5.9 years ago

Mathematical Notes on SAMtools Algorithms

https://software.broadinstitute.org/gatk/media/docs/Samtools.pdf

.... good luck...

ADD COMMENT

Login before adding your answer.

Traffic: 1640 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6