How To Calculate Genotype Likelihood Using Quality Scores For Reads
10.5 years ago
Mcmahanl ▴ 300

After searching Google and reading several articles, I still don't understand how to calculate the genotype likelihood for a locus from the read quality scores using simple probability. The probability notation in those articles is hard for me to follow, so I can't work out how to actually do the calculation.

For example, at a given locus the reference sequence has T. Six reads cover the locus: 4 reads have T and 2 reads have G (with quality score = 10).

If the true genotype at this locus is [T,T], how does one calculate the probability of the observed data given this genotype, i.e. P(D | [T, T]), where D is the read data at the locus?

ngs • 7.0k views
10.5 years ago

See Heng Li's "Mathematical Notes on SAMtools Algorithms", section 4.4 "Likelihood of data given genotype": http://www.broadinstitute.org/gatk/media/docs/Samtools.pdf
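For the example in the question, the calculation can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the SAMtools implementation: reads are treated as independent, a Phred score Q is converted to an error probability of 10^(-Q/10), a sequencing error is assumed to turn the true base into the other allele (a two-allele simplification), and all six reads are assumed to have Q = 10 (the question states the quality only for the G reads). The function names are made up for this sketch.

```python
def phred_to_error(q):
    """Convert a Phred quality score to an error probability."""
    return 10 ** (-q / 10)


def genotype_likelihood(reads, genotype):
    """Return P(D | genotype) for a diploid genotype.

    reads    -- list of (base, phred_quality) tuples observed at the locus
    genotype -- tuple of two alleles, e.g. ("T", "T")

    Assumptions: reads are independent, each read comes from either
    chromosome with probability 1/2, and an error always produces the
    other allele.
    """
    likelihood = 1.0
    for base, qual in reads:
        err = phred_to_error(qual)
        # Average over the two chromosomes the read could have come from:
        # the read matches that allele with probability 1 - err,
        # and mismatches it with probability err.
        per_read = sum((1 - err) if base == allele else err
                       for allele in genotype) / 2
        likelihood *= per_read
    return likelihood


# 4 reads with T and 2 reads with G; Q = 10 for every read is an assumption.
reads = [("T", 10)] * 4 + [("G", 10)] * 2

for gt in [("T", "T"), ("T", "G"), ("G", "G")]:
    print(gt, genotype_likelihood(reads, gt))
```

With Q = 10 the error probability is 0.1, so for [T,T] each T read contributes 0.9 and each G read contributes 0.1, giving P(D | [T,T]) = 0.9^4 × 0.1^2 ≈ 0.0066. For the heterozygote [T,G] every read contributes 0.5, giving 0.5^6 ≈ 0.0156, and for [G,G] the result is 0.1^4 × 0.9^2 ≈ 0.00008. These are likelihoods of the data under each genotype, not posterior probabilities; turning them into genotype probabilities would additionally require a prior.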


Thank you so much, Pierre Lindenbaum. This is really helpful.
