Interpretation of '1|2' in GT field and '0|1' in PGT field
22 months ago
Choi ▴ 10

Hello, I'm a beginner in bioinformatics. Recently, I called SNPs from Illumina sequencing data with GATK HaplotypeCaller using the diploid option, and then ran GATK GenotypeGVCFs to get a GVCF file. I have a question about the GT and PGT fields.

ctg000220_np1212    1296462 .   G   *,A 2242.98 .   AC=1,1;AF=0.500,0.500;AN=2;DP=79;ExcessHet=0.0000;FS=0.000;MLEAC=1,1;MLEAF=0.500,0.500;MQ=58.81;QD=32.40;SOR=1.092  GT:AD:DP:GQ:PGT:PID:PL:PS   1|2:0,9,54:63:99:0|1:1296462_G_*:3015,2275,2240,405,0,216:1296462

I have attached one representative SNP from my GVCF file. My interpretation is that 'G' is the reference allele and that there are two alternate alleles (a deletion, shown as '*', and 'A'). The GT field is 1|2, which I take to mean that, of the two parental chromosomes, one carries the deletion (G->*) and the other carries the substitution (G->A). My question is: why is the PGT field 0|1? I cannot understand why there is a 0 in the PGT field, since both alleles called at this position are alternate alleles.
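To make this reading of the record easy to check, here is a minimal Python sketch (plain string handling, not part of GATK or any VCF library) that decodes the GT, AD and PL values of the sample column shown above. The variable names are my own, and the PL genotype order used is the standard VCF ordering for a diploid site (0/0, 0/1, 1/1, 0/2, 1/2, 2/2).

```python
# Values copied from the record above; variable names are illustrative only.
ref = "G"
alts = "*,A".split(",")
fmt_keys = "GT:AD:DP:GQ:PGT:PID:PL:PS".split(":")
sample_vals = "1|2:0,9,54:63:99:0|1:1296462_G_*:3015,2275,2240,405,0,216:1296462".split(":")

# Allele index 0 is REF; indices 1..n are the ALT alleles in order.
alleles = [ref] + alts
fields = dict(zip(fmt_keys, sample_vals))

# GT: translate allele indices into the alleles they point to.
gt_indices = [int(i) for i in fields["GT"].replace("/", "|").split("|")]
gt_alleles = [alleles[i] for i in gt_indices]

# AD: read depth supporting each allele, in the same order as `alleles`.
allele_depths = dict(zip(alleles, (int(x) for x in fields["AD"].split(","))))

# PL: one phred-scaled likelihood per unordered diploid genotype,
# in VCF order (0/0, 0/1, 1/1, 0/2, 1/2, 2/2); the lowest value is the best call.
genotypes = [(j, k) for k in range(len(alleles)) for j in range(k + 1)]
pls = [int(x) for x in fields["PL"].split(",")]
best_genotype = genotypes[pls.index(min(pls))]

print(gt_alleles)      # alleles named by GT
print(allele_depths)   # per-allele read support
print(best_genotype)   # genotype with the lowest PL
```

With this record, GT resolves to the '*' and 'A' alleles, no reads support 'G', and the lowest PL belongs to genotype 1/2, which agrees with my interpretation of the GT field.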

As far as I know, the PGT field describes how the alternate alleles are phased in relation to one another (https://gatk.broadinstitute.org/hc/en-us/articles/360050354712-What-is-physical-phasing-). How can a PGT value of 0|1 describe the phase of the two alternate alleles at this position?
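For comparison, the phasing-related fields of the same record can be pulled out side by side. This is only a sketch in plain Python: the variable names are mine, and splitting PID on '_' assumes the position_REF_ALT layout that the value appears to have in this record.

```python
# Values copied from the record above; variable names are illustrative only.
pos = 1296462
ref = "G"
alts = ["*", "A"]
fmt_keys = "GT:AD:DP:GQ:PGT:PID:PL:PS".split(":")
sample_vals = "1|2:0,9,54:63:99:0|1:1296462_G_*:3015,2275,2240,405,0,216:1296462".split(":")
fields = dict(zip(fmt_keys, sample_vals))

# GT and PGT both use '|' here, but they are separate fields with separate values.
gt = fields["GT"].split("|")
pgt = fields["PGT"].split("|")

# Assumption: PID looks like <position>_<REF>_<ALT>, as it does in this record.
pid_pos, pid_ref, pid_alt = fields["PID"].split("_")

# Does the PID point at this very record (same POS, REF and first ALT)?
pid_matches_record = int(pid_pos) == pos and pid_ref == ref and pid_alt == alts[0]

# Does the phase-set field PS carry the same position?
ps_matches_record = int(fields["PS"]) == pos

print(gt, pgt)
print(pid_matches_record, ps_matches_record)
```

For this record, both PID and PS refer back to the record's own position, while GT and PGT hold different index pairs; that difference is exactly what I am asking about.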

Thank you for reading my question!

GVCF calling GATK SNP VCF • 475 views