Genotype representation with 0, 1, 2 - what do they mean?
1
2
Entering edit mode
3.4 years ago
Volka ▴ 160

Hello all, I am currently learning about phasing and imputation, and I have come across a few ways of representing genotypes. As I understand it, 0 refers to the reference allele while 1 the alternate allele. 0/0 refers to homozygous for the reference allele, 1/1 would be homozygous for the alternate allele, while 0/1 and 1/0 would be the heterozygotes.

However, I have also come across genotypes represented by a single 0, 1 or 2, and here is where I am confused. It's a bit hard to find info for this, but it seems that 0 and 1 refers to the homozygote genotypes, while 2 refers to the heterozygote genotypes, is this correct?

Which of 0 or 1 would be homozygous for the reference and which for the alternate allele? When denoting a genotype as 2, would it not matter if its 0/1 or 1/0? Thanks very much!

genotype representation phasing imputation gwas • 3.6k views
ADD COMMENT
2
Entering edit mode

I have also come across genotypes represented by a single 0, 1 or 2

in a VCF these are haploid regions. For a diploid organism that could be a haploid region where there is a deletion.

for the other formats & diploid organism, it's usually: 0:HOM_REf, 1:HET 2: HOM_VAR

ADD REPLY
0
Entering edit mode
5 weeks ago
Mwangana • 0

My response is certainly late, but it might benefit someone... I have come across that coding with respect to dosage of the minor allele e.g. If we have a biallelic loci with T and G, and say G is a minor allele, the 3 possible genotypes will be coded as follows: TT=0, TG=1, and GG = 2

ADD COMMENT

Login before adding your answer.

Traffic: 2650 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6