The interpretation of SV output file
1
0
Entering edit mode
7.5 years ago
nina • 0

I am working on structural variant calling. However, the output file include SVs which have genotype(0/0) with consensus sequence. I know "0/1" means heterozygous mutations and "1/1" means homozygous mutations. Also I know "0/0" means missing alleles. I have thought missing alleles means that it could not detect specific location of the SV event. Does anyone can explain why structural variants that are called with specific sequence have "0/0"?

Structural variant calling • 1.9k views
ADD COMMENT
2
Entering edit mode
7.5 years ago
d-cameron ★ 2.9k

For VCF files, 0 indicates the reference allele, whereas . indicates a missing allele. According to the VCF file format specifications 0/0 means an (unphased) diploid reference at the position in question - ie the SV is not present.

That said, genotyping is problematic for SVs as the genotyping assumption of a given ploidy does not necessarily hold true. There's no guidance in the VCF specification about what tools are supposed to do in such cases so it's up to the tool developers to make a guess, not just write genotype information.

Does anyone can explain why structural variants that are called with specific sequence have "0/0"?

If you have multiple samples in the same VCF file, 0/0 would indicate that the SV in not present in that sample.

ADD COMMENT

Login before adding your answer.

Traffic: 2375 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6