Question: How to get GQX annotation in vcf file
0
gravatar for Clare
2.7 years ago by
Clare150
United States
Clare150 wrote:

I need to analyze both variant and nonvariant sites from a vcf. I produced this with GaTK Haplotype caller (v 3.7), but I'm struggling to know how to filter the variant and nonvariant sites 'fairly'. Illumina states you can do this using the GQX value, but my GATK gvcfs and vcfs don't have that annotation. Does anyone know if it is only the isaac variant caller software that produces GQX values? And is it possible for me to calculate it post hoc?

-- Here is the description of GQX from illumina: Genotype Quality for Variant and Non-variant Sites

The gVCF file uses an adapted version of genotype quality for variant and non-variant site filtration. This value is associated with the key GQX. The GQX value is intended to represent the minimum of {Phred genotype quality assuming the site is variant, Phred genotype quality assuming the site is non-variant}. The reason for using this value is to allow a single value to be used as the primary quality filter for both variant and non-variant sites. Filtering on this value corresponds to a conservative assumption appropriate for applications where reference genotype calls must be determined at the same stringency as variant genotypes, i.e.:

• An assertion that a site is homozygous reference at GQX >= 30 is made assuming the site is variant. • An assertion that a site is a non-reference genotype at GQX >= 30 is made assuming the site is non-variant.

filtering snp • 1.1k views
ADD COMMENTlink modified 2.1 years ago by Kevin Blighe65k • written 2.7 years ago by Clare150

Hi Clare, I have a similar question. Were you able to find a solution?

ADD REPLYlink written 2.1 years ago by nute11a20
0
gravatar for Kevin Blighe
2.1 years ago by
Kevin Blighe65k
Kevin Blighe65k wrote:

Is GQX not just the minimum of the QUAL and FORMAT::GQ values? You can just filter using a single value using BCFtools. Here, I require that, if either QUAL or GQ has a value below 30, then exclude the variant.

bcftools filter --exclude 'QUAL<30 || GQ<30' Variants.vcf

That does the same as filtering on FORMAT::GQX

Kevin

ADD COMMENTlink written 2.1 years ago by Kevin Blighe65k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 627 users visited in the last hour