Question: How to get GQX annotation in vcf file
0
gravatar for Clare
14 months ago by
Clare120
United States
Clare120 wrote:

I need to analyze both variant and nonvariant sites from a vcf. I produced this with GaTK Haplotype caller (v 3.7), but I'm struggling to know how to filter the variant and nonvariant sites 'fairly'. Illumina states you can do this using the GQX value, but my GATK gvcfs and vcfs don't have that annotation. Does anyone know if it is only the isaac variant caller software that produces GQX values? And is it possible for me to calculate it post hoc?

-- Here is the description of GQX from illumina: Genotype Quality for Variant and Non-variant Sites

The gVCF file uses an adapted version of genotype quality for variant and non-variant site filtration. This value is associated with the key GQX. The GQX value is intended to represent the minimum of {Phred genotype quality assuming the site is variant, Phred genotype quality assuming the site is non-variant}. The reason for using this value is to allow a single value to be used as the primary quality filter for both variant and non-variant sites. Filtering on this value corresponds to a conservative assumption appropriate for applications where reference genotype calls must be determined at the same stringency as variant genotypes, i.e.:

• An assertion that a site is homozygous reference at GQX >= 30 is made assuming the site is variant. • An assertion that a site is a non-reference genotype at GQX >= 30 is made assuming the site is non-variant.

filtering snp • 464 views
ADD COMMENTlink modified 7 months ago by Kevin Blighe39k • written 14 months ago by Clare120

Hi Clare, I have a similar question. Were you able to find a solution?

ADD REPLYlink written 7 months ago by nute11a0
0
gravatar for Kevin Blighe
7 months ago by
Kevin Blighe39k
Republic of Ireland
Kevin Blighe39k wrote:

Is GQX not just the minimum of the QUAL and FORMAT::GQ values? You can just filter using a single value using BCFtools. Here, I require that, if either QUAL or GQ has a value below 30, then exclude the variant.

bcftools filter --exclude 'QUAL<30 || GQ<30' Variants.vcf

That does the same as filtering on FORMAT::GQX

Kevin

ADD COMMENTlink written 7 months ago by Kevin Blighe39k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 952 users visited in the last hour