Should I Separate The Snp And Indel When Performing The Gatk Vqsr?
1
2
Entering edit mode
11.4 years ago
ugoodlfy ▴ 30

Hi All, I have used GATK UnifiedGenotyper to generate a raw.vcf file. Now I want to use GATK VQSR to get a more accurate result ,and I follow this protocol:

  • snp.model <- BuildErrorModelWithVQSR(raw.vcf, SNP);
  • indel.model <- BuildErrorModelWithVQSR(raw.vcf, INDEL);
  • recalibratedSNPs.rawIndels.vcf <- ApplyRecalibration(raw.vcf, snp.model, SNP);
  • analysisReady.vcf <- ApplyRecalibration(recalibratedSNPs.rawIndels.vcf, indel.model, INDEL);

I wanna know will it be better if I seperate the SNP and INDEL when I perform VQSR, like this:

  • SNP.raw.vcf , INDEL.raw.vcf <- Seperate_SNP_INDEL(raw.vcf);
  • snp.model <- BuildErrorModelWithVQSR(SNP.raw.vcf, SNP);
  • indel.model <- BuildErrorModelWithVQSR( INDEL.raw.vcf, INDEL);
  • SNP_analysisReady.vcf <- ApplyRecalibration(SNP.raw.vcf, snp.model, SNP);
  • INDEL_analysisReady.vcf <- ApplyRecalibration(INDEL.raw.vcf, INDEL.model, SNP);

Thanks a lot !

gatk • 4.9k views
ADD COMMENT
3
Entering edit mode
11.4 years ago

GATK best practices recommend running VQSR separately on SNPs and indels:

The UnifiedGenotyper uses a fundamentally different likelihood model when calling different classes of variation and so therefore the VQSR must be run separately for SNPs and INDELs to build separate adaptive error models

From GATK v4 best practices (although this was also true for earlier versions). See phase III, section 2:

http://www.broadinstitute.org/gatk/guide/article?id=1186

ADD COMMENT
1
Entering edit mode

Removed my answer, since Brad's is much more obviously correct. It's interesting how separating SNPs and INDELs are not even mentioned in the documentation I've been following: http://gatkforums.broadinstitute.org/discussion/39/variant-quality-score-recalibration

Looks like they ought to be separated at this stage after all.

ADD REPLY
0
Entering edit mode

Alex, definitely agreed. The VQSR documentation page is SNP specific, but that's not explicitly stated.

ADD REPLY

Login before adding your answer.

Traffic: 2029 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6