GATK3 error while performing local realignment for exome
1
0
Entering edit mode
6.0 years ago
shuksi1984 ▴ 60

How to fix the following error?

ERROR MESSAGE: Fasta index file /data/ngs/references/hg38gatkbundle/Homo_sapiens_assembly38.known_indels.vcf.fai for reference /data/ngs/references/hg38gatkbundle/Homo_sapiens_assembly38.known_indels.vcf does not exist

My command is:

java -jar path/to/gatk3.8.jar -T RealignerTargetCreator -R /path/to/hg38gatkbundle/Homo_sapiens_assembly38.known_indels.vcf -known /path/to/hg38gatkbundle/Mills_and_1000G_gold_standard.indels.hg38.vcf -I HG100.bam -L path/to/hg38gatkbundle/exons.hg38.bed -o HG100.IndelRealigner.intervals

NOTE: I am using hg38gatkbundle as my reference genome. hg38gatkbundle directory does not have any Homo_sapiens_assembly38.known_indels.vcf.fai for Homo_sapiens_assembly38.known_indels.vcf file

Also, exons.hg38.bed is not there in hg38gatkbundle directory

next-gen sequencing software error • 1.3k views
ADD COMMENT
1
Entering edit mode
6.0 years ago

The -R option takes a fasta file holding the reference sequence, not a VCF. In fact the only VCF that would normally be used here is the one you mentioned with --known.

As an aside, there's little purpose to realigning around indels unless you're using the old UnifiedGenotyper.

ADD COMMENT
0
Entering edit mode

Thanks, I changed the command accordingly and it worked. My command is :

java -jar /path/to/gatk3.8.jar -T RealignerTargetCreator -R /path/to/Homo_sapiens_assembly38.fasta -known/path/to/Homo_sapiens_assembly38.known_indels.vcf -known /path/to/Mills_and_1000G_gold_standard.indels.hg38.vcf -I HG100.bam -o HG100.IndelRealigner.intervals

But, still I didnt get the mystery of exons.hg38.bed. In the above command I removed it and it worked.

ADD REPLY
0
Entering edit mode

The -L option is just restricting the program to operate over a given set of regions, it's not strictly needed though it might speed things up a bit.

ADD REPLY
0
Entering edit mode

there's little purpose to realigning around indels unless you're using the old UnifiedGenotyper.

It could still be useful for non-GATK variant callers.

ADD REPLY

Login before adding your answer.

Traffic: 2507 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6