Question: GATK3 error while performing local realignment for exome
0
gravatar for shuksi1984
11 months ago by
shuksi198440
shuksi198440 wrote:

How to fix the following error?

ERROR MESSAGE: Fasta index file /data/ngs/references/hg38gatkbundle/Homo_sapiens_assembly38.known_indels.vcf.fai for reference /data/ngs/references/hg38gatkbundle/Homo_sapiens_assembly38.known_indels.vcf does not exist

My command is:

java -jar path/to/gatk3.8.jar -T RealignerTargetCreator -R /path/to/hg38gatkbundle/Homo_sapiens_assembly38.known_indels.vcf -known /path/to/hg38gatkbundle/Mills_and_1000G_gold_standard.indels.hg38.vcf -I HG100.bam -L path/to/hg38gatkbundle/exons.hg38.bed -o HG100.IndelRealigner.intervals

NOTE: I am using hg38gatkbundle as my reference genome. hg38gatkbundle directory does not have any Homo_sapiens_assembly38.known_indels.vcf.fai for Homo_sapiens_assembly38.known_indels.vcf file

Also, exons.hg38.bed is not there in hg38gatkbundle directory

ADD COMMENTlink modified 11 months ago by Devon Ryan88k • written 11 months ago by shuksi198440
1
gravatar for Devon Ryan
11 months ago by
Devon Ryan88k
Freiburg, Germany
Devon Ryan88k wrote:

The -R option takes a fasta file holding the reference sequence, not a VCF. In fact the only VCF that would normally be used here is the one you mentioned with --known.

As an aside, there's little purpose to realigning around indels unless you're using the old UnifiedGenotyper.

ADD COMMENTlink written 11 months ago by Devon Ryan88k

Thanks, I changed the command accordingly and it worked. My command is :

java -jar /path/to/gatk3.8.jar -T RealignerTargetCreator -R /path/to/Homo_sapiens_assembly38.fasta -known/path/to/Homo_sapiens_assembly38.known_indels.vcf -known /path/to/Mills_and_1000G_gold_standard.indels.hg38.vcf -I HG100.bam -o HG100.IndelRealigner.intervals

But, still I didnt get the mystery of exons.hg38.bed. In the above command I removed it and it worked.

ADD REPLYlink written 11 months ago by shuksi198440

The -L option is just restricting the program to operate over a given set of regions, it's not strictly needed though it might speed things up a bit.

ADD REPLYlink written 11 months ago by Devon Ryan88k

there's little purpose to realigning around indels unless you're using the old UnifiedGenotyper.

It could still be useful for non-GATK variant callers.

ADD REPLYlink written 11 months ago by igor7.4k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2307 users visited in the last hour