Speed up GATK joint genotype calling?
1
0
Entering edit mode
3.8 years ago
jtwalker ▴ 20

I'm using GATK's GenotypeGVCFs tool to jointly genotype ~1000 samples. It's my understanding that because of the genome wide annotations that are calculated, I can't speed things up by using CombineVCFs on smaller jointly called groups. Am I correct?

Is there some way to speed up my joint genotyping with GATK?

Thanks!

edit: The reference genome we're using is only composed of scaffolds. We're worried that because our genome is only composed of scaffolds that this might be slowing things down. Is this thinking correct, or does GATK not care?

gatk GATK Joint Genotyping • 2.4k views
ADD COMMENT
0
Entering edit mode

you can also parallelize things using each contig and the option -L

ADD REPLY
0
Entering edit mode
3.8 years ago
Medhat 8.9k

Parallelism options

This tool can be run in multi-threaded mode using this option.

TreeReducible (-nt)
  

for more details -nt

ADD COMMENT
0
Entering edit mode

I had considered this, but the gatk doc says that we'd need to have the same amount of ram available for each thread. As our ram usage is already a concern, would this still be a viable option?

ADD REPLY
0
Entering edit mode

The idea of threading is that all threads shares the same amount of memory (in contrast to multiprocessing), So my expectation that it should be Ok.

ADD REPLY

Login before adding your answer.

Traffic: 1918 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6