Interval list for GATK tool Haplotype Caller
2
0
Entering edit mode
3.9 years ago
jeremy • 0

Hello, I am currently using GATK's tool haplotypecaller to do variant discovery for some RNA-seq data. The is a very long running process so I have been looking at how to ways to optimize speed. It is mentioned that you can pass an interval list to HaplotypeCaller to speed up performance. It mentions that you can pass a vcf as the interval list to use. I am wondering if it is appropriate to use a reference vcf such as Ensembl's GRCH38.vcf file, as this will be intervals for genes and my variant discovery will only be looking within genes since that is the nature of the RNA-seq data.

Does make sense to use? I cant find much in the docs about what kind of interval list to use for RNA-seq data, it is mostly about whole genome or targeted exome. If that vcf is not proper to use, what interval list or how do I create an interval list to be used by HaplotypeCaller to speed up processing for this RNAseq data

RNA-Seq SNP • 2.6k views
ADD COMMENT
0
Entering edit mode

Yes, you can specify a interval list, just use the -L option and point to a BED file containing your intervals!

ADD REPLY
0
Entering edit mode
3.9 years ago
abedkurdi10 ▴ 190

As I know, intervals in HaplotypeCaller are BED files (Genomic positions). You specify the regions where to call variants.

ADD COMMENT
0
Entering edit mode

However, my question is for RNAseq, are the intervals just going to be every gene, so for example a BED file of the entire transcriptome?

ADD REPLY
0
Entering edit mode

Yes, for every gene or for genes of interest

ADD REPLY

Login before adding your answer.

Traffic: 3193 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6