How to slice a gvcf file with tabix?
1
0
Entering edit mode
9 weeks ago
Sd • 0

I am using the following tabix command to slice a GVCF file, but it is not creating sliced GVCF files. I have a bed file for the hg38 genome reference with 1MB regions and 1kb padding, and I want to slice using the bed file. I was wondering if you could help me with how to do it?

./tabix -R regions.bed -p vcf sample.g.vcf.gz > sample_sliced.gz
Tabix bed gvcf • 520 views
ADD COMMENT
0
Entering edit mode
9 weeks ago

why would you need to slice a gvcf file ? all the tools in gatk can use a BED / interval / chromosome to narrow the analysis

ADD COMMENT
0
Entering edit mode

I want to run gatk GenomicsDBImport tool to build a GenomicsDB. I want to slice it to reduce the computation time since I have 1300 gvcf files. What is your suggestion to run GenomicsDBImport efficiently?

ADD REPLY
0
Entering edit mode

What is your suggestion to run GenomicsDBImport efficiently?

call the GVCF by slices eg: chr1:1000000-2000000
run GenomicsDBImport for each sample for one interval
genotype gvcf for one interval

at the end concatenate all the VCFs

ADD REPLY
0
Entering edit mode

Then what is the best way to do slicing for 1Mb windows? Doing --intervals in the GenomicsDBImport?

ADD REPLY
0
Entering edit mode

No, in haplotypecaller

ADD REPLY
0
Entering edit mode

I already have gvcfs called by chromosome and there are 24 gvcfs per sample which I merged them into a single gvcf per sample. Now I want to slice them to smaller chunks.

ADD REPLY

Login before adding your answer.

Traffic: 2202 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6