Intersecting compressed gVCF with bed file
5 weeks ago
graeme.thorn ▴ 60

This may be a ridiculously simple question, but I have a compressed genomic VCF (gVCF) file generated by the Strelka germline variant caller, with lines like the following where no variation was detected:

chr1    27394730    .   T   .   .   PASS    END=27394756;BLOCKAVG_min30p3a  GT:GQX:DP:DPF:MIN_DP    0/0:3070:1137:14:1122

I need to intersect this with a set of regions I'm interested in. I have tried using bedtools intersect with a suitable BED file, but this only matches the start of the block (chr1:27394730) and not the rest of the interval chr1:27394730-27394756.

Is there a way to run this intersection using bedtools?

I would think there's a way to do this by converting the gVCF into a BED file (preserving the variants that Strelka has found), but if there's a tool that can do this directly, please point me in that direction.
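For illustration, the conversion idea could be sketched roughly as below. This is only a minimal sketch under some assumptions: the gVCF has a single sample column, the block end is stored as END= in the INFO field (as in the example line above), and records without END span the length of REF. The function name gvcf_to_bed and the file names in the usage comment are made up for the example.

```shell
# Hypothetical helper: turn gVCF records (read from stdin) into BED-like lines.
# BED is 0-based and half-open, so start = POS - 1 and end = END (or POS - 1 + length of REF).
gvcf_to_bed() {
  awk 'BEGIN { FS = OFS = "\t" }
       /^#/ { next }                       # skip VCF header lines
       {
         start = $2 - 1
         end = start + length($4)          # default: span of the REF allele
         n = split($8, info, ";")
         for (i = 1; i <= n; i++)
           if (info[i] ~ /^END=/) end = substr(info[i], 5) + 0   # block records carry END=
         # keep REF, ALT, FILTER, INFO, FORMAT and the sample column after the interval
         print $1, start, end, $4, $5, $7, $8, $9, $10
       }'
}

# Possible usage (file names are placeholders):
#   zcat in.genome.vcf.gz | gvcf_to_bed | bedtools intersect -a - -b regions.bed -wa -wb
```

With the record shown above, this should produce an interval of chr1 27394729 27394756, so the whole block takes part in the intersection rather than just its first base.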

bedtools gVCF intersect • 232 views
5 weeks ago
bcftools view --regions-file in.bed in.vcf.gz

Thanks. Is there a way of stating which region overlaps with the vcf using bcftools view?

I'm sort of looking for the kind of output you get with bedtools using the -wa and -wb flags.

No, that's a job for bcftools annotate, with the annotation supplied as an indexed bed.gz file.
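
A rough sketch of how that could look, following the BED annotation pattern from the bcftools annotate documentation. Several things here are assumptions for illustration: the wrapper name annotate_with_regions, the INFO tag name REGION, and a sorted BED file whose fourth column holds the region name. Whether the overlap test considers the END of a reference block rather than only POS is not confirmed here, so spot-check a few block records in the output.

```shell
# Hypothetical wrapper: tag each VCF record with the name of the BED region it falls in.
# Assumes the BED file is sorted and has the region name in column 4.
annotate_with_regions() {
  bed=$1; vcf=$2; out=$3
  # bcftools annotate expects a bgzip-compressed, tabix-indexed annotation file
  bgzip -c "$bed" > "$bed.gz"
  tabix -f -p bed "$bed.gz"
  # Declare the new INFO tag (REGION is an illustrative name)
  printf '%s\n' '##INFO=<ID=REGION,Number=1,Type=String,Description="Overlapping BED region">' > "$bed.hdr"
  # Map BED columns 1-4 onto CHROM, FROM, TO and the new tag
  bcftools annotate -a "$bed.gz" -h "$bed.hdr" -c CHROM,FROM,TO,REGION -Oz -o "$out" "$vcf"
}

# Possible usage (file names are placeholders):
#   annotate_with_regions regions.bed in.vcf.gz annotated.vcf.gz
```

The region name then travels with each record in INFO/REGION, which is roughly the information bedtools reports with -wa and -wb.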
