Getting The Vcf Consensus For Only A Segment Of The Genome
1
0
Entering edit mode
10.1 years ago
Will 4.5k

I have numerous VCF files of a HIV genomic region and I'm looking to retrieve the resulting consensus sequence from just a region ~50 nt of the alignment so I can feed it to a TF binding prediction program. Normally I'd just use vcf-consensus to output the whole region and then slice the region I want out. However, there are numerous indels (of varying sizes) in this region that are important and I don't want to use that sort of method.

I can't seem to find any flags in the vcf-consensus tool that lets me limit the output.

Any suggestions?

• 2.2k views
ADD COMMENT
0
Entering edit mode
10.1 years ago

Aside from awk? You can use BEDTools to get the intersect of a vcf and a .bed file with your region.

ADD COMMENT
0
Entering edit mode

I can easily filter the VCF to get my desired region. But if there are insertions wouldn't bedtools-intersect just give me the 50nt region and not 50 nt + insertions?

ADD REPLY
0
Entering edit mode

Will, how did you solved this? How did you filter the desired region in a .vcf? Thanks!

ADD REPLY

Login before adding your answer.

Traffic: 2587 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6