Understanding the VCF data from my variant calling
1
1
Entering edit mode
6.2 years ago
jaqx008 ▴ 110

Hey guys. I was able to successfully generate my VCF file and I am confused as to how to go from here in analyzing my data. I was asked to identify variance in the open reading frame region using certain quality criteria. I haven't used the IGV before and I am needing help doing all these. Also, any other suggestion that would help is welcomed.

BTW Thanks for all your help in helping with my BAM to VCF.

VCF Variant calling IGV Snps Indels • 1.4k views
ADD COMMENT
1
Entering edit mode
6.2 years ago

Really need more information to properly answer this - are you just looking at variance or for deleterious changes? Regardless, you can filter your VCF file to pull out the variants for your region of interest quite easily with bedtools intersect.

Additionally, you can annotate your VCF file with something like the Variant Effect Predictor to see if any of the variants are predicted to impact function/truncate your protein of interest.

If you need to filter for certain criteria, you can use bcftools.

ADD COMMENT
0
Entering edit mode

Thank you jared. I am basically trying to identify variations within the open reading frame. I don't know if that makes sense. This isn't something I have done before. I am working with Dustmites and trying to see changes in the ORF only. sorry for my late response

ADD REPLY
0
Entering edit mode

If you're just looking for variations in DNA, my first suggestion should work for you. Get the positions of your exons from your reference genome and intersect with them to pull out variants that lie within the exons from your VCF file.

If you're looking for those that actually alter protein sequence, you can create a new consensus sequence with the variants identified above replacing the reference allele and translating that sequence into amino acids, then comparing which change between each sample and the reference genome.

ADD REPLY
0
Entering edit mode

OK. Sounds good. Let me find a way to extract the eons from the .fa reference genome.

ADD REPLY

Login before adding your answer.

Traffic: 2724 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6