I have performed variant calling and annotation analysis from WGS data. Now I need to get sequences of few genes containing variants.
How can I get sequences of particular variant containing genes?
I have performed variant calling and annotation analysis from WGS data. Now I need to get sequences of few genes containing variants.
How can I get sequences of particular variant containing genes?
It's tricky to get full sequences out of a bam, so your best bet is to make a fixed consensus sequence using your original reference fasta and your vcf.
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
I know this isn't the exact answer to your question but the most common workflow to see the impact of variants on genes is to run a variant effect predictor
OP seems to have at least tried doing this (if it the same data): building snpeff database for plant
Yes I have done variant annotation.
Its rice data, I used available rice database from snpEff.
My PI wants to perform structural analysis too like comparing structure of normal gene structure with annotated gene containing variant.
Therefore I would need to extract gene sequences for only 3 or 4 genes from our WGS data.
Your guidance is highly appreciated.
Thankyou!