Question: Get Flanking Amino Acid Sequence
9 months ago, windsur wrote:

Dear all,

I've just performed exome sequencing and obtained the VCF file. To continue with my experiment, I need to extract the wild-type (wt) and mutant (mut) flanking regions for my dataset, because I need to synthesize them for immunotherapy research. In my VCF file I have a column like this:



And the desired output is like this:

Wt Epitope                  Mut Epitope

In case there is more than one transcript, I need the first one. I know how to obtain the flanking nucleotide regions, but I have not found anything like a refGene.txt for amino acids. I used hg19 as the reference genome.

Any help is welcome!

Tags: python, snp, sequence, dna-seq, R

Thank you Chris! Unfortunately, I do not have much time to learn how to use pVACtools, because I would need to use another format of my VCF file... I think there is a faster way to do what I need: if we have the amino acid position (e.g. G580C), a script similar to bedtools could get the flanking positions. If anyone can help I will be very happy :)

written 9 months ago by windsur
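
The approach described in the comment above (take an amino acid change such as G580C and cut out the residues around it) can be sketched in a few lines of Python. This is only a minimal illustration under some assumptions: the protein sequence is already available as a string and comes from the same transcript the annotation refers to, the change is a single missense substitution written as reference residue, 1-based position, alternate residue, and the function name `flanking_peptides` and the `flank` parameter are hypothetical names chosen for this example. Frameshifts, indels and stop codons are not handled.

```python
import re


def flanking_peptides(protein_seq, mutation, flank=10):
    """Return (wt_peptide, mut_peptide) around a missense change.

    protein_seq: full protein sequence of the annotated transcript (assumed input)
    mutation:    string such as "G580C" (ref residue, 1-based position, alt residue)
    flank:       number of residues to keep on each side (hypothetical parameter)
    """
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
    if match is None:
        raise ValueError(f"Unsupported mutation format: {mutation!r}")

    ref, pos, alt = match.group(1), int(match.group(2)), match.group(3)
    idx = pos - 1  # convert the 1-based protein position to a 0-based index

    if idx < 0 or idx >= len(protein_seq):
        raise ValueError(f"Position {pos} is outside the protein sequence")
    if protein_seq[idx] != ref:
        # A mismatch usually means the sequence is from a different transcript.
        raise ValueError(
            f"Expected {ref} at position {pos}, found {protein_seq[idx]}"
        )

    # Clip the window at the ends of the protein.
    start = max(0, idx - flank)
    end = min(len(protein_seq), idx + flank + 1)

    wt = protein_seq[start:end]
    mut = protein_seq[start:idx] + alt + protein_seq[idx + 1:end]
    return wt, mut


# Toy example with a made-up 20-residue sequence (not a real protein).
toy_seq = "ACDEFGHIKLMNPQRSTVWY"
wt_pep, mut_pep = flanking_peptides(toy_seq, "G6C", flank=3)
print(wt_pep, mut_pep)
```

The check that the reference residue matches the sequence is the important part: if it fails, the protein sequence most likely does not correspond to the transcript used for the annotation, and the resulting peptides would be wrong.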
9 months ago, Chris Miller (Washington University in St. Louis, MO) wrote:

I highly suggest that you check out the pVACtools suite, which uses some VEP plugins and custom parsing to extract exactly this information and format it nicely before doing binding affinity predictions.
