Question: Extract genes from GTF with variants from VCF
gravatar for Picasa
10 months ago by
Picasa560 wrote:


I am studying allele specific expression and I am stuck on the step to get 2 transcripts (one from haplotype 1 and one from haplotype 2).

To get genes/transcripts from a GTF I use cufflink gffreads: this is not a problem.

Now I want to make version 2 (ie. haplotype 2, or alternative) of these transcripts (from a .vcf I have): if these are SNPs, there are no issues but if I got indels then the coordinates of the GTF are not relevant anymore.

Someone have an idea how can I create the haplotype 2 transcripts?

Thanks a lot.

variants gtf • 293 views
ADD COMMENTlink written 10 months ago by Picasa560

What's your end point objective? If I understand correctly, you have a VCF file with variants (SNP/INDELs) and you'd like to generate transcripts that reflect both haplotypes that you're studying. For haplotype 1, it's the reference? and haplotype 2 it's the variant?

Or are you after something else?

ADD REPLYlink written 10 months ago by Mark800

Yes exactly, sorry for not being clear.

It's easy to integrate the alternative SNPs (haplotype 2) because it doesn't shift the coordinates of my transcripts (in the GTF), but for indels... I don't have any solutions.

ADD REPLYlink written 10 months ago by Picasa560

I'm slightly confused. You have the reads already, or how was the vcf file generated? Or are you after integrating the indels into a reference fasta file?

ADD REPLYlink written 10 months ago by Mark800

So basically I had wgs data that I used to call variants (SNPs and indels) -> at the end .vcf file.

I have a reference genome fasta file and want to create an alternative genome fasta file (with SNPs and indels) -> This is easy.

But the difficulty is that I want to extract transcripts (so I have annotation GTF file) of this alternative one (what I call haplotype 2) with SNPs (easy) AND indels (difficult as the coordinates of my exons in the GTF files are not relevant anymore).

ADD REPLYlink modified 10 months ago • written 10 months ago by Picasa560
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1011 users visited in the last hour