Question: Extract exon-intron reads from RNA-seq data
Ambika (United States) wrote 3.0 years ago:

Hello everyone,

I am working with a plant fungus for which I have a DNA sequence. I am trying to design primers for some differentially expressed genes, and for that I need to know the exon-exon junctions. Is there any way to get the exon-intron structure of the genes I am interested in, so that I can design the primers easily?

Many thanks in advance,

Ambika

rna-seq • 1.3k views
written 3.0 years ago by Ambika

Have you checked to see if there is a gene annotation file available? You should be able to figure the junctions out from that.

written 3.0 years ago by GenoMax

GenoMax, I have a GTF file from AUGUSTUS, but I am not sure how to infer this information from it. Can you please tell me whether that is the file I need and how to interpret it?

written 3.0 years ago by Ambika

Does the GTF file have entries for exons (and perhaps introns)?

modified 3.0 years ago • written 3.0 years ago by GenoMax

It has CDS entries instead of exons.

written 3.0 years ago by Ambika

Those 4 are exons of that one gene transcript (g17308.t1), so presumably there is more than one transcript for each gene (g17308)?

modified 3.0 years ago • written 3.0 years ago by GenoMax

So these numbers represent the start and end positions of the sequences that are exons. I searched for this gene in the file and found just one transcript (g17308.t1). So can I say this gene has 4 exons?

written 3.0 years ago by Ambika

If those are all the blocks, then yes. Check the format specification for what the number following the strand signifies; I don't recollect off the top of my head.

modified 3.0 years ago • written 3.0 years ago by GenoMax
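
For reference, in GTF the eighth column (the number after the strand) is the frame: the number of bases (0, 1 or 2) to skip before the first complete codon of that segment. A minimal Python sketch, using only the coordinates of the four CDS lines posted in this thread and assuming a + strand transcript, checks that the frame values agree with the segment lengths. The helper name `next_frame` is made up for illustration.

```python
# The four CDS records from the thread: (start, end, frame), 1-based inclusive.
cds = [
    (869652, 869790, 0),
    (869843, 870931, 2),
    (870982, 871333, 2),
    (871385, 871442, 1),
]


def next_frame(start, end, frame):
    """Frame expected for the following CDS segment (+ strand assumed)."""
    length = end - start + 1
    # Bases left over after the last whole codon decide how many bases
    # the next segment must contribute to finish that codon.
    return (3 - (length - frame) % 3) % 3


# Start from the first segment's frame and propagate along the transcript.
expected = [cds[0][2]]
for start, end, frame in cds[:-1]:
    expected.append(next_frame(start, end, frame))

observed = [frame for _, _, frame in cds]
frames_consistent = expected == observed
total_cds_length = sum(end - start + 1 for start, end, _ in cds)

print(expected, observed, frames_consistent, total_cds_length)
```

If the frames chain correctly and the total CDS length is a multiple of 3, the four blocks plausibly form one complete coding sequence.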
The format looks like this:

tig00007990     AUGUSTUS        CDS     869652  869790  0.90    +       0       transcript_id "g17308.t1"; gene_id "g17308";
tig00007990     AUGUSTUS        CDS     869843  870931  0.99    +       2       transcript_id "g17308.t1"; gene_id "g17308";
tig00007990     AUGUSTUS        CDS     870982  871333  1.00    +       2       transcript_id "g17308.t1"; gene_id "g17308";
tig00007990     AUGUSTUS        CDS     871385  871442  0.99    +       1       transcript_id "g17308.t1"; gene_id "g17308";

modified 3.0 years ago • written 3.0 years ago by Ambika
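
As a rough sketch of how the junctions could be read off a file like this, the Python below groups CDS records by `transcript_id`, derives the intron coordinates as the gaps between consecutive blocks, and reports where each junction falls in the spliced coding sequence (useful when placing a primer across a junction). It makes several assumptions: the function names are invented for this example, the transcript is on the + strand, and fields are split on whitespace (real GTF is tab-delimited). CDS blocks only cover the coding part, so the first and last exons may extend further into UTRs than shown here.

```python
import re
from collections import defaultdict

# The CDS lines posted in the thread (whitespace-separated here).
GTF_TEXT = '''\
tig00007990 AUGUSTUS CDS 869652 869790 0.90 + 0 transcript_id "g17308.t1"; gene_id "g17308";
tig00007990 AUGUSTUS CDS 869843 870931 0.99 + 2 transcript_id "g17308.t1"; gene_id "g17308";
tig00007990 AUGUSTUS CDS 870982 871333 1.00 + 2 transcript_id "g17308.t1"; gene_id "g17308";
tig00007990 AUGUSTUS CDS 871385 871442 0.99 + 1 transcript_id "g17308.t1"; gene_id "g17308";
'''


def cds_by_transcript(gtf_text):
    """Collect (start, end) of CDS records per transcript, sorted by start."""
    blocks = defaultdict(list)
    for line in gtf_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.split(None, 8)  # 8 fixed columns + attribute string
        if len(fields) < 9 or fields[2] != "CDS":
            continue
        match = re.search(r'transcript_id "([^"]+)"', fields[8])
        if match:
            blocks[match.group(1)].append((int(fields[3]), int(fields[4])))
    return {tid: sorted(b) for tid, b in blocks.items()}


def introns(blocks):
    """Genomic (start, end) of each gap between consecutive blocks."""
    return [(prev_end + 1, nxt_start - 1)
            for (_, prev_end), (nxt_start, _) in zip(blocks, blocks[1:])]


def junction_offsets(blocks):
    """Last spliced-CDS position (1-based) before each junction, + strand."""
    offsets, total = [], 0
    for start, end in blocks[:-1]:
        total += end - start + 1
        offsets.append(total)
    return offsets


transcripts = cds_by_transcript(GTF_TEXT)
for tid, blocks in transcripts.items():
    print(tid, "blocks:", len(blocks))
    print("  introns:", introns(blocks))
    print("  junctions after spliced positions:", junction_offsets(blocks))
```

For a transcript on the - strand the blocks would need to be walked in reverse order before computing the spliced offsets; that case is not handled here.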