How to fetch intron coordinates from a transcriptome.gtf file? Is there any direct tool?
23 months ago
isha.lily20 ▴ 10

Hello

Is there any direct tool to fetch intron coordinates from a transcriptome.gtf file?

Thank you

intron extract coordinates • 2.0k views

It’s simple to parse. Simply take the coordinates annotated as ~~gene~~ transcript and those annotated as exon. Take the complement, so ~~gene~~ transcript minus exon; those are the introns.

If you google the question, you will find multiple threads on this here at Biostars already.


If you want the exact intron ranges, you would need to modify the above by looking per transcript. Doing it per gene would give you regions contained within the introns of all transcripts, but not proper introns per se.
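As a rough illustration of the per-transcript approach, here is a minimal Python sketch. It assumes a standard 9-column, tab-separated GTF with `exon` features carrying a `transcript_id "..."` attribute; the function name `introns_per_transcript` and the toy lines are made up for this example and do not come from any existing tool. Introns are taken as the gaps between consecutive exons of the same transcript, in 1-based inclusive coordinates like the GTF itself.

```python
import re
from collections import defaultdict


def introns_per_transcript(gtf_lines):
    """Return a list of (chrom, start, end, strand, transcript_id) introns.

    Coordinates are 1-based and inclusive, as in GTF.
    """
    exons = defaultdict(list)
    for line in gtf_lines:
        if line.startswith("#") or not line.strip():
            continue  # skip comments and blank lines
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 9 or fields[2] != "exon":
            continue  # only exon features are needed
        match = re.search(r'transcript_id "([^"]+)"', fields[8])
        if match is None:
            continue  # exon without a transcript_id (assumed rare)
        key = (fields[0], fields[6], match.group(1))
        exons[key].append((int(fields[3]), int(fields[4])))

    introns = []
    for (chrom, strand, tx), spans in exons.items():
        spans.sort()
        # An intron is the gap between two consecutive exons of one transcript.
        for (_, prev_end), (next_start, _) in zip(spans, spans[1:]):
            if next_start - prev_end > 1:
                introns.append((chrom, prev_end + 1, next_start - 1, strand, tx))
    return introns


# Toy input (hypothetical coordinates): two transcripts of the same gene.
demo_gtf = [
    'chr1\ttoy\texon\t100\t200\t.\t+\t.\tgene_id "g1"; transcript_id "t1";',
    'chr1\ttoy\texon\t300\t400\t.\t+\t.\tgene_id "g1"; transcript_id "t1";',
    'chr1\ttoy\texon\t100\t200\t.\t+\t.\tgene_id "g1"; transcript_id "t2";',
    'chr1\ttoy\texon\t250\t400\t.\t+\t.\tgene_id "g1"; transcript_id "t2";',
]
demo_introns = introns_per_transcript(demo_gtf)
for intron in demo_introns:
    print(*intron, sep="\t")
```

For a real file, the same function could be fed an open file handle, e.g. `introns_per_transcript(open("transcriptome.gtf"))`. Note that the same intron shared by several transcripts is reported once per transcript; deduplicate on (chrom, start, end, strand) if unique introns are wanted.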


Indeed, good catch!


I have done intergenic minus exon for intron coordinates. Is that wrong?


In a sense, yes, it is wrong, because it doesn't capture the biological complexity of alternative splicing. If you have simple gene models where each gene has a single annotated transcript, then it would suffice, but only because those gene models do not capture the real biology well.

Imagine these hypothetical transcripts for the same gene:

T1: oo     t1e1]--------[        t1e2]----------[  t1e3  ooo>
T2: ooo    t2e1]--------[    t2e2]-----[ t2e3]--[ t2e4 oo>

 oo: UTRs or leader sequences
 [  ]: internal exon boundaries
 --: intronic sequence

If you look at genes in a genome browser you will find even more complex cases. Therefore, it becomes very clear that introns depend on the actual transcript.
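To make the difference concrete, here is a small Python sketch that compares the two approaches on made-up coordinates loosely modelled on T1 and T2 above (the numbers are hypothetical, chosen only for illustration). The helper `gaps` merges overlapping or adjacent intervals and returns what lies between them, so applying it to one transcript's exons gives that transcript's introns, while applying it to all exons of the gene gives the "gene minus exon" result.

```python
def gaps(exons):
    """Return the gaps between merged, 1-based inclusive intervals."""
    merged = []
    for start, end in sorted(exons):
        if merged and start <= merged[-1][1] + 1:
            # Overlapping or directly adjacent: extend the previous interval.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [(a[1] + 1, b[0] - 1) for a, b in zip(merged, merged[1:])]


# Hypothetical exon coordinates for two transcripts of one gene.
transcripts = {
    "T1": [(1, 100), (200, 400), (500, 600)],
    "T2": [(1, 100), (200, 350), (420, 460), (500, 600)],
}

# Proper introns: computed separately for each transcript.
per_transcript = {tx: gaps(exons) for tx, exons in transcripts.items()}

# "Gene minus exon": gaps left after pooling the exons of all transcripts.
all_exons = [exon for exons in transcripts.values() for exon in exons]
per_gene = gaps(all_exons)

print("T1 introns:", per_transcript["T1"])
print("T2 introns:", per_transcript["T2"])
print("gene minus exon:", per_gene)
```

With these toy numbers, the per-gene result contains the region 401-419, which is not an intron of either transcript, and it misses T1's intron 401-499 and T2's intron 351-419 entirely.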

22 months ago
Juke34 8.5k

If GFF/GTF output format is OK, you can use agat_sp_add_introns.pl from AGAT.
