how to fetch intron coordinates from transcriptome.gtf file ? is there any direct tool ?
1
0
Entering edit mode
5 weeks ago
isha.lily20 ▴ 10

Hello

Is there any direct tool to fetch intron coordinates from transcriptome.gtf file ?

Is there any direct tool available ?

Thank you

intron extract coordinates • 566 views
1
Entering edit mode

It’s simple to parse. Simply take the coordinates annotated as .gene transcript and those annotated as exon. Take the complement, so gene transcript minus exon, that’s the introns.

3
Entering edit mode

If you want the exact intron ranges you would need to modify the above by looking per transcript. Doing it per gene would give you regions contained within the introns of all transcripts, but not proper introns per say.

1
Entering edit mode

Indeed, good catch!

0
Entering edit mode

i have done intergenic minus exon for intron coordinates, that is wrong ?

2
Entering edit mode

In a sense, yes it is wrong because it doesn't capture biological complexity of alternative splicing. If you have simple gene models where each gene has a single annotated transcript, then it would suffice, but only because the gene models do not capture the real biology well.

Imagine these hypothetical transcripts for the same gene:

T1: oo     t1e1]--------[        t1e2]----------[  t1e3  ooo>
T2: ooo    t2e1]--------[    t2e2]-----[ t2e3]--[ t2e4 oo>

[  ]: internal exon boundaries
--: intronic sequence


If you look at genes in a genome browser you will find even more complex cases. Therefore, it becomes very clear that introns depend on the actual transcript.

2
Entering edit mode
4 weeks ago
Juke34 ★ 7.1k

If GFF/GTF output format is ok you can use agat_sp_add_introns.pl from AGAT

Traffic: 677 users visited in the last hour
FAQ
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.