Question: extracting exons and introns cordinates from the gtf file
0
gravatar for HK
23 months ago by
HK30
HK30 wrote:

Hi everyone,

I have a gtf file download from Gencode (##description: evidence-based annotation of the human genome (GRCh38), version 25 (Ensembl 85), ##provider: GENCODE) and now want to load this gtf in R or in linux and want to do few things

1) want to know how many exons per gene and the cordinates of these exons per gene (the file has the transcripts information als, but i only want per gene).

2) Extract the sequence of these exons (gene). I did try biomart and did get the cordinates and sequence of exons but for all transcripts of that gene.

3) Also want the intron cordinates.

Kindly i need suggestion which package should i use in R to upload my downloaded gtf file and perform the tas or maybe how should ui do this by linux commands.

ADD COMMENTlink modified 23 months ago • written 23 months ago by HK30

Probably the BioConductor package GenomicFeatures does everything (or almost) you need. Have a look at the function exonsBy().

ADD REPLYlink written 23 months ago by h.mon26k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1640 users visited in the last hour