How To Find The Total Number Of Exons In A Transcript?
1
0
Entering edit mode
10.7 years ago
alexhli ▴ 90

I would like to look up the total number of exons in a transcript, starting with the NM# for this cDNA. For example, say I have a list of thousands of mutations which are formatted like this:

PRAMEF1:NM_023013:exon3:c.T314A:p.L105X

How can I tell if this mutation occurs in the last exon of the transcript? Or at least the first half of the total transcript? This issue is specifically related to nonsense-mediated decay.

Thanks

genetics mrna sequence • 5.7k views
ADD COMMENT
1
Entering edit mode
10.7 years ago
Emily 23k

You could use a Perl script to extract the RefSeq ID, then find the transcript using the Ensembl API and find out how many exons it has.

ADD COMMENT
0
Entering edit mode

You could also use BioMart, with RefSeq mRNA ID as the filter and Ensembl Exon ID as attribute. Returns 4 exons IDs for the example in the question.

ADD REPLY
1
Entering edit mode

The reason I suggested Perl as if all the data is in the format shown above, then they're going to have to extract the RefSeq ID in some way anyway, and you can easily do it with a split function in Perl. Once you've started doing it that way, you may as well continue.

The other reason I suggested the API is the numbers. BioMart can sometimes be a bit difficult with large numbers of items.

ADD REPLY

Login before adding your answer.

Traffic: 1807 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6