Identify lncRNA in list of Ensembl ID's
3
0
Entering edit mode
6.0 years ago
Laura • 0

Hi all,

I have a list of ~17000 transcripts in the form of Ensembl IDs (although this can be converted to any other form of ID needed), and I am looking to figure out which of the ~17000 are coding and which are non-coding RNAs.

Ensembl itself obviously tells you if it is coding or non-coding when you search the ensembl ID, but is it not feasible to do this by hand for so many transcripts. I have also found lncRNA databases but again they seem to be searched one by one.

Does anyone know how can I run the whole list against an existing database to give me a non-coding or coding for each of the transcripts?

Any help much appreciated!

Thanks! Laura

genome gene Ensembl • 6.9k views
ADD COMMENT
5
Entering edit mode
6.0 years ago
Prasad ★ 1.6k

this you could do it using Ensemble biomart. You can upload all the transcript id in the filter option and using Attribute you can select transcript type or gene type at what level you want to see.

I guess this solves the purpose

ADD COMMENT
1
Entering edit mode

There's a help video to get you started with BioMart.

ADD REPLY
0
Entering edit mode

That worked beautifully. Thank you!

ADD REPLY
2
0
Entering edit mode

Thanks! Managed, saved me lots and lots of time

ADD REPLY
1
Entering edit mode
6.0 years ago
poisonAlien ★ 3.1k

You can just download ensemble gtf and map transcript ids to their biotype.

Or upload your list to biomart and get all associated attributes.

ADD COMMENT

Login before adding your answer.

Traffic: 1888 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6