I have a list of ~17000 transcripts in the form of Ensembl IDs (although this can be converted to any other form of ID needed), and I am looking to figure out which of the ~17000 are coding and which are non-coding RNAs.
Ensembl itself obviously tells you if it is coding or non-coding when you search the ensembl ID, but is it not feasible to do this by hand for so many transcripts. I have also found lncRNA databases but again they seem to be searched one by one.
Does anyone know how can I run the whole list against an existing database to give me a non-coding or coding for each of the transcripts?
Any help much appreciated!
There's a help video to get you started with BioMart.
That worked beautifully. Thank you!