Question: Transcript and protein
gravatar for Vladislav
4 weeks ago by
Yaroslavl', Russia
Vladislav10 wrote:

Hi, Biostars community.

At first, sorry for my English.

I have some set of mRNA transcripts ids e.g.: 'NM_007300.4', 'NM_007297.4', 'NM_007294.3' ... Well, I can to find which of them is canonical by using knownCanonical.txt and kgXref.txt from ucsc. 'NM_007300' in this case.

But I also have a set of their proteins ids, e.g.: 'NP_009231.2', 'NP_009228.2', 'NP_009225.1' ...

So, can you, please, tell me, how to find which of them depends to canonical mRNA transcript?


transcript rna-seq protein • 112 views
ADD COMMENTlink written 4 weeks ago by Vladislav10

ID's you have above are basically cross-references to each other. Using EntrezDirect you can verify that:

$ esearch -db nuccore -query "NM_007297.4" | elink -target protein | efetch -format acc
$ esearch -db protein -query "NP_009228" | elink -target nuccore | efetch -format acc

If you want to convert Ensembl identifiers from knownCanonical.txt, you could use their REST API (random example from Canonical file) or BioMart.

ADD REPLYlink modified 4 weeks ago • written 4 weeks ago by genomax74k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1412 users visited in the last hour