Question: Transcript and protein
Asked 11 months ago by Vladislav (Yaroslavl', Russia):

Hi, Biostars community.

First, sorry for my English.

I have a set of mRNA transcript IDs, e.g. 'NM_007300.4', 'NM_007297.4', 'NM_007294.3', ... I can find which of them is canonical by using knownCanonical.txt and kgXref.txt from UCSC; it is 'NM_007300' in this case.

But I also have a set of their protein IDs, e.g. 'NP_009231.2', 'NP_009228.2', 'NP_009225.1', ...

So, could you please tell me how to find which of them corresponds to the canonical mRNA transcript?


transcript rna-seq protein • 240 views

The IDs you have above are basically cross-references to each other. You can verify that with Entrez Direct:

$ esearch -db nuccore -query "NM_007297.4" | elink -target protein | efetch -format acc
$ esearch -db protein -query "NP_009228" | elink -target nuccore | efetch -format acc
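
To answer the original question for a whole list, the same lookup can be run in a loop and the result filtered by the canonical transcript. This is only a rough sketch, not a tested pipeline: the function names `protein_transcript_pairs` and `canonical_protein` are made up for illustration, and the `grep '^NM_'` filter assumes the transcript you care about is the first NM_ accession linked to each protein.

```shell
#!/bin/sh
# Sketch only: pair each protein accession with its linked transcript via
# Entrez Direct, then keep the protein whose transcript is the canonical one.

# protein_transcript_pairs (hypothetical name): reads protein accessions from
# stdin, one per line, and prints "protein<TAB>transcript" pairs.
# Needs Entrez Direct installed and network access.
protein_transcript_pairs() {
    while read -r prot; do
        [ -n "$prot" ] || continue
        # </dev/null stops esearch from swallowing the loop's stdin.
        # Assumption: the first linked NM_ accession is the mRNA of interest.
        tx=$(esearch -db protein -query "$prot" </dev/null |
             elink -target nuccore | efetch -format acc |
             grep '^NM_' | head -n 1)
        printf '%s\t%s\n' "$prot" "$tx"
    done
}

# canonical_protein (hypothetical name): reads "protein<TAB>transcript" pairs
# from stdin and prints the protein(s) whose transcript matches the canonical
# accession given as $1. Version suffixes (.4, .3, ...) are ignored on both sides.
canonical_protein() {
    canon=${1%%.*}
    awk -F '\t' -v c="$canon" '
        { t = $2; sub(/\.[0-9]+$/, "", t); if (t == c) print $1 }'
}

# Usage, with the IDs from the question:
#   printf '%s\n' NP_009231.2 NP_009228.2 NP_009225.1 |
#       protein_transcript_pairs | canonical_protein NM_007300
```

Comparing accessions without the version suffix is a deliberate choice here, since the UCSC tables and NCBI may not report the same version of a record.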

If you want to convert Ensembl identifiers from knownCanonical.txt, you could use the Ensembl REST API (random example from the canonical file) or BioMart.

Answer written 11 months ago by genomax.