Converting full Protein names into Gene ID
8.7 years ago
pbio ▴ 150

Hi all,

I have a list of full protein names that looks like the one below. The problem is to find the corresponding gene name for each full protein name in the list.

CYP3A1
CYSTATIN C
CYTOCHROME P450
CYTOCHROME P450 11B2, MITOCHONDRIAL
CYTOKERATIN 16
CYTOKERATIN 18
CYTOKERATIN 18 M30/M65 RATIO
CYTOKERATIN 19
D-DIMER

I know of tools like DAVID and BioMart, which can convert short protein names to gene IDs. Is there a way to convert this list of full names to gene IDs?

Protein Gene • 4.0k views

Are these all you have, or are there a lot more names in your list? Generally you would need an official identifier to use conversion tools. Are there any other columns in your file that may contain such identifiers?

8.7 years ago

These look like gene names to me (in the sense of HGNC's full gene names). You could use the Ensembl API to match them against all gene names and synonyms in the database. Hopefully this would resolve a fair number of them. The problem is that these names are often ambiguous, and some have been assigned to different genes. This is why it's better to rely on database identifiers (together with the proper database version).
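To illustrate the matching idea, here is a minimal Python sketch that normalizes each name and looks it up in a table of gene names and synonyms. It does not call the Ensembl API. The `SYNONYMS` table is a tiny hand-written stand-in (an assumption for illustration) for what you would load from an HGNC or Ensembl export, and the helper names `normalize`, `build_index` and `match_names` are hypothetical. Every query returns a list, so unmatched and ambiguous names stay visible instead of being silently dropped.

```python
import re
from collections import defaultdict

# Illustrative stand-in for a real name/synonym table (assumption: in practice
# this would be loaded from an HGNC or Ensembl export of a known release).
SYNONYMS = {
    "CST3": ["cystatin C"],
    "KRT16": ["keratin 16", "cytokeratin 16"],
    "KRT18": ["keratin 18", "cytokeratin 18"],
    "KRT19": ["keratin 19", "cytokeratin 19"],
    "CYP11B2": ["cytochrome P450 11B2, mitochondrial"],
}


def normalize(name):
    """Lower-case the name and collapse punctuation/whitespace to single spaces."""
    return re.sub(r"[^a-z0-9]+", " ", name.lower()).strip()


def build_index(synonyms):
    """Map each normalized symbol or synonym to the set of symbols that use it."""
    index = defaultdict(set)
    for symbol, names in synonyms.items():
        index[normalize(symbol)].add(symbol)
        for name in names:
            index[normalize(name)].add(symbol)
    return index


def match_names(queries, index):
    """Return {query: sorted list of matching symbols}; an empty list means no hit."""
    return {q: sorted(index.get(normalize(q), set())) for q in queries}


if __name__ == "__main__":
    queries = [
        "CYSTATIN C",
        "CYTOCHROME P450 11B2, MITOCHONDRIAL",
        "CYTOKERATIN 18",
        "D-DIMER",
    ]
    for query, symbols in match_names(queries, build_index(SYNONYMS)).items():
        print(query, "->", symbols if symbols else "no match")
```

Entries that come back with no match (D-DIMER, which is a fibrin degradation product rather than the product of a single gene) or with more than one symbol need manual review, which is exactly the ambiguity problem described above.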
