Question: how to convert SwissProt or UniProt gene names to standard ID
0
gravatar for Learner
3.4 years ago by
Learner 180
Learner 180 wrote:

I know that based on web like david or others you can convert a list of gene with different symbols to IDs. for example DAVID https://david.ncifcrf.gov/content.jsp?file=conversion.html I am looking to find a way which I can do this in R or python

Lets say I have the following symbols, how can I convert them to standard IDs .

CONT_PPIA_HUMAN
DECOY1_53727
CONT_PRDX1_HUMAN
P09041
Q3TV35
CONT_CAS2_BOVIN
Q8BG13
Q3UI43
Q64475
CONT_ANXA5_HUMAN
P38060
python R • 1.9k views
ADD COMMENTlink modified 3.4 years ago by Girolamo140 • written 3.4 years ago by Learner 180
1

What do you mean by "standard ID"? There is not really any such thing. Can you give an example of an output ID?

ADD REPLYlink modified 3.4 years ago • written 3.4 years ago by Neilfws48k

@Neilfws for example when I use DAVID, It does not recognise all but gives me an conversion like this from the list above. so, I don't know if I can search for different organisms

P38060  443585  
Q64475  427395  
Q3UI43  458616  
Q8BG13  464628  
P09041  481479  
Q3TV35  463168
ADD REPLYlink modified 3.4 years ago • written 3.4 years ago by Learner 180
1

A search on Google seems to give no results for a few of your IDs in your original list e.g. CONT_PPIA_HUMAN, DECOY1_53727 and CONT_PRDX1_HUMAN. It's safe bet to think that the first and third IDs correspond to the genes PPIA and PRDX1 in human. You can use the Ensembl REST APIs (which is language agnostic and works with Python) to convert e.g. Q8BG13 into many standard xrefs or the BiomaRt to convert one standard ID (e.g. TrEMBL Q8BG13) to another standard ID (MGI symbol). Perhaps even better, you can use the EBI search through the EB-eye (REST) as in this case you don't need to know Q8BG13 is a TrEMBL in mouse, info you need to know beforehand for BiomaRt and the Ensembl REST APIs.

ADD REPLYlink modified 3.4 years ago • written 3.4 years ago by Denise - Open Targets5.0k

There is a legitimate question as to whether DAVID is actually being actively looked after or the site is running in some sort of autopilot mode. Efforts to get a response from emails associated with DAVID have failed. Last update to DAVID was possibly done back in 2010.

ADD REPLYlink modified 3.4 years ago • written 3.4 years ago by genomax73k

Possibly HGNC ID's.

ADD REPLYlink written 3.4 years ago by genomax73k

@genomax2 stay away from DAVID, means for all functions ? like pathway analysis etc ? do you have any idea on how to use the HGNC based on python ? however, it is not funny website because you must submit the list and wait for their email to be sent

ADD REPLYlink modified 3.4 years ago • written 3.4 years ago by Learner 180

Possibly, long as you keep the possibility in mind that the data may not be recently updated. There have been past discussions about DAVID DAVID pathway analysis and Is the DAVID knowledgebase not updated any more? You can search biostars via google to get python based solutions for ID conversions.

ADD REPLYlink modified 3.4 years ago • written 3.4 years ago by genomax73k

You can use HGNC's REST access for programmatic queries: http://www.genenames.org/help/rest-web-service-help

ADD REPLYlink written 3.4 years ago by genomax73k
1
gravatar for Girolamo
3.4 years ago by
Girolamo140
Italia
Girolamo140 wrote:

DAVID is outdated i suggest to not use it. what about using http://www.uniprot.org/uploadlists/ to convert the gene id to the desired format

ADD COMMENTlink written 3.4 years ago by Girolamo140
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 786 users visited in the last hour