Question: [python] Uniprot ID to Gene name
 
 
 

Hi, I've got a huge list of UniProt IDs and I want to get the matching gene names. Do you know how to do that in Python? (I'm currently searching with Biopython...) Thanks!

Yo.


See related post here: http://biostar.stackexchange.com/questions/22/gene-id-conversion-tool/8107#8107

written 2.0 years ago by Casey Bergman

2 answers

 
 
 
 

Perhaps something like this:

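from Bio import SwissProt

# Iterate over every entry of a local SwissProt flat file
with open('uniprot_sprot.dat') as handle:
    for record in SwissProt.parse(handle):
        accessions = record.accessions  # list of accession numbers
        gene_name = record.gene_name    # gene name information of the entry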

Chris

 

It's a good start, I think. But my input file looks like this: "Uniprot_ID--->position", one per line. So I just want to translate Uniprot_ID to Gene_name in the output. Does anything like that exist already? Thanks a lot!

written 2.0 years ago by Yo_O
 

Well, my code snippet was meant more as inspiration for how to access the uniprot_id and the corresponding gene_name from SwissProt. Once you have that mapping (e.g. as a dictionary), it should be easy to apply it to your problem setting.

written 2.0 years ago by Chris
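One way the dictionary idea above could look in practice is sketched here. The helper names, the placeholder accessions and gene names, and the tab separator are all assumptions for illustration; the real separator behind "Uniprot_ID--->position" is not stated in the thread, so it is left as a parameter.

```python
def build_accession_map(records):
    """Map every accession to its gene name.

    `records` is any iterable of (accessions, gene_name) pairs, e.g. collected
    from record.accessions and record.gene_name while parsing SwissProt.
    """
    acc_to_gene = {}
    for accessions, gene_name in records:
        for acc in accessions:
            acc_to_gene[acc] = gene_name
    return acc_to_gene


def translate_lines(lines, acc_to_gene, sep="\t", missing="NA"):
    """Replace the leading UniProt ID on each line with its gene name."""
    out = []
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue  # skip empty lines
        uniprot_id, found_sep, rest = line.partition(sep)
        gene = acc_to_gene.get(uniprot_id, missing)
        # Keep the rest of the line (the position) untouched
        out.append(gene + sep + rest if found_sep else gene)
    return out


# Hypothetical placeholder data, not real UniProt entries
records = [(["ACC001", "ACC002"], "GENE_A"), (["ACC003"], "GENE_B")]
acc_to_gene = build_accession_map(records)
translated = translate_lines(
    ["ACC001\t72\n", "ACC003\t1699\n", "ACC999\t5\n"], acc_to_gene
)
```

IDs that are not in the dictionary are written as "NA" here rather than dropped, so the output keeps one line per input line.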
 

Yep. I got what I wanted with this:

import urllib, urllib2  # Python 2 modules

url = 'http://www.uniprot.org/mapping/'
query = uniprot_id  # the accession to look up
params = {'from': 'ACC', 'to': 'ENSEMBL_ID', 'format': 'tab', 'query': query}
data = urllib.urlencode(params)
request = urllib2.Request(url, data)
response = urllib2.urlopen(request)
page = response.read(200000)

Then I've got a homemade dictionary id_Ensembl <-> geneName.

Thanks a lot for your answers, guys!

Yo.

written 2.0 years ago by Yo_O
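For readers on Python 3, the same request might be sketched roughly as follows. This is not the poster's code: the function names and placeholder IDs are hypothetical, the space-separated batch query and the "From/To" header in the tab response are assumptions, and the mapping URL is taken from the post and may no longer be served. The sketch only builds the request and parses a sample response, so it runs without network access.

```python
from urllib.parse import urlencode
from urllib.request import Request

MAPPING_URL = "http://www.uniprot.org/mapping/"  # URL from the post; may be retired


def build_mapping_request(uniprot_ids, url=MAPPING_URL):
    """Build (but do not send) a POST request for a batch of accessions."""
    params = {
        "from": "ACC",
        "to": "ENSEMBL_ID",
        "format": "tab",
        "query": " ".join(uniprot_ids),  # assumed: IDs separated by spaces
    }
    data = urlencode(params).encode("ascii")
    return Request(url, data)  # passing data makes this a POST


def parse_tab_mapping(text):
    """Parse a two-column tab response into {source_id: [target_ids]}.

    Assumes a 'From<TAB>To' header line, which is skipped if present.
    """
    mapping = {}
    for line in text.splitlines():
        parts = line.split("\t")
        if len(parts) != 2 or parts == ["From", "To"]:
            continue
        mapping.setdefault(parts[0], []).append(parts[1])
    return mapping


def to_gene_names(acc_to_ensembl, ensembl_to_gene):
    """Chain accession -> Ensembl IDs -> gene names, dropping unknown IDs."""
    return {
        acc: [ensembl_to_gene[e] for e in ens_ids if e in ensembl_to_gene]
        for acc, ens_ids in acc_to_ensembl.items()
    }


request = build_mapping_request(["ACC001", "ACC002"])
# Placeholder response text and dictionary, not real UniProt output
sample = "From\tTo\nACC001\tENSG_X\nACC002\tENSG_Y\nACC002\tENSG_Z\n"
genes = to_gene_names(
    parse_tab_mapping(sample), {"ENSG_X": "GENE_A", "ENSG_Y": "GENE_B"}
)
```

Sending the request would be a matter of `urllib.request.urlopen(request).read().decode()`, with the resulting text passed to `parse_tab_mapping`. Sending all IDs in one query avoids one HTTP call per accession, which matters for a huge list.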
 

Oops, I don't know how to paste code here... if an admin can edit it... thanks.

written 2.0 years ago by Yo_O
 
 
 
 

You can do this using the Retrieve function at www.uniprot.org (fourth tab in the top bar).

Upload your list of IDs.

Look for the small blue UniProtKB (number of retrieved) entries link and click it. Then use the customize display option to select only gene names, and click download as tab.

OK, not very Pythonic, but a few HTTP calls from Python would work.
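Once the tab file is downloaded, reading it back into Python could look something like this. The column headers "Entry" and "Gene names" are assumptions about the download and are exposed as parameters, as is the guess that the first whitespace-separated name in the gene column is the primary one.

```python
import csv
import io


def read_gene_names(handle, id_column="Entry", gene_column="Gene names"):
    """Return {accession: first listed gene name} from a tab-delimited file."""
    genes = {}
    for row in csv.DictReader(handle, delimiter="\t"):
        names = (row.get(gene_column) or "").split()
        # Entries without any gene name are kept with a None value
        genes[row[id_column]] = names[0] if names else None
    return genes


# Placeholder content standing in for a downloaded file
downloaded = io.StringIO("Entry\tGene names\nACC001\tGENE_A ALT_A\nACC002\t\n")
result = read_gene_names(downloaded)
```

With a real download, the `io.StringIO` object would be replaced by a file opened with `open(path, newline="")`.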

 