How to get uniprot ID or protein length from Entrez gene id or gene name?
1
Hi,
I have to calculate the protein length from a list of Entrez gene ids and gene names. I also need to get the uniprot IDs. I tried uniprot's API to get data but it returns more than one result for gene name (HUGO gene name). Is there any way to get these data?
As an examples;
Genes :KRAS, TP53,
Entrez gene id: 3845, 7157
panther
python
David
uniprot
API
• 913 views
Using Entrezdirect :
$ esearch -db protein -query "Homo sapiens [ORGN] AND srcdb refseq [PROPERTIES]" | esummary | xtract -pattern DocumentSummary -element Gi,Caption,Title,Slen
2057283081 NP_001382313 coiled-coil domain-containing protein 30 isoform 5 [Homo sapiens] 710
2057283079 NP_001382312 coiled-coil domain-containing protein 30 isoform 5 [Homo sapiens] 710
2057283077 NP_001382308 coiled-coil domain-containing protein 30 isoform 4 [Homo sapiens] 752
2057283075 NP_001382303 intercellular adhesion molecule 3 isoform 5 [Homo sapiens] 352
2057283072 NP_001382311 coiled-coil domain-containing protein 30 isoform 5 [Homo sapiens] 710
2057283070 NP_001382304 intercellular adhesion molecule 3 isoform 6 [Homo sapiens] 256
2057283066 NP_001382309 proline-rich protein 33 [Homo sapiens] 479
2057283062 NP_001382314 coiled-coil domain-containing protein 30 isoform 5 [Homo sapiens] 710
2057283060 NP_001382305 intercellular adhesion molecule 3 isoform 7 [Homo sapiens] 211
2057275858 NP_001382307 uncharacterized LOC121725057 isoform 2 precursor [Homo sapiens] 170
You can download entire human proteome from UniProt by with this query. Click on Download
button to customize how you want the information downloaded.
Login before adding your answer.
Traffic: 2119 users visited in the last hour
Always post example ID's when asking questions about them. Without that information difficult to answer these questions.