4
6
Entering edit mode
9.7 years ago
anika.menon22 ▴ 190

Hello,

I am a newbie in bioinfo. I would like to know how to download all the pathways of an organism from KEGG database using the KEGG API. Initially I had done it using the FTP but now its no more freely available. Please help me with this!

Thanks

kegg pathway api • 25k views
0
Entering edit mode

Thanks, it is pretty helpful, I have another question about how to download a older version KEGG database (Homo sapiens Pathway) like Sep, 2015, really appreciate if anyone knows the answer!

0
Entering edit mode

I think would be better to write a new post, if you have additional/different questions. Not many people will read this post since is more than 3 years old.

14
Entering edit mode
9.7 years ago
mgalactus ▴ 760

Hi,

I don't know your informatic skills, but you could start by taking a look at the rather new KEGG rest API. If your organism is already inside the KEGG database retrieving all its pathways is relatively easy. Assume your organism is Homo sapiens (Kegg code "hsa"), you just need to open your browser and go to this page (notice the URL). This example gives you all the pathways AND the human genes associated to them.

To get info about a gene (for instance hsa:10) just go to this page (again, notice how the URL is built).

You can also retrieve this data using a script, below a simple example in python

import urllib2
# Parse it!


If your organism is not inside the KEGG database, you first need to annotate it through KAAS, obtain the KO IDs and then do a search like this (note that there is a limit in the number of KO IDs that you can put in a single URL, so maybe you need to perform more than one search).

Hope this helps...

2
Entering edit mode

@galactus8403: That was helpful. Thank you so much!

1
Entering edit mode

Thanks for the suggestion.

Could you explain the method in which the KO ids can be used to retrieve the EC number information from KEGG

1
Entering edit mode

That requires a couple of steps: first retrieve the reaction IDs from KO IDs (example here) and then retrieve the entry for each reaction ID (example here); the EC number is under the field "ENZYME"

3
Entering edit mode
9.7 years ago

Take a look to that biostar post as well A new REST based API for KEGG: Kyoto Encyclopedia of Genes and Genomes

1
Entering edit mode

The ipython notebook you provide there is really cool, congrats!

1
Entering edit mode

@Manu: Thanks a ton for the link. :)

0
Entering edit mode

Thanks, I'm glad some people find it useful at last. That encourages me to really start feeding my blog :-)

1
Entering edit mode
6.5 years ago
Guangchuang Yu ★ 2.6k

you can use clusterProfiler in R:

devtools::install_github("guangchuangyu/clusterProfiler")
library(clusterProfiler)

0
Entering edit mode

I have installed the package from Bioconductor and library it, but the function "download_KEGG" is not contained in the package. I wonder why?

0
Entering edit mode
6.6 years ago
EagleEye 7.4k

You can use GeneSCF to download with simple command (Only retrieves the updated data- real-time),

A: focus on specific pathway

A: Gene ontology in sheep

./prepare_database -db=[GO_all|GO_BP|GO_MF|GO_CC|KEGG|REACTOME|NCG] -org=[see,org_codes_help]


Gene Set Clustering based on Functional annotation (GeneSCF)

http://genescf.kandurilab.org/