Kegg Data Download
4
6
Entering edit mode
8.2 years ago
anika.menon22 ▴ 190

Hello,

I am a newbie in bioinfo. I would like to know how to download all the pathways of an organism from KEGG database using the KEGG API. Initially I had done it using the FTP but now its no more freely available. Please help me with this!

Thanks

kegg pathway api • 23k views
ADD COMMENT
0
Entering edit mode

Thanks, it is pretty helpful, I have another question about how to download a older version KEGG database (Homo sapiens Pathway) like Sep, 2015, really appreciate if anyone knows the answer!

ADD REPLY
0
Entering edit mode

I think would be better to write a new post, if you have additional/different questions. Not many people will read this post since is more than 3 years old.

ADD REPLY
14
Entering edit mode
8.2 years ago
mgalactus ▴ 760

Hi,

I don't know your informatic skills, but you could start by taking a look at the rather new KEGG rest API. If your organism is already inside the KEGG database retrieving all its pathways is relatively easy. Assume your organism is Homo sapiens (Kegg code "hsa"), you just need to open your browser and go to this page (notice the URL). This example gives you all the pathways AND the human genes associated to them.

To get info about a gene (for instance hsa:10) just go to this page (again, notice how the URL is built).

To get info about a pathway (for instance hsa:10) just go to this page.

You can also retrieve this data using a script, below a simple example in python

import urllib2
url = 'http://rest.kegg.jp/link/hsa/pathway'
data = urllib2.urlopen(url, timeout=20).read()
# Parse it!

If your organism is not inside the KEGG database, you first need to annotate it through KAAS, obtain the KO IDs and then do a search like this (note that there is a limit in the number of KO IDs that you can put in a single URL, so maybe you need to perform more than one search).

Hope this helps...

ADD COMMENT
2
Entering edit mode

@galactus8403: That was helpful. Thank you so much!

ADD REPLY
1
Entering edit mode

Thanks for the suggestion.

Could you explain the method in which the KO ids can be used to retrieve the EC number information from KEGG

ADD REPLY
1
Entering edit mode

That requires a couple of steps: first retrieve the reaction IDs from KO IDs (example here) and then retrieve the entry for each reaction ID (example here); the EC number is under the field "ENZYME"
 

ADD REPLY
3
Entering edit mode
8.2 years ago

Take a look to that biostar post as well A new REST based API for KEGG: Kyoto Encyclopedia of Genes and Genomes

ADD COMMENT
1
Entering edit mode

The ipython notebook you provide there is really cool, congrats!

ADD REPLY
1
Entering edit mode

@Manu: Thanks a ton for the link. :)

ADD REPLY
0
Entering edit mode

Thanks, I'm glad some people find it useful at last. That encourages me to really start feeding my blog :-)

ADD REPLY
1
Entering edit mode
5.0 years ago
Guangchuang Yu ★ 2.5k

you can use clusterProfiler in R:

devtools::install_github("guangchuangyu/clusterProfiler")
library(clusterProfiler)
hsa_kegg = download_KEGG('hsa')
ADD COMMENT
0
Entering edit mode

I have installed the package from Bioconductor and library it, but the function "download_KEGG" is not contained in the package. I wonder why?

ADD REPLY
0
Entering edit mode
5.1 years ago
EagleEye 7.0k

You can use GeneSCF to download with simple command (Only retrieves the updated data- real-time),

A: focus on specific pathway

A: Gene ontology in sheep

./prepare_database -db=[GO_all|GO_BP|GO_MF|GO_CC|KEGG|REACTOME|NCG] -org=[see,org_codes_help]

Gene Set Clustering based on Functional annotation (GeneSCF)

http://genescf.kandurilab.org/

ADD COMMENT

Login before adding your answer.

Traffic: 1090 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6