Extracting List Of Genes Associated With A Pathway In Kegg
2
10
Entering edit mode
9.3 years ago
Dataminer ★ 2.7k

Hi!

Is their a way to extract a list of genes present in a KEGG pathway.

For instance if I want to extraxct list of genes associated with Pathways of cancer, how can I do it?

Thank you

pathway • 25k views
0
Entering edit mode

Is it possible to extract a list of genes of a particular pathway without specifying an organism? For exemple, all genes related to methane pathway used in bacteria...

Thank you.

12
Entering edit mode
9.3 years ago

You can use the KEGG web service (http://www.kegg.jp/kegg/docs/keggapi.html). For exampe:

http://rest.kegg.jp/get/hsa05200

That will give you the entire cancer pathway entry.

Using the linked entries service, you can do this:

That'll give you all the genes linked to the cancer pathway.

I am not sure how to post to the web service. I don't think you can. So if you want to get the individual gene entries, you'll have to parse the gene names with a script and use the get service like this (10 at a time):

http://rest.kegg.jp/get/hsa:10000+hsa:1017+hsa:1019

I haven't used this myself, but here is also a bioconductor package: KEGGREST

http://www.bioconductor.org/packages/2.12/bioc/html/KEGGREST.html

0
Entering edit mode

Kindly how can we detect pathways for the list of gene groups using R ?? I used KEGGREST but , i do not know if its retrieve the pathway for a list of gene ID's from Affymetrix.? please need help.

0
Entering edit mode

Since the text in http://rest.kegg.jp/link/genes/hsa05200 is regular, so we can just replace "path:hsa04360 hsa: " with empty string "", then we can easily get the list of Entrez GeneID. path:hsa04360 hsa:10154 path:hsa04360 hsa:1020 path:hsa04360 hsa:10298 path:hsa04360 hsa:10371 path:hsa04360 hsa:10500 path:hsa04360 hsa:10501

5
Entering edit mode
9.3 years ago

using TOGOWS and Damian's example:

get the genes for hsa05200 as JSON

http://togows.dbcls.jp/entry/pathway/hsa05200/genes.json

[
{
"1871": "E2F3; E2F transcription factor 3 [KO:K06620]",
"23604": "DAPK2; death-associated protein kinase 2 [KO:K08803] [EC:2.7.11.1]",
"10319": "LAMC3; laminin, gamma 3 [KO:K06247]",
(...)
"4292": "MLH1; mutL homolog 1, colon cancer, nonpolyposis type 2 (E. coli) [KO:K08734]",
"1870": "E2F2; E2F transcription factor 2 [KO:K09389]",
"868": "CBLB; Cbl proto-oncogene, E3 ubiquitin protein ligase B [KO:K04707] [EC:6.3.2.19]",
"5155": "PDGFB; platelet-derived growth factor beta polypeptide [KO:K04359]",
"1495": "CTNNA1; catenin (cadherin-associated protein), alpha 1, 102kDa [KO:K05691]"
}

0
Entering edit mode

I' trying to get all the genes that metabolism related using the same method you suggested. However, I get an empty list.

http://togows.dbcls.jp/entry/pathway/hsa01100/genes.json

can you please tell me what is wrong?

1
Entering edit mode

Nothing is wrong! You are getting an empty list because there is no gene listed under this entry (i.e. hsa01100). You can check it here and here that no genes are listed.