Retriving compounds name from KEGG IDs
3
1
Entering edit mode
4.8 years ago

Dear all, I have list of KEGG IDs for metabolites (compounds) and I want to get the name of the all the compounds from that list. So please suggest me what should I do??? Thanks

gene • 4.6k views
0
Entering edit mode

yes something like that but for my list of IDs. Thanks

0
Entering edit mode

Sorry, but what you are asking for are 10 lines of code in python(maximum) and even less in R

0
Entering edit mode

It is more helpful to provide actual code rather than a statement about how small the needed code is.

0
Entering edit mode

4
Entering edit mode
4.8 years ago
EagleEye 6.8k

I hope this helps,

1.make a file 'sample.list' with the list of compound IDs you have,

cat sample.list

C00001

C00002

C00003

C00004

C00005

C00006

C00007

2.And then run the following CURL command below.

curl -g -s -S http://rest.kegg.jp/list/compound | grep -f sample.list | sed "s/cpd\://" > sample_extracted.table.txt

SAMPLE OUTPUT:

C00001  H2O; Water
C00007  Oxygen; O2


I assume, I understand your question properly.

0
Entering edit mode

And you got me right. I am trying to run the above command but in windows it is saying that grep is not recognized as internal or external command., so is there any alternative for that. please let me know.

Thanks

1
Entering edit mode

You can install cygwin to get a unix like environment on windows or better still run linux in a virtual machine (e.g. virtual box).

0
Entering edit mode

You can follow as genomax2 suggested or simple is to run on Linux.

0
Entering edit mode

Please upvote if this helps, so that other people know that the problem has been solved.

0
Entering edit mode

I just upvoted. Thanks

0
Entering edit mode
4.8 years ago
mgalactus ▴ 750

You can either search for them in the KEGG database website, or use something more automated like the KEGG REST API (in this specific case you need the "get" method).

Suppose you are interested in the following three compound IDs:

• C00818
• C00140
• C00259

Simply go to this webpage (have a look at how the URL has been constructed): http://rest.kegg.jp/get/C00259+C00140+C00818

0
Entering edit mode

Dear Mgalactus, thank you for your reply. The method you suggested, gives the details of the compounds in query. but I have a list of more than 2000 metabolites IDs. In that case its very difficult to get the name and details. So is there any way to get the result output in tabular format like as in *.csv format.

0
Entering edit mode
4.8 years ago
Girolamo ▴ 140

like this? http://rest.kegg.jp/list/compound