Question: Kegg Id Vs Cog Id, And The Best Method For Large Batch Id Assignment?
gravatar for JacobS
6.7 years ago by
Cleveland, Ohio
JacobS900 wrote:

I have a large RNA-Seq dataset with reads of about 200bp long. I've already used other tools to annotate these reads with gi numbers, but now want to associate them with KEGG/COG IDs for pathway analysis. Can someone please help me understand the difference (and different uses) for KEGG and COG IDs and what is the best way for large batch (on the scale of millions) annotation?


pathway annotation kegg • 7.1k views
ADD COMMENTlink modified 4.1 years ago by Biostar ♦♦ 20 • written 6.7 years ago by JacobS900
gravatar for Neilfws
6.7 years ago by
Sydney, Australia
Neilfws48k wrote:

Not a complete answer to your questions, but with regard to understanding the differences:

COG was a NCBI project to classify proteins from sequenced genomes. It is no longer maintained and you should probably not use it. If you need to know more:

KEGG is an altogether larger, actively-maintained project. You might think of it as an attempt to create a systems biology database. They use their own annotation and clustering pipeline to assign IDs, called the KEGG Orthology system. Here's a key KEGG publication.

Many software tools have been built around KEGG, for example in R/Bioconductor.

ADD COMMENTlink written 6.7 years ago by Neilfws48k

Great, very helpful post!

ADD REPLYlink written 6.7 years ago by JacobS900

@ Neilfws, I see this post is 2-year-old. I was wondering could you update with new perspectives. I was going through KEGG Vs COG, I am finding this publication PMID-25428365. So now, is it better to use KEGG or COG? Thanks.

ADD REPLYlink modified 3 months ago by RamRS25k • written 4.6 years ago by swapnil.doijad0
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 706 users visited in the last hour