infer top level go category from GO:ID
2
0
Entering edit mode
6.0 years ago
Carambakaracho ★ 3.2k

Hi folks,

I integrate proteome annotation from the eggnog emapper into a structured database. I'm getting annotations in the form described on the github emapper page. The GO terms look in practice like this:

GO:0005575,GO:0005623,GO:0005886,GO:0016020,GO:0044464,GO:0071944

for the integration I need to map it to one of the principal categories

  • P (biological process)
  • F (molecular function)
  • C (cellular component)

Which ranges of GO categories are part of which of these group? Is there some explanation to this? What I found is the GAF 2.1 format definition which requires the definition of the Aspect . Is there some resource available that can map GO category to it's toplevel?

Thanks!

gene ontology GO eggnog-mapper • 2.8k views
ADD COMMENT
2
Entering edit mode
6.0 years ago

Hi Carambakaracho

API's to the rescue! ;)

I suppose this project being part of a large software development. Integrating REST API's should work. Never used one but have a look at this GO API. Might be of some help.

ADD COMMENT
0
Entering edit mode

Hi Vijay Lakhujani, great idea! Thanks for the link, I didn't know their API - I'll try to give it a go

ADD REPLY
1
Entering edit mode
6.0 years ago
Tm ★ 1.1k

From the emapper.annotations file, you can take 1st (query name) and 6th (GO terms) column in tab delimited format and can upload the same in WEGO. It will assign main category to these GO terms and will also plot graph.

ADD COMMENT
0
Entering edit mode

Hi Toralmanvar, thanks for your solution but I'm looking for something for easier general programmatic access without manual upload steps - it's supposed to become part of a more general pipeline for genome integration into a database.

ADD REPLY

Login before adding your answer.

Traffic: 2588 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6