I'm trying to understand the logic behind the Gene Ontology annotations.
Let's take one gene, for example:
ENSG00000198570. When passed into BioMart, it tells me there are three GO term accession IDs. Visualized within the GO tree, they look like this:
All three of them are offspring of
biological_process; two of them are offspring of
|Ultimately, I want to be able to analyze quite a big set of genes and see whether they cluster into big and/or small groups with same function. Therefore, e.g., if both are reported as offspring of
Right now the only option seems to be to traverse the GeneOntology XML, bottom to top, for each GO term, but it's stupidly inefficient. Maybe there's something obvious I'm missing or there's a piece of software out there that can do just what I need?
I hope what I'm saying is making sense to you.