Associate to GO Names the respective IDs
6
1
Entering edit mode
9.5 years ago
dago ★ 2.8k

I have a list of GO names. I want to associate each name to the respective GO IDs.

I can recover the information by looking the GO name on QuickGO, but I have thousends of names to combine with IDs.

Any idea how could I do that?

GO annotation • 3.2k views
ADD COMMENT
1
Entering edit mode
9.5 years ago

Using mysql:

$ mysql --user=genome --host=genome-mysql.cse.ucsc.edu -A -D go -e 'select acc,name from term where name in ("DNA replication","fatty-acyl-CoA binding")'
+------------+------------------------+
| acc        | name                   |
+------------+------------------------+
| GO:0006260 | DNA replication        |
| GO:0000062 | fatty-acyl-CoA binding |
+------------+------------------------+
ADD COMMENT
0
Entering edit mode
9.5 years ago
pld 5.1k

Most software that does this type of annotation will be available in the output, what was used to generate these? Outside of that you can write a pretty simple script to parse the go OBO file. http://geneontology.org/page/download-ontology

ADD COMMENT
0
Entering edit mode

I blasted my genomes against nr and loaded the results in Blast2Go to obtain GO annotation. Unfortunately, I miss the scripting part. Do you know if any package in Bioconductor is doing that?Or if there is any good link I could use to get these data?

ADD REPLY
0
Entering edit mode
9.5 years ago

using SPARQL:

PREFIX rdfs:<http://www.w3.org/2000/01/rdf-schema#>
SELECT DISTINCT ?go
WHERE {
    ?go rdfs:label "DNA replication"@en .
}

NB: depending on the version of GO that you are using, you may not need the '@en'

s-query --service=http://localhost:3030/go/query --output=text "SELECT DISTINCT ?go WHERE { ?go <http://www.w3.org/2000/01/rdf-schema#label> 'DNA replication'@en . }"

-------------------------------------------
| go                                      |
===========================================
| <http://purl.org/obo/owl/GO#GO_0006260> |
-------------------------------------------
ADD COMMENT
0
Entering edit mode

I'm not familiar with SPARQL, I think it's some kind of distributed database. You run s-query to connect to a service running on localhost:3030. That will require the user's computer to have some kind of database server running, could you elaborate on what's going on here? This looks like a tool I might like to get loaded on my workstation too!

ADD REPLY
0
Entering edit mode

Yes, because I have a local SPARQL server for GO but you could just as well use BioPortal SPARQL endpoint (http://sparql.bioontology.org/). It requires to register if you want to do it programmatically though.

I just happen to find SPARQL pretty convenient for querying GO and GOA.

ADD REPLY
0
Entering edit mode
9.5 years ago
EagleEye 7.5k

This GeneSCF tool has a folder called 'annotation' where you can find plain text files with process and ids:

Gene Set Clustering based on Functional annotation (GeneSCF)

ADD COMMENT
0
Entering edit mode
9.5 years ago
Parham ★ 1.6k

In R:

library(GO.db)

goterms= Term(GOTERM)

go_definition = merge(your_list_of_GOs, goterms, by.x = "category", by.y="row.names")
ADD COMMENT

Login before adding your answer.

Traffic: 1985 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6