GO annotation with entrez gene id for non-model organism
1
0
Entering edit mode
4.8 years ago
bifStudent • 0

Hi everybody,

I am working on a shiny project which supports users lacking programming skills to perform GO enrichment analysis. Unfortunately I am struggling to enable the user to use entrez gene IDs of a non-model organism (institution is working with plant organisms and also doing microbiome analysis).

My first attempt was to convert IDs to ensembl IDs and get GO terms via biomaRt. But it seems, that resulting table is not complete and there are a lot of ensembl ids with GO terms and without entrez gene ID. I also want to avoid the use of rentrez::entrez_search, because without API-key it takes to long.

Another idea would be to use homolog of model-organism e.g. Arabidopsis thaliana for plants (didn't try that so far) but I hope for a better solution.

Thanks in advance for any help/suggestions!

R GO non-model organism entrez gene id • 1.9k views
ADD COMMENT
1
Entering edit mode
4.8 years ago
ashish ▴ 680

Usually one would BLAST the sequences and then use those results to retrieve GO terms. BLAST2GO uses this approach but the BLAST part of it is very slow and I am not sure if there is a way to access BLAST2GO using R. You can create your own annotation package for any organism using the AnnotationForge package from bioconductor. See James's answer in this thread for an example.

ADD COMMENT
0
Entering edit mode

Thanks a lot for the suggestion of AnnotationForge. It looks like this is the solution to my problem and I am trying to integrate this. It also seems to take a long time, but I guess this is the price for it.

ADD REPLY

Login before adding your answer.

Traffic: 1903 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6