Hi all,
I've performed a GO enrichment analysis using goseq. The output of goseq()
tells you the enriched categories with how many significant genes are in the categories etc, but it doesn't give you the ID of genes that are in any of those categories.
Obviously you can use getgo()
to retrieve GOs for genes of interest, but is there a way to get a column of the genes in each of the enriched GO terms using the goseq()
function.
So my desired output would be:
category over_represented_pvalue under_represented_pvalue numDEInCat numInCat term ontology Ensembl ID
GO:0000786 2.143408e-16 1 12 43 nucleosome CC. ENSG00000112655, ENSG00000158483 etc..
With the "Ensembl ID" column is the one I'm looking to add
Any help would be greatly appreciated!
Exactly what I needed! Thank you!
Hi all, I am trying to adapt this code for my work but could not able to do so. I am working on rice which is a non-native organism for goseq. I am new in using R, can anyone suggest how to modify this code for non-native organism? I have run the goseq smoothly for rice but adding the gene column is tricky business for me. I have category mapping, pwf and GO.wall files.
I am not sure, maybe
getgo()
doesn't support your genome. Could you provide a bit more details about your species and code/error?Hello there,
I am commenting to ask if you ever found a solution to this problem. I am currently working my way through goseq analysis of a non-native organism myself (nicotiana benthamiana), and my last hurdle is to try and retrieve the differentially expressed genes associated with GO terms that are of interest to us.
Thanks
Hello there,
I am commenting to ask if you ever found a solution to this problem. I am currently working my way through goseq analysis of a non-native organism myself (nicotiana benthamiana), and my last hurdle is to try and retrieve the differentially expressed genes associated with GO terms that are of interest to us.
Thanks