Question: How to annotate GEO microarray datasets with GEOquery?
3
gravatar for grayapply2009
2.5 years ago by
grayapply2009170
United States
grayapply2009170 wrote:

Hi, I read the GSE file into R as follows.

gse <- getGEO("GSE4928", GSEMatrix=TRUE)

Now I want to convert all probe IDs to gene symbols and write the entire annotated dataset back to my computer. What should I do? By the way, which dataset is stored in gse? SOFT formatted family file, MINiML formatted family file or Series Matrix File?

microarray annotation geo • 1.9k views
ADD COMMENTlink modified 2.5 years ago by Sean Davis25k • written 2.5 years ago by grayapply2009170
3
gravatar for Sean Davis
2.5 years ago by
Sean Davis25k
National Institutes of Health, Bethesda, MD
Sean Davis25k wrote:

Take a look at this code:

gse = gse[[1]] # get just the first element in the list
head(fData(gse))
symbols = fData(gse)[,'Gene Symbol']

Now, you have the gene symbols. The gse object (after choosing the first element in the list using gse[[1]]) contains the information from the Series Matrix file AND the GPL file.

ADD COMMENTlink written 2.5 years ago by Sean Davis25k

Thank you Sean. Now how can I export the Series Matrix file with the probes replaced by gene symbols?

ADD REPLYlink written 2.5 years ago by grayapply2009170
1

After that, you can use:

expr_mat = exprs(gse)        # get the expression matrix
rownames(expr_mat) = symbols # Annotate the row names with gene symbols
ADD REPLYlink modified 2.5 years ago • written 2.5 years ago by Ce Gao10

Thank you, gaoce. That works perfectly. Where can I get the tutorial for this? The GEOquery document doesn't seem to have what you guys showed me above.

ADD REPLYlink written 2.5 years ago by grayapply2009170

Unfortunately, this solution is not a general one due to the fact that not all GEO series have gene symbol or other annotation. What that means is that it is best to understand why the steps above work so that when you have another data set, you can follow the logic to come up with your own solution. That said, I am always interested in documentation improvements, so it you'd like to contribute, GEOquery is on github where I can accept pull requests.

ADD REPLYlink modified 2.4 years ago • written 2.4 years ago by Sean Davis25k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1259 users visited in the last hour