Question: Expression data missing in a GEO study
gravatar for William
6.1 years ago by
William4.7k wrote:

I am downloading  GEO datasets using GEOQuery in bioconductor.

For most GEO datasets this works well.

For GEO GSE39278 however no expression data is parsed from the GEO dataset.

The R code that I am using is below. The study is downloaded and the metadata is parsed succesfully. The expression table however only contains the sample id's and no probes or expression data.

Is there something wrong with the dataset or am I missing something in GEOquery?

geo_id <- "GSE39278"
gseList <- getGEO(geo_id)
gse <- gseList[[1]]
pd <- pData(gse)
expressionTable =  exprs(gse)


geoquery bioconductor geo • 2.9k views
ADD COMMENTlink modified 3.3 years ago by vegard nygaard200 • written 6.1 years ago by William4.7k
gravatar for vegard nygaard
3.3 years ago by
Oslo University Hospital, Norway
vegard nygaard200 wrote:

I found this old unanswered post while troubleshooting.

I had the same problem for some other GEO data sets (GSE46691, GSE79956, GSE79957, GSE62667). The explanation, given by GEO-support, for the missing data is that the file accessed by getGEO ("series_matrix" or "soft" in the GEO record) does not contain processed data for a few records, or only contain a subset of the data (for unclear reasons). However, full processed data may be provided as supplementary files instead.

The GEO data set queried by William,, does not seem to have processed data available at all (see "Processed data not provided for this record" at the bottom). However the CEL files are available.

So this is more a feature of GEO and not a bug in GEOquery.

ADD COMMENTlink written 3.3 years ago by vegard nygaard200
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1242 users visited in the last hour