Question: GEO Affymetrix microarray datasets: how are series matrix files obtained from CEL files?
gravatar for Ld_60
2.7 years ago by
Ld_6040 wrote:

Hi everyone,

I am working with microarray data from the GEO repository. I am trying to understand how to obtain the feature-sample matrix format, which is represented by the directly available "series matrix" file in GEO, from the CEL files (Affymetrix platforms)?

For example, considering the GPL96 microarray platform (i.e. Affymetrix Human Genome U133A Array), in the series matrix file, the samples are described by 22,283 features. Is there any package/command which allows to obtain such matrix from the CEL files? I tried using the gcrma normalization command (Bioconductor) on the CEL files for a dataset, but I got a number of features which is around 12,000.

Also, I am confused as to whether these features are called probes or probe sets, as I've seen different designations from different people?

Thanks a lot for your help.

ADD COMMENTlink written 2.7 years ago by Ld_6040

Did you try the GEOquery package in R?

ADD REPLYlink written 2.7 years ago by Benn8.1k

Hi, thank you very much for your answer, I will check that out!

ADD REPLYlink written 2.7 years ago by Ld_6040
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2001 users visited in the last hour