Question: series_matrix.txt in GEO are they always the normalized values?
gravatar for Leite
19 months ago by
São Paulo - Brazil - Unifesp
Leite950 wrote:

Hello everyone,

I have a simple question, the file series_matrix.txt in GEO are always the normalized data of the study?



series_matrix.txt geo • 1.5k views
ADD COMMENTlink modified 19 months ago by Kevin Blighe55k • written 19 months ago by Leite950
gravatar for Kevin Blighe
19 months ago by
Kevin Blighe55k
Porto Alegre / London
Kevin Blighe55k wrote:

Hey Leite,

The answer is that, yes, the series matrix files should contain normalised, log2 values. However, the GEO provide situations in which these files may not contain normalised data:

GEO2R operates on Series Matrix files which contain data extracted directly from the VALUE column of Sample tables. Submitters are asked to supply normalized data in the VALUE column, rendering the Samples cross-comparable. The majority of GEO data do conform to this rule. GEO applies no further processing other than to perform a log2 transformation on values determined not to be in log space (see Options section). However, some studies, such as dual channel loop design data, may generate values that do not have a common reference and are not directly comparable. Some studies may contain Sample value data that are not normalized, or have a design such that the Samples were never intended to be directly compared. Yet other studies do not have sufficient replicate Samples to perform a robust statistical analysis. Users should examine the original Series to understand the experimental design, and check the 'Data processing' field or VALUE description in the original Sample records for information on what the values represent. The box plot feature on the Value distribution tab is provided to help users assess whether the distributions of values across Samples are median-centered, which is generally indicative that the data are normalized and cross-comparable.


When you obtain data, you should always check the distribution with box- and scatter plots, and histograms, in order to gauge whether thy are normalsed or not.


ADD COMMENTlink written 19 months ago by Kevin Blighe55k

Thank you so much my friend!

ADD REPLYlink written 19 months ago by Leite950
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 766 users visited in the last hour