Question: z score transformation from signal intensities of raw microarray data
iphoenix2100 (European Union) wrote, 4.1 years ago:

Dear all,

I am slowly making the transition from GWAS findings to expression datasets. We are using an online repository of microarray data from different donors, arrayed for different regions of the brain. Basically, we are taking a set of genes from the initial GWAS findings to see how they affect the expression pattern in different regions of the brain.

I came across a method of z-score transformation of the values in raw microarray data to judge whether a gene is up- or down-regulated: z-scores are calculated by subtracting the overall average intensity of the genes on the microarray from the intensity of each individual gene within a single experiment, and dividing the result by the standard deviation (SD) of all the measured intensities.

Of the multiple headers in the raw data, I believe the formula is referring to gProcessedSigError for the SD and gMeanSignal for the signal intensity of the genes on the array.

My question: are these two columns enough to calculate the z-score, or am I missing something? Sorry for such a specific and basic question. Any suggestions will be of great help.



Some other columns that may be relevant:

gSurrogateUsed    gIsFound    gProcessedSignal    gProcessedSigError    gNumPixOLHi    gNumPixOLLo    gNumPix    gMeanSignal    gMedianSignal    gPixSDev    gPixNormIQR    gBGNumPix    gBGMeanSignal    gBGMedianSignal    gBGPixSDev    gBGPixNormIQR    gNumSatPix
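
Read literally, the formula quoted above needs only one intensity column per array: the mean and the SD are both computed across all features of that array, not read from a per-feature error column. A minimal sketch in R, assuming a toy table with made-up values; the probe labels are hypothetical, and whether gMeanSignal or gProcessedSignal is the better input (and whether to log-transform first) is left open here:

```r
# Toy stand-in for one array's feature table; all values are made up.
features <- data.frame(
  probe       = paste0("probe", 1:6),          # hypothetical probe labels
  gMeanSignal = c(120, 95, 310, 150, 80, 205)  # hypothetical intensities
)

# Mean and SD are taken over ALL features on this single array.
array_mean <- mean(features$gMeanSignal)
array_sd   <- sd(features$gMeanSignal)

# z-score of each feature relative to the rest of the same array.
features$z <- (features$gMeanSignal - array_mean) / array_sd

print(features)
```

A positive z then means the feature lies above the array-wide average and a negative z below it; on its own this does not compare one brain region or donor with another.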




microarray • 2.1k views
Manvendra Singh (Berlin, Germany) wrote, 4.1 years ago:
# First you need to process your data. If it is an Illumina microarray, then:
library(lumi)  # load the lumi package

# Read the non-normalized export; the patterns pick out the matching columns
dati <- lumiR("non-normalized_data.txt",
              columnNameGrepPattern = list(exprs = "AVG_signal",
                                           se.exprs = "BEAD_STERR",
                                           beadNum = NA,
                                           detection = "Detection Pval"))
dati.T <- lumiT(dati, method = "log2")  # transform data to log2
dati.qnorm <- lumiN(dati.T, method = "quantile", verbose = FALSE)  # quantile normalization
data <- exprs(dati.qnorm)  # get the expression table of your data

# Now check that
colnames(data)   # are your sample names, and
row.names(data)  # are your gene names

# Then the z-score calculation is
zscores <- scale(data)  # scale() standardizes each column, i.e. each sample across its genes
# For the z-score of each gene across the samples, use t(scale(t(data))) instead
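
Base R's scale() centers and scales the columns of a matrix, so with genes in rows and samples in columns the direction of the z-score depends on whether the matrix is transposed first. A small sketch with a made-up 4 x 3 matrix (gene and sample names are hypothetical) showing both directions:

```r
# Toy expression matrix: 4 genes (rows) x 3 samples (columns); values are made up.
data <- matrix(c(8.1, 9.4, 7.2, 10.3,
                 8.4, 9.1, 7.9, 10.0,
                 7.8, 9.9, 7.0, 10.6),
               nrow = 4,
               dimnames = list(paste0("gene", 1:4), paste0("sample", 1:3)))

# Column-wise: each sample is centered and scaled across its genes
# (a gene's position relative to the other genes on the same array).
z_by_sample <- scale(data)

# Row-wise: each gene is centered and scaled across the samples
# (a gene's level in one sample relative to the other samples).
z_by_gene <- t(scale(t(data)))

print(round(z_by_sample, 2))
print(round(z_by_gene, 2))
```

Which direction is appropriate depends on the comparison being made: the within-array definition in the question corresponds to the column-wise version, while comparing one gene between brain regions corresponds to the row-wise version.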





Thanks, Mr. Singh.

It seems the data in the repository are all normalized microarray expression values, or at least that is what the readme file of the data says.

What options do I have in that case? Can I calculate the z-score by the method I mentioned in my question or not? Thanks again for your suggestions.

written 4.1 years ago by iphoenix2100