I'm not familiar with the field of real time PCR and how to deal with the data. However, I need to reproduce the result (Fig. 1A) of the paper as follow:
Basically, it is a hierarchical clustering on single cell qPCR data.
I follow the methods part of the paper for the Table S4 in the paper's SI:
"All Ct values (Tables S4 and S5) obtained from the BioMark System were converted into relative expression levels by subtracting the values from the assumed baseline value of 28. Cells with low or absent endogenous control gene expression levels were removed from analysis (∼10%). The resulting values were at times normalized to the endogenous control by subtracting, for each cell, the average of its Actb and Gapdh expression levels. As the Ct scale is logarithmic (a difference of one Ct corresponds to a doubling of measured transcript), a subtraction of the average of two genes on this scale corresponds to taking the geometric mean on a linear scale. Data shown in Figures 1A–1C, 2A, 3A, 3B, 4A, 5A, and 5D have been normalized against endogenous controls."
After subtracting from 28 and normalized the data by endogenous genes, the resulting values are mostly negative.
I think maybe I need to further normalize the data by calculating the zscore over each gene and also over each sample, but I'm not sure if these normalization make sense in this field or nor?
Really appreciate if anyone could have a look at the data and let me know how to reproduce it.