I have performed a probe wise differencial methylation analysis from a set of beta values downloaded from de TCGA. I have obtained a set of differencially methylated probes (dmp) for two groups of samples, but their log2 fold change (log2FC) range from -0.3 to 0.4. Then, I have performed exactly the same analysis, but transforming the Beta values to M values using
lumi package. Now, the log2FC range from -3 to 4.
So, looking to M values results, it seems that the effect size of the probes is quite big, but, when I look to the Beta values result, then the effect seems quite low.
So, I was thinking to filter those probes with abs (log2FC) < 1, using the results of M values, but I am worried about their low log2FC when Beta values are used. Do you think it would be okay this approach or that the better is to use the log2FC from the Beta values results (which in my case seems too low)?
I see your point. However, if b-values are compressed at their extremes, then that wouldn't mean that a small change between extreme values of b-values (0.01 vs 0.02, for example) could actually be reflecting a higher biological effect?
No, I would not interpret it as such. When I said "compressed" I was referring to compressed with respect to M-values, not biologically.
These measurements arise from: hybridizing bisulfite-treated DNA to the probes of the array. Then, the b-value is computed as the ratio of methylated signal fluorescence with respect to total signal (meth + unmeth). Thus they reflect how much methylated DNA was detected by the array probes taking into account the total. Thus, in my opinion two similar and small proportions (such as your 0.01, 0.02 example) reflect mostly the same: that a very small proportion of the DNA was methylated. And to me, that would indicate that there is a very small biological difference between the samples (even if it were statistically significant).