Question: Standardization of gene expression and methylation dataset to the same scale
gravatar for ymukhiddin
5.6 years ago by
Czech Republic
ymukhiddin0 wrote:

I am trying to build a classification model using logistic regression with regularization based on a combined methylation and gene expression datasets. Data I am considering now is for COAD cancer from TCGA. I am going to combine selected CpG sites and gene expressions as one dataset. But, I am not sure what is the best/appropriate approach to make methylation betas and expression intensities have the same scale.

Thanks in advance for the answers.

ADD COMMENTlink modified 5.6 years ago by Sean Davis26k • written 5.6 years ago by ymukhiddin0
gravatar for Sean Davis
5.6 years ago by
Sean Davis26k
National Institutes of Health, Bethesda, MD
Sean Davis26k wrote:

I'd consider converting the methylation beta values to M-values and then standardizing (possibly using robust estimators such as median and IQR).


ADD COMMENTlink written 5.6 years ago by Sean Davis26k

Thank you. I forgot about M-vales. What do you think, if I combine M-values with gene expressions and then standardize sample-wise, for example in such a way: (feature_val - median)/IQR, would it be ok?

ADD REPLYlink written 5.6 years ago by ymukhiddin0
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1789 users visited in the last hour