Entering edit mode
9.0 years ago
ymukhiddin
•
0
I am trying to build a classification model using logistic regression with regularization based on a combined methylation and gene expression datasets. Data I am considering now is for COAD cancer from TCGA. I am going to combine selected CpG sites and gene expressions as one dataset. But, I am not sure what is the best/appropriate approach to make methylation betas and expression intensities have the same scale.
Thanks in advance for the answers.
Thank you. I forgot about M-vales. What do you think, if I combine M-values with gene expressions and then standardize sample-wise, for example in such a way: (feature_val - median)/IQR, would it be ok?