Question: Regression analysis with high dimensional predictors
1
gravatar for M K
4.4 years ago by
M K490
United States
M K490 wrote:

Dear All,

I am conducting a research to study the effects of DNA repeats on gene expression. I have 20,000 observation for the gene expression and about 1200 DNA repeats (predictor variables) that effect gene expression. I need to build a multiple regression model for this study. I found some techniques for variable selection, for example LASSO regression. My question is there any other technique to do that or which is the best method for doing that. BTW, in my case the predictors variables P are less than the number of observation n. 

next-gen R • 1.1k views
ADD COMMENTlink modified 4.4 years ago by Sean Davis26k • written 4.4 years ago by M K490
0
gravatar for Sean Davis
4.4 years ago by
Sean Davis26k
National Institutes of Health, Bethesda, MD
Sean Davis26k wrote:

ElasticNet and LASSO are common approaches for penalized regression. You could also consider machine-learning approaches such as random forest and GBM. Unfortunately, I do not think there is a known "best" way to do what you want to do, so you'll probably need to experiment a bit. You'll also want to keep in mind that some of these methods make assumptions about the nature of your data (continuous, discrete, bell-shaped, missing values, etc.).

ADD COMMENTlink modified 5 months ago by RamRS27k • written 4.4 years ago by Sean Davis26k

Thanks, Sean. The dependent variable in my study which is gene expression is continuous and it is approximately normally distributed, but most of predictors are taking count as o or 1. I don't know if there any other assumption for the ElasticNet and LASSO that should be satisfied to run them for variable selection.

ADD REPLYlink modified 5 months ago by RamRS27k • written 4.4 years ago by M K490
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1203 users visited in the last hour