GWAS analysis with logistic regression in R
8.2 years ago
a.elriedy • 0

Hi, I have this Excel file (example below) with 191 SNPs (HotSpot.ID) and 201 subjects (EK...), and I would like to rearrange it into a format suitable for running logistic regression (or lasso) in R. How do you think the data should be arranged so that I can fit a logistic regression model to it? And are there any packages that can take this Excel sheet directly?

ID      Cohort  Sex     HotSpot.ID  Genotype  Allele.1  Allele.2
EK001   normal  female  rs1037715   CT        C         T
EK001   normal  female  rs1042173   AA        A         A
EK001   normal  female  rs1051730   AG        A         G
EK001   normal  female  Promotor    SL        S         L
EK001   normal  female  rs1042114   TT        T         T
EK001   normal  female  rs678849    TT        T         T
EK001   normal  female  rs2298896   TT        T         T
EK001   normal  female  rs12749204  AA        A         A
EK001   normal  female  rs2234918   CT        C         T
EK001   normal  female  rs204076    AT        A         T
EK001   normal  female  rs1506700   TT        T         T
EK001   normal  female  rs3121147   TT        T         T
EK001   normal  female  rs1344694   GG        G         G
GWA R • 2.5k views
8.0 years ago
mastal511 ★ 2.1k

If you save the Excel worksheet as a delimited .txt file, you can read it into R with read.table(), which returns a data.frame that you can convert to a matrix if needed. As for what format to use with lasso, you would have to read the documentation of the lasso package you choose to find out.
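
To make the rearrangement concrete, here is a minimal base-R sketch of one possible layout: one row per subject, one column per SNP, with each genotype coded as the number of copies of the less frequent allele (an additive coding, which is an assumption and not something the question specifies). The toy data, the file name `genotypes.txt`, the second cohort label `"case"`, and the helper `count_minor()` are all hypothetical; only the column names come from the posted example.

```r
# With the real data, something like this would replace the toy data below
# (the file name is hypothetical):
# long <- read.table("genotypes.txt", header = TRUE, sep = "\t",
#                    stringsAsFactors = FALSE)

# Toy long-format data mimicking the posted layout (all values are made up).
set.seed(1)
n_subj <- 40
snps   <- c("rs1037715", "rs1042173", "rs1051730")
subjects <- data.frame(
  ID     = sprintf("EK%03d", seq_len(n_subj)),
  Cohort = rep(c("normal", "case"), length.out = n_subj),  # "case" is an assumed label
  Sex    = sample(c("female", "male"), n_subj, replace = TRUE),
  stringsAsFactors = FALSE
)
# merge() with no shared columns gives every subject x SNP combination.
long <- merge(subjects, data.frame(HotSpot.ID = snps, stringsAsFactors = FALSE))
long$Allele.1 <- sample(c("C", "T"), nrow(long), replace = TRUE, prob = c(0.7, 0.3))
long$Allele.2 <- sample(c("C", "T"), nrow(long), replace = TRUE, prob = c(0.7, 0.3))

# Hypothetical helper: copies (0, 1 or 2) of the less frequent allele of one SNP.
# Ties are broken alphabetically; a monomorphic SNP would be coded as all 2s
# and should be dropped before modelling.
count_minor <- function(a1, a2) {
  freq  <- table(c(a1, a2))
  minor <- names(freq)[which.min(freq)]
  (a1 == minor) + (a2 == minor)
}

long$dosage <- NA_integer_
for (snp in unique(long$HotSpot.ID)) {
  rows <- long$HotSpot.ID == snp
  long$dosage[rows] <- count_minor(long$Allele.1[rows], long$Allele.2[rows])
}

# Long -> wide: subjects in rows, SNPs in columns (a numeric matrix).
geno <- tapply(long$dosage, list(long$ID, long$HotSpot.ID), sum)

# One row of subject-level covariates per subject, in the same order as geno.
pheno <- unique(long[, c("ID", "Cohort", "Sex")])
pheno <- pheno[match(rownames(geno), pheno$ID), ]
rownames(pheno) <- pheno$ID
wide <- cbind(pheno, as.data.frame(geno))
wide$case <- as.integer(wide$Cohort == "case")
wide$Sex  <- factor(wide$Sex)

# One logistic regression per SNP, adjusted for sex; keep the SNP p-value.
pvals <- sapply(snps, function(snp) {
  fit <- glm(reformulate(c(snp, "Sex"), response = "case"),
             data = wide, family = binomial)
  coef(summary(fit))[snp, "Pr(>|z|)"]
})
print(pvals)
```

The `wide` data frame is the shape `glm()` expects, and `geno` is a plain numeric matrix of the kind that lasso packages such as glmnet usually take as predictors alongside a response vector like `wide$case`; the documentation of whichever package you pick has the exact requirements. Non-SNP markers such as the `Promotor` row (alleles S and L) go through the same coding, but whether an additive coding makes sense for them is a separate question.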
