I have a dataset where I want to see effect of a drug on my patients who responded and not responded towards treatment. I collected their blood at three different time point or visit. For each patient I have their age and sex information with me. Now to perform differential expression analysis I used DESeq2 to perform time series analysis as I have collected blood at three different visit. I want to control age and gender effect on my data so I can see interaction between responder group and different time point. Here is the sample table and my DESeq2 design formula:
sample     Phenotype     visit     Age     Gender
1             NonResponder 1      42        female
2             NonResponder  2      42        female
3            NonResponder   3      42        female
4            NonResponder   1      49       female
5           NonResponder    2     49        female
6           NonResponder   3     49       female
7          NonResponder    1     27       male
8          NonResponder     2     27       male
9         NonResponder      3     27      male
10        Responder         1       77      female
11       Responder          2      77      female
12       Responder         3       77       female
13       Responder         1       51      male
14      Responder         2       51       male
15      Responder        3       51        male
16      Responder        1       47        male
17      Responder        2       47        male
18      Responder       3        47        male
So which design should I use to control age and gender effect on my data
**design 1:
dds=(design= ~age+gender+visit+phenotype+visit:phenotype+age:phenotype+gender:phenotype)
dds=DESeq(dds)****
**design 2:
dds=(design=~age+gender+visit+phenotype+visit:phenotype)
dds=DESeq(dds,test="LRT", reduced=~age+gender)****
I will highly appreciate help with this
Best,
Lalit
Dear Swbarnes2, Thank you so much for the reply. Actually its just a small data set which i presented in this forum. I have 10 non responder patient and 9 responder patient. I collected their blood at three different time point or visit. So in total I have total 57 samples for three visit. I did PCA analysis using phenotype age and visit information but I did not see separate cluster. Variation between PCA1 and PCA2 was not that much. PCA1 18% and PCA2 was 9%. But I am not sure which design I should use to see genes where visit and phenotype have effect and they are not affected by age and gender. I want to correct this data for age and gender. Should I use design 1 as full model dds=(design= ~age+gender+visit+phenotype+visit:phenotype+age:phenotype+gender:phenotype) dds=DESeq(dds)
or should I use design 2 as reduced model to correct my data for age and gender dds=(design=~age+gender+visit+phenotype+visit:phenotype) dds=DESeq(dds,test="LRT", reduced=~age+gender)
Best Regards, Lalit