Question: How to account for between-subjects variability in this particular RNA-seq study?
gravatar for lwc628
6.4 years ago by
United States
lwc628220 wrote:

I am stuck with the correlated and independent data combined in one study. Here's my dilemma:

Say X is a drug(explanatory variable) and Y is a gene expression(response variable).

Normally, you would give out drugs to a half of your group(randomly chosen) and placebo to the rest, measure the gene expression for each person in the group, and conduct the differential expression analysis between two conditions using a standard package such as deseq.

However, let's say between-subjects variability is huge, so you also measured the gene expression prior to taking the pill as well. In other words, each subject has two data points(before and after the taking the pill). How do you incorporate this information into the analysis? I tried to subtract the 'before' value from after 'value' for each patient, and conduct t-test on those statistics between conditions(drug & placebo), but I am not sure if I am doing it right.

I am curious if there is a standard, already established way to do this? You can assume Y follows Poisson or negative binomial.

rna-seq • 1.6k views
ADD COMMENTlink modified 6.4 years ago by karl.stamm3.9k • written 6.4 years ago by lwc628220
gravatar for karl.stamm
6.4 years ago by
United States
karl.stamm3.9k wrote:

It's called a paired analysis, and any differential expression tool with GLM can do it. I've used edgeR for this successfully. You set up the design matrix to include subject ID as a covariate and it fits coefficients to those, leaving only the signal you're interested in as residual.


This can accommodate things like age and gender discrepancies, if you have >4 pairs. The only drawback is you can't use any RNA samples that aren't paired.


Your subtraction idea is similar, but use a standard package that knows how to handle rna-seq counts.

ADD COMMENTlink written 6.4 years ago by karl.stamm3.9k

Also, check out the edgeR documentation. The authors have included many example analyses, possibly some that are similar to what you described.

To see the user's guide, you can do this inside R:

# load the library

# open the guid

ADD REPLYlink written 6.4 years ago by Ann2.3k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1056 users visited in the last hour