Question: Setup for multiple strains, multiple conditions, and replicates
1
gravatar for montezuma
5.1 years ago by
montezuma10
United States
montezuma10 wrote:

Hello all,

I have RNA-Seq data from an experiment with multiple strains and conditions and wanted to clarify the correct setup for DEG using the tuxedo pipeline.

The experiment is looking at two C. elegans strains: wild-type and mutant, both water treated or drug treated, and triplicate for each, so 12 samples total (2 x 2 x 3). What I have done:

  1. I ran tophat for each sample, and then cufflinks separately for each sample.
  2. To compare drug treatment to water treatment, I cuffmerged all cufflinks files from the wild-type strain and then ran cuffdiff from that output. I did the same process for the mutant strain.
  3. This results in DEG for drug to water treatment in each strain separately, but I also want a comparison between strains. What would be the correct way of comparing the effect of drug treatment between the wild-type and mutant strain?


​Thanks for the help, much appreciated!

 

rna-seq tuxedo cufflinks • 1.5k views
ADD COMMENTlink modified 4.9 years ago by Sukhdeep Singh10k • written 5.1 years ago by montezuma10
0
gravatar for Sukhdeep Singh
4.9 years ago by
Netherlands
Sukhdeep Singh10k wrote:

I have a similar setup and people work differently with these kinds of setup depending on the tool and workflow they use. There is no straight answer to this, after testing and searching, I can share with you what I do

What I do is use Tuxedo suite for most of the analysis.

  1. Run Tophat individually on all samples and biological replicates (you can merge technical replicates before this step) which maps the reads.
  2. Run Cuffquant which generates .cxb profiles and these can be used later via cuffdiff.
  3. I completely skip the cuffmerge, because I am only interested in DEG not novel isoform discovery.
  4. Run Cuffdiff here I combine all the replicates and conditions together. Replicates and same conditions are seperated by a comma (,) where as different conditions are seperated by space in a time series manner.

For the last step, you can run several scripts each time answering different question

WT1rep1,WT1rep2,WT2rep1,WT2rep2 Cond1rep1,Cond1rep2,Cond2rep1,Cond2rep2 -> will give you DEG b/w WT and mutant strain

WT1rep1,WT1rep2 WT2rep1,WT2rep2 -> will give you the differences between WT strains

WT1rep1,WT1rep2 Cond1rep1,Cond1rep2 and WT2rep1,WT2rep2 Cond2rep1,Cond2rep2, DEG's from these two should have a good overlap to say your strains and replicates are good in terms of reproducibility and experimentation.

Also this might be helpful

RNA seq time course data experiment design

HTH

ADD COMMENTlink modified 11 months ago by _r_am31k • written 4.9 years ago by Sukhdeep Singh10k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1824 users visited in the last hour