Question: Cuffmerge: Which samples to use for creating a merged GTF file?
3.0 years ago by
Explorer60 wrote:

Hi,

I have questions about merging files with cuffmerge and using groups in cuffdiff. I have 5 samples in condition 1, 4 samples in condition 2, and 3 normal samples. I would like to carry out a differential expression analysis using cuffdiff, so I have the following questions:

i) Can cuffdiff carry out differential analysis for three conditions (Condition1, Condition2 and Normal) together? I am asking in order to find out the best way to pass these conditions to cuffdiff: pairwise, or all conditions together?

ii) If I choose to go with pairwise comparisons (Condition1 vs Condition2, Normal vs Condition1, or Normal vs Condition2), is it correct to use the transcripts.gtf files from all 12 samples to create a merged GTF file that can then be used in any of these comparisons? Thanks for your help.

modified 3.0 years ago by Dan Gaston7.1k • written 3.0 years ago by Explorer60

If you had read the cuffdiff manual, you would know that cuffdiff can take as many samples (conditions) and replicates as needed. From the manual:

cuffdiff [options] <transcripts.gtf> \
<sample1_replicate1.sam[,…,sample1_replicateM.sam]> \
<sample2_replicate1.sam[,…,sample2_replicateM.sam]> … \

where N = 3 in your case. cuffdiff will then automatically test for differential expression between all possible pairs of conditions.
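As a rough sketch of what that looks like for the 5 + 4 + 3 design in the question: replicates of one condition are joined with commas, and conditions are separated by spaces. The file names (`cond1_rep1.bam`, `merged.gtf`, ...) and the output directory `diff_out` are made up for illustration; `-o` and `-L` are the cuffdiff options for the output directory and the condition labels. The script only prints the command, so it can be checked before running it.

```shell
# Join all arguments into one comma-separated string (replicates of one condition).
join_by_comma() {
    (IFS=,; printf '%s\n' "$*")
}

# Hypothetical alignment files; replace with your own SAM/BAM files.
cond1=$(join_by_comma cond1_rep1.bam cond1_rep2.bam cond1_rep3.bam cond1_rep4.bam cond1_rep5.bam)
cond2=$(join_by_comma cond2_rep1.bam cond2_rep2.bam cond2_rep3.bam cond2_rep4.bam)
normal=$(join_by_comma normal_rep1.bam normal_rep2.bam normal_rep3.bam)

# One space-separated argument per condition, labels in the same order.
cmd="cuffdiff -o diff_out -L Condition1,Condition2,Normal merged.gtf $cond1 $cond2 $normal"

# Dry run: print the command instead of executing it.
echo "$cmd"
```

Once the printed command looks right, it can be pasted into the terminal as is.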

In this case I would create a GTF file from all samples in one go.

ii) For the pairwise comparisons, I think it would be best to create a separate GTF file for each comparison. Otherwise the merged GTF file can contain false entries that belong to a different condition.
modified 3.0 years ago • written 3.0 years ago by Assa Yeroslaviz1.2k
3.0 years ago by
Dan Gaston7.1k wrote:

You should always merge all of the transcript GTF files together with cuffmerge, regardless of which comparisons you will do. This creates a master assembled transcriptome containing all of the possible transcripts, which then acts as a new transcriptome reference for your comparisons.
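A minimal sketch of that merge step, assuming one Cufflinks output directory per sample (the directory names, `reference.gtf` and `genome.fa` are hypothetical). cuffmerge takes a text file listing one assembly GTF per line; `-g` and `-s` are optional and supply a reference annotation and the genome sequence. The script writes the list and prints the command rather than running it.

```shell
# Build the assembly list: one transcripts.gtf path per line, all 12 samples.
list=assemblies.txt
: > "$list"
for i in 1 2 3 4 5; do echo "cond1_rep${i}/transcripts.gtf" >> "$list"; done
for i in 1 2 3 4; do echo "cond2_rep${i}/transcripts.gtf" >> "$list"; done
for i in 1 2 3; do echo "normal_rep${i}/transcripts.gtf" >> "$list"; done

# Hypothetical reference files; drop -g/-s if you do not have them.
cmd="cuffmerge -o merged_asm -g reference.gtf -s genome.fa $list"

# Dry run: print the command instead of executing it.
echo "$cmd"
```

With `-o merged_asm`, the merged annotation is written to `merged_asm/merged.gtf`, which is the file to pass to cuffdiff for every comparison.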

@Frymor answered your first question, as that information is in the cuffdiff manual.
