Question: What is the reason for big difference in DEGs estimation compared to previous estimation?
gravatar for anc.informatics
4 days ago by
anc.informatics0 wrote:

I tried to estimate the DEGs for a set of transcriptome from the study. In the analysis pipeline, after adapter removal using Trim galore, I first mapped the reads to reference using Hisat2. Then estimated the counts using featureCounts. These raw counts were input to DESEq2 for DEGs estimation following the tutorial.

My problem here is only around 30-40% of top up or down-regulated DEGs (for ex, among top 600 genes) estimated in the study (whole DEGs list provided as suppl files) match with my estimation. Please note that GTF file and genome index files are same in both analysis.

So at what point the big difference occurs? Is there something wrong what I did in my pipeline? Am aware that Tophat is outdated and will it make such a big difference in estimation?

rna-seq degs tophat • 85 views
ADD COMMENTlink modified 4 days ago by jared.andrews071.6k • written 4 days ago by anc.informatics0
gravatar for jared.andrews07
4 days ago by
St. Louis, MO
jared.andrews071.6k wrote:

Well, it's not necessarily Tophat, but cufflinks and DESeq2 are very different methodologies. I would not be overly worried if you don't capture 100% of their results. How does it look in the opposite direction (how many of your identified DEGs are in their list)? You can't compare apples to apples since you are using a very different (and better) method. You didn't necessarily do anything wrong.

ADD COMMENTlink written 4 days ago by jared.andrews071.6k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1598 users visited in the last hour