Does anyone else experience their results of differential gene expression analysis being vastly different from what has been published? I am still very new to R programming and bioinformatics, and am just trying to find differentially expressed genes between platinum resistant and sensitive samples in the TCGA ovarian cancer dataset.
When I try to run either limma or DESeq2, I cannot seem to replicate the results that have been published by multiple papers, even when I use the same datasets and try to follow what has been done in their methodology, and their code doesn't seem to be publicly published. Objectively, I would trust published results over my own analysis, but when I try to do individual t-tests on the genes of interest, the results tend to lie closer to my own analysis compared to what has been published.
Anyone else facing the same issue, or have any possible insights? Any help will be greatly appreciated
Honestly, those things shouldn't result in a considerable discrepancy.
The largest difference will come from the post-processing & analysis (normalization type, limma vs. deseq2, batch correction methodology, etc.).
Just from anecdotal experience.
For the OP: Start by making plots (plot your logFC and p-values against what the paper reports, make Venn Diagram overlaps of DE genes, etc.).