I am very new to this field and looking to get some feedback. I have bulk short read RNA sequencing results from cell culture samples WT and mutant. N=5 per genotype. I want to calculate DEGs and wondering which is the preferred method in this scenario: LM (linear regression model) or DESeq2?
I ran code for LM following cqn normalization and RPKM filtering and controlled for batch effect and RIN, and received 0 DEGs. In contrast, I also ran DESeq2 on raw counts and controlled for batch and RIN, and obtained hundreds of DEGs. Why are the results between the two methods so different and how do you decide on which method to use?
From my reading, I believe that DESeq2 would be the best based on my sample size. Any help or guidance greatly appreciated! Thank you in advance!
swbarnes2 Thank you for your response! I am new to RNA seq analysis and was originally recommended to use the LM method following cqn and RPKM normalization by my school, but began to grow suspicious after I began reading into various methods.
In terms of the batch correction, I am using iPSC-derived astrocytes (2 genotypes) differentiated in 5 independent experiments.
It is not clear to me that you can meaningfully correct for 5 different batches with two samples a piece.