I need to clarify a theoretical/conceptual analytical approach. Based on my understanding of the fold change patterns observed in an RNA-seq DE experiment. When successful, one gathers very unique patterns of gene/transcript expression which are observed when plotted on a scatterplot.
When two datasets/replicates are compared against each other using their respective log2 FC to assess correlation. Due to the unique fold change expression pattern, if they correlate, then a diagonal linear scatterplot distribution would be observed with a resulting line projection with a slope =~1.
m=~1 Under that rationale if I compare two replicates and observe a diagonal linear distribution, I assume there is correlation between replicates and there is confidence to move forward either merging the resulting reads or continuing forward with gathering candidate genes per replicate for further analysis. If the pattern observed has a "cross-like" distribution with a resulting linear projection with slope = ~0 then I make up there is some major difference between replicates and have to be cautious moving forward.
m=~0 Can anyone expand on what would the implications of low correlation be when performing this log2FC vs log2FC comparison? Does the "cross" or "cross-like" pattern mean that there is no correlation between replicates and data should not be used? Could this pattern be indicative of major differences like "batch" effects but still have relevant DE patterns for continuing forward with analysis?
Thanks, any insight is appreciated!