dear All,
The following makes sense intuitively, but I cannot find a reference:
- Let's say I've got 3 datasets, for example differential gene expression, of related experiments, where normalization was done in 3 different ways.
- Then I consider the pathway enrichment results found to be in common for all 3 datasets.
In other contexts it is known that standardizing helps internal validity, but it may diminish external validity. Is that the case here? That is, if I'm lucky to find common pathways while keeping the normalization heterogeneous, the result generalizes better? And if I have to resort to the same normalization process to see any consensus, that consensus is less likely to be true on related yet unseen data??
Is there any work on this, any papers? I could find work on ensemble diversity, heterogenization and causality patterns, but that's all related, not the same thing.
Thanks a lot in advance! G