Comparing lists of differentially expressed genes
2
1
Entering edit mode
5.3 years ago
ra381 ▴ 10

I've been calculating differential expression for two separate groups both of which have baseline and treatment expression measurements. I have looked at differential expression between the baseline and treatment and now have 2 lists of differentially expressed genes. Differentially expressed genes were identified with edgeR with appropriate correction for FDR.

It's an interesting question to compare the lists of differentially expressed genes and I can identify genes significantly up- and down-regulated in both groups. I can also show that the overlap in these lists is significant using Fisher's exact test.

However, when I look at genes that are significantly differentially expressed in only one group do I need to do any further analysis or test to ensure the difference is "real"? For example, one gene could be significantly DE in one group close to the cutoff for significant (e.g. P = 0.049) but just misses the cutoff in the second group (e.g. P = 0.051). I've been unable to find anything about this online so far.

RNA-Seq R edgeR differential expression • 3.9k views
3
Entering edit mode
5.3 years ago

I'd encourage you to use GSEA or a similar rank-based method for comparisons rather than choosing some p-value cut-off and comparing lists.

0
Entering edit mode

Thanks for the quick and interesting answer. I've had a look at a few methods including GSEA and RRHO (http://nar.oxfordjournals.org/content/38/17/e169) and I think I'll give this a go. Do you think it would be appropriate to just rank my 2 lists of genes (group1 and group2) by logFC and run them through RRHO?

0
Entering edit mode

I forget example how it works, but somewhere out there is a method describing how to combine fold-change and p-value, since they're not perfectly correlated.

0
Entering edit mode

This is an old thread, but a good rule of thumb for ranking genes for use in GSEA is to use this formula:

sign(log fold change) * -log10(unadjusted p-value)

0
Entering edit mode

Hi, I am just taking the opportunity of this old thread. My question is based on @antass's reply. Is it just the multiplication of the "sign" of the logfoldchange and the -log10(unadjusted p-value?). Thanks in advance.

0
Entering edit mode

Sorry, another year had passed! Yes, the sign is just the "direction" of the fold change, so that part fo the formula ultimately becomes either -1 or 1. The formula doesn't take into consideration the magnitude of the fold change.

1
Entering edit mode
5.3 years ago
EagleEye 7.1k