Question: Which statistical test to show the overlap of some genes in two different data is significant.
gravatar for archisman
12 months ago by
archisman0 wrote:

Suppose I have a gene list of 470 genes that are induced in my study. I found that in other studies people already showed about 1000 genes were involved in the same kind of pathways but with the different model systems. Now, when I did an overlap of those 1000 genes with my 470 genes I found out of 470, at least 165 are common. Which statistical test I need to perform here to show that the overlap is not due to only by chance.

rna-seq R • 246 views
ADD COMMENTlink modified 12 months ago by russhh5.5k • written 12 months ago by archisman0

I suggest LOLA (BioC package) for this task:

ADD REPLYlink written 12 months ago by ATpoint44k

Hypergeometric test? See: Hypergeometric {stats}

ADD REPLYlink modified 12 months ago • written 12 months ago by zx87549.9k
gravatar for russhh
12 months ago by
UK, U. Glasgow
russhh5.5k wrote:

A slight issue here is that you need to know what the gene-universe is before you can do these comparisons, and you need to restrict your gene counts to just those that could have been assessed in both studies: for example, although you have 470 induced genes, they might not all have been studied in the other study.

Once you've got the gene-universe and the restriction of your gene-counts to that gene-universe, the standard approach would be to use Fisher's Exact test.

ADD COMMENTlink written 12 months ago by russhh5.5k

(although it's not perfect: the false-positive rate isn't equal across the range of expression levels for RNA-Seq datasets, and a number of gene-pairs are 'technically correlated' because of sequence similarity etc; to mitigate against the first of these, I'd recommend you rerun it with several choices of RNA-Seq significance threshold, and with several choices of detectability cutoffs)

ADD REPLYlink modified 12 months ago • written 12 months ago by russhh5.5k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1237 users visited in the last hour