I have two kinds of gene list. One describes the status(Active, Bivalent, Repressive and Quiescent) of gene, i.e.:
Gene name Status A Active B Bivalent C Repressive ... ...
The other two describes predicted tumor suppressor gene(TSG) and oncogene(OG), i.e.:
TSG Gene name Score A 0.0001 B 100 C 1 ... ... OG Gene name Score A 0.001 B 1 C 10 ... ...
Then I associated first gene lists to the two other gene lists, respectively,to see whether the genes in the first list are TSG or OG(regardless of the score). I can get a table like this(the overlap is quite limited):
We can see that for the genes in the first list, there are more tumor suppressor genes than oncogenes(13>8). If I want to test whether repressive genes are indeed more associated with tumor suppressor genes compared to oncogenes, how I can add statistical test?
The lines in the first list are 1769 in total (exclude header); lines in the TSG list are 491 ; lines in the OG list are 501.