Hi,
I have a dataset of 400 genes with gene expression data (RNA-seq) across 45 tissues. These genes seem to cluster these tissues according to degree of proliferation, as indicated by the expression of a number of proliferation markers. I would now like to investigate whether this set of genes cluster the tissues according to proliferation more strongly than a random set of genes, what kind of test could I perform to elucidate that? When I cluster my data based on a random sample of 400 genes from the human genome, the tissues are not as clearly clustered according to proliferation so my hypothesis is that my set of genes are better at that.
Hope someone can help!