I have just clustered a list of 926 genes (with differential expression between two groups) with the Genesis 1.8.1 program and I use the Weighted pair-group average linkage (WPGMA) clustering algorithm. I automatically obtain a heatmap with the tree or dendrogram. Once here, I can manually select the clusters but how do I decide which node to cut into? Should I try to make the clusters similar in size or cut by a similar node height?
I have seen questions where you want to know the number of clusters before classification, but as this is an unsupervised method here I get the tree and the choice of clusters is made later. Is there any rule that would allow me to select the clusters in a more or less objective way?
In addition and for a correct clustering, should I analyze both groups (sample groups) separately? Each group has different levels of expression (that is why they are proteins with differential expression), and this could make it difficult for the algorithm to build clusters, isn't it?
I attach the programme manuals: https://genome.tugraz.at/Theses/Sturn2001.pdf https://genome.tugraz.at/genesisclient/GenesisManual.pdf
Thank you very much for your help