I am trying to replicate some computational experiments I found in a paper. In this paper, the authors have ~30 genechip human genome u133 plus 2.0 arrays, 1 for each experimental sample and no references. They process the .CEL files into log2 RMA normalized signal intensity files. They then create a dendrogram that demonstrates that there are 2 main phenotypes within these 30 samples, based on gene expression.
I am trying to replicate their work, but I'm not sure how they went from the log2 RMA normalized signal intensity files to a clustered dendrogram. There explanation is, "Hierarchical clustering was performed using Euclidean distance and a complete linkage metric."
I've reached out to the authors, but this paper is nearly 5 years old, no I may not get a response. Does anybody know how this can be done?