When is centering and scaling needed before doing hierarchical clustering?
0
0
Entering edit mode
6.7 years ago
halo22 ▴ 300

Hello All,

I am working on a clustering project where we have collected protein data from over 100 patients samples. This data is normalized and log transformed to achieve a uniform distribution. The goal is to cluster samples based upon their similarities, I am using hierarchal clustering and trying out combinations of distance metrics and clustering algorithms. (We haven't made a decision on distance method or clustering algorithms) My question is related to the centering and scaling issue. Is it absolutely necessary to both scale and center the data?, even in scenarios where all the data is coming from the same platform and with same units of measurement.

Appreciate your input on this one.

Thanks

clustering statistics machine learning • 1.3k views
ADD COMMENT

Login before adding your answer.

Traffic: 2969 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6