HIERARCHICAL CLUSTERING
Type of clustering that revolves around creating a nested structure of clustering, there are 2 types of hierarchical clustering:
-
Agglomerative (bottom up approach)
flowchart TD
A[each data point is considered a cluster];
B[the two less separated cluster are merged together];
A-->B
B-->|repeat until the best \n clustering scheme|A
-
Divisive (top down approach)
flowchart TD
A[the entire dataset is the only cluster];
B[the cluster with the lowest cohesion is split];
A-->B
B-->|repeat until the best \n clustering scheme|A
COMPLEXITY
the complexity of hierarchical clustering is for the computation and storage of the distance matrix
for the single step iteration the complexity is
so in conclusion the time complexity is
CONS
- poor scaling due to high complexity
- no global objective function