HIERARCHICAL CLUSTERING

Type of clustering that revolves around creating a nested structure of clustering, there are 2 types of hierarchical clustering:

  • Agglomerative (bottom up approach)

flowchart TD
	A[each data point is considered a cluster];
	B[the two less separated cluster are merged together];
	A-->B
	B-->|repeat until the best \n clustering scheme|A
  • Divisive (top down approach)

flowchart TD
	A[the entire dataset is the only cluster];
	B[the cluster with the lowest cohesion is split];
	A-->B
	B-->|repeat until the best \n clustering scheme|A

COMPLEXITY

the complexity of hierarchical clustering is for the computation and storage of the distance matrix

for the single step iteration the complexity is

so in conclusion the time complexity is

CONS

  • poor scaling due to high complexity
  • no global objective function

PREVIOUS NEXT