Nearly-Optimal Hierarchical Clustering for Well-Clustered Graphs

Abstract
This paper presents two efficient hierarchical clustering (HC) algorithms with respect to Dasgupta's cost function. For any input graph with a clear cluster-structure, our algorithms run in time nearly linear in the size of the input graph and return an approximate HC tree with respect to Dasgupta's cost function. We compare the performance of our algorithms against the previous state-of-the-art on synthetic and real-world datasets and show that they produce comparable or better HC trees with much lower running time.
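For readers unfamiliar with the objective, Dasgupta's cost of an HC tree T on a weighted graph is the sum, over all edges {u, v}, of the edge weight w(u, v) multiplied by the number of leaves under the lowest common ancestor of u and v in T. The sketch below only evaluates this cost for a given tree; it is not one of the paper's algorithms. The function name `dasgupta_cost`, the nested-tuple tree encoding, and the example graph are assumptions made for illustration.

```python
def dasgupta_cost(tree, weights):
    """Evaluate Dasgupta's cost of `tree` on a weighted undirected graph.

    tree:    nested tuples; any non-tuple value is a leaf (a vertex).
    weights: dict mapping (u, v) pairs to edge weights, one entry per edge.
    (Both encodings are illustrative assumptions, not taken from the paper.)
    """
    # Normalise keys so that (u, v) and (v, u) refer to the same edge.
    edges = {frozenset(e): w for e, w in weights.items()}
    total = 0.0

    def visit(node):
        nonlocal total
        if not isinstance(node, tuple):
            return {node}
        child_leaves = [visit(child) for child in node]
        leaves = set().union(*child_leaves)
        # An edge whose endpoints lie in different children has this node
        # as its lowest common ancestor, so it pays w * (#leaves below here).
        for i, left in enumerate(child_leaves):
            for right in child_leaves[i + 1:]:
                for u in left:
                    for v in right:
                        total += edges.get(frozenset((u, v)), 0.0) * len(leaves)
        return leaves

    visit(tree)
    return total


# Unit-weight path a - b - c - d (hypothetical example graph).
path = {("a", "b"): 1, ("b", "c"): 1, ("c", "d"): 1}
balanced = (("a", "b"), ("c", "d"))
caterpillar = ((("a", "b"), "c"), "d")

print(dasgupta_cost(balanced, path))     # 8.0
print(dasgupta_cost(caterpillar, path))  # 9.0
```

On this path graph the balanced tree is cheaper because only one edge is separated at the root, which is the behaviour the cost function is designed to reward: heavy or numerous edges should be cut as low in the tree as possible.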