24
0

Four Algorithms for Correlation Clustering: A Survey

Abstract

In the Correlation Clustering problem, we are given a set of objects with pairwise similarity information. Our aim is to partition these objects into clusters that match this information as closely as possible. More specifically, the pairwise information is given as a weighted graph GG with its edges labelled as ``similar" or ``dissimilar" by a binary classifier. The goal is to produce a clustering that minimizes the weight of ``disagreements": the sum of the weights of similar edges across clusters and dissimilar edges within clusters. In this exposition we focus on the case when GG is complete and unweighted. We explore four approximation algorithms for the Correlation Clustering problem under this assumption. In particular, we describe the following algorithms: (i) the 1742917429-approximation algorithm by Bansal, Blum, and Chawla, (ii) the 44-approximation algorithm by Charikar, Guruswami, and Wirth (iii) the 33-approximation algorithm by Ailon, Charikar, and Newman (iv) the 2.062.06-approximation algorithm by Chawla, Makarychev, Schramm, and Yaroslavtsev.

View on arXiv
Comments on this paper