150
v1v2 (latest)

KFC: A Scalable Approximation Algorithm for kk-center Fair Clustering

Neural Information Processing Systems (NeurIPS), 2020
Abstract

In this paper, we study the problem of fair clustering on the kk-center objective. In fair clustering, the input is NN points, each belonging to at least one of ll protected groups, e.g. male, female, Asian, Hispanic. The objective is to cluster the NN points into kk clusters to minimize a classical clustering objective function. However, there is an additional constraint that each cluster needs to be fair, under some notion of fairness. This ensures that no group is either "over-represented" or "under-represented" in any cluster. Our work builds on the work of Chierichetti et al. (NIPS 2017), Bera et al. (NeurIPS 2019), Ahmadian et al. (KDD 2019), and Bercea et al. (APPROX 2019). We obtain a randomized 33-approximation algorithm for the kk-center objective function, beating the previous state of the art (44-approximation). We test our algorithm on real datasets, and show that our algorithm is effective in finding good clusters without over-representation or under-representation, surpassing the current state of the art in runtime speed, clustering cost, while achieving similar fairness violations.

View on arXiv
Comments on this paper