ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.03519
17
197

Scalable Fair Clustering

10 February 2019
A. Backurs
Piotr Indyk
Krzysztof Onak
B. Schieber
A. Vakilian
Tal Wagner
ArXivPDFHTML
Abstract

We study the fair variant of the classic kkk-median problem introduced by Chierichetti et al. [2017]. In the standard kkk-median problem, given an input pointset PPP, the goal is to find kkk centers CCC and assign each input point to one of the centers in CCC such that the average distance of points to their cluster center is minimized. In the fair variant of kkk-median, the points are colored, and the goal is to minimize the same average distance objective while ensuring that all clusters have an "approximately equal" number of points of each color. Chierichetti et al. proposed a two-phase algorithm for fair kkk-clustering. In the first step, the pointset is partitioned into subsets called fairlets that satisfy the fairness requirement and approximately preserve the kkk-median objective. In the second step, fairlets are merged into kkk clusters by one of the existing kkk-median algorithms. The running time of this algorithm is dominated by the first step, which takes super-quadratic time. In this paper, we present a practical approximate fairlet decomposition algorithm that runs in nearly linear time. Our algorithm additionally allows for finer control over the balance of resulting clusters than the original work. We complement our theoretical bounds with empirical evaluation.

View on arXiv
Comments on this paper