308
v1v2v3 (latest)

Diversity-aware clustering: Computational Complexity and Approximation Algorithms

Main:15 Pages
1 Figures
Bibliography:3 Pages
1 Tables
Appendix:3 Pages
Abstract

In this work, we study diversity-aware clustering problems where the data points are associated with multiple attributes resulting in intersecting groups. A clustering solution needs to ensure that the number of chosen cluster centers from each group should be within the range defined by a lower and upper bound threshold for each group, while simultaneously minimizing the clustering objective, which can be either kk-median, kk-means or kk-supplier. We study the computational complexity of the proposed problems, offering insights into their NP-hardness, polynomial-time inapproximability, and fixed-parameter intractability. We present parameterized approximation algorithms with approximation ratios 1+2e+ϵ1.7361+ \frac{2}{e} + \epsilon \approx 1.736, 1+8e+ϵ3.9431+\frac{8}{e} + \epsilon \approx 3.943, and 55 for diversity-aware kk-median, diversity-aware kk-means and diversity-aware kk-supplier, respectively. Assuming Gap-ETH, the approximation ratios are tight for the diversity-aware kk-median and diversity-aware kk-means problems. Our results imply the same approximation factors for their respective fair variants with disjoint groups -- fair kk-median, fair kk-means, and fair kk-supplier -- with lower bound requirements.

View on arXiv
Comments on this paper