41
13

FPT Approximation for Socially Fair Clustering

Abstract

In this work, we study the socially fair kk-median/kk-means problem. We are given a set of points PP in a metric space X\mathcal{X} with a distance function d(.,.)d(.,.). There are \ell groups: P1,,PPP_1,\dotsc,P_{\ell} \subseteq P. We are also given a set FF of feasible centers in X\mathcal{X}. The goal of the socially fair kk-median problem is to find a set CFC \subseteq F of kk centers that minimizes the maximum average cost over all the groups. That is, find CC that minimizes the objective function Φ(C,P)maxjxPjd(C,x)/Pj\Phi(C,P) \equiv \max_{j} \sum_{x \in P_j} d(C,x)/|P_j|, where d(C,x)d(C,x) is the distance of xx to the closest center in CC. The socially fair kk-means problem is defined similarly by using squared distances, i.e., d2(.,.)d^{2}(.,.) instead of d(.,.)d(.,.). In this work, we design (5+ε)(5+\varepsilon) and (33+ε)(33 + \varepsilon) approximation algorithms for the socially fair kk-median and kk-means problems, respectively. For the parameters: kk and \ell, the algorithms have an FPT (fixed parameter tractable) running time of f(k,,ε)nf(k,\ell,\varepsilon) \cdot n for f(k,,ε)=2O(k/ε)f(k,\ell,\varepsilon) = 2^{{O}(k \, \ell/\varepsilon)} and n=PFn = |P \cup F|. We also study a special case of the problem where the centers are allowed to be chosen from the point set PP, i.e., PFP \subseteq F. For this special case, our algorithms give better approximation guarantees of (4+ε)(4+\varepsilon) and (18+ε)(18+\varepsilon) for the socially fair kk-median and kk-means problems, respectively. Furthermore, we convert these algorithms to constant pass log-space streaming algorithms. Lastly, we show FPT hardness of approximation results for the problem with a small gap between our upper and lower bounds.

View on arXiv
Comments on this paper