83

On Socially Fair Low-Rank Approximation and Column Subset Selection

Main:23 Pages
2 Figures
Bibliography:8 Pages
Appendix:4 Pages
Abstract

Low-rank approximation and column subset selection are two fundamental and related problems that are applied across a wealth of machine learning applications. In this paper, we study the question of socially fair low-rank approximation and socially fair column subset selection, where the goal is to minimize the loss over all sub-populations of the data. We show that surprisingly, even constant-factor approximation to fair low-rank approximation requires exponential time under certain standard complexity hypotheses. On the positive side, we give an algorithm for fair low-rank approximation that, for a constant number of groups and constant-factor accuracy, runs in 2poly(k)2^{\text{poly}(k)} time rather than the na\"{i}ve npoly(k)n^{\text{poly}(k)}, which is a substantial improvement when the dataset has a large number nn of observations. We then show that there exist bicriteria approximation algorithms for fair low-rank approximation and fair column subset selection that run in polynomial time.

View on arXiv
Comments on this paper