75

Low-Rank Approximation from Communication Complexity

Abstract

In masked lowrank approximationmasked\ low-rank\ approximation, one is given ARn×nA \in \mathbb{R}^{n \times n} and binary mask W{0,1}n×nW \in \{0,1\}^{n \times n}. The goal is to find a rank-kk matrix LL for which: cost(L) = \sum_{i=1}^{n} \sum_{j = 1}^{n} W_{i,j} \cdot (A_{i,j} - L_{i,j} )^2 \leq OPT + \epsilon \|A\|_F^2 , where OPT=minrankk L^cost(L^)OPT = \min_{rank-k\ \hat{L}} cost(\hat L) and ϵ\epsilon is a given error parameter. Depending on the choice of WW, this problem captures factor analysis, low-rank plus diagonal decomposition, robust PCA, low-rank matrix completion, low-rank plus block matrix approximation, and many problems. Many of these problems are NP-hard, and while some algorithms with provable guarantees are known, they either 1) run in time nΩ(k2/ϵ)n^{\Omega(k^2/\epsilon)} or 2) make strong assumptions, e.g., that AA is incoherent or that WW is random. We consider bicriteria algorithmsbicriteria\ algorithms, which output a rank-kk' matrix LL, with k>kk' > k, for which cost(L)OPT+ϵAF2cost(L) \leq OPT + \epsilon \|A\|_F^2. We show, rather surprisingly, that a common polynomial time heuristic, which simply sets AA to 00 where WW is 00, and then finds a standard low-rank approximation, achieves this error bound with rank kk' depending on public coin partition number of WW. This partition number is in turn bounded by the randomized communication complexityrandomized\ communication\ complexity of WW, when interpreted as a two-player communication matrix. For many important examples of masked low-rank approximation, including all those listed above, this result yields bicriteria approximation guarantees with k=kpoly(logn/ϵ)k' = k \cdot poly(\log n/\epsilon). Further, we show that different models of communication yield algorithms for natural variants of masked low-rank approximation. For example, multi-player number-in-hand communication complexity connects to masked tensor decomposition and non-deterministic communication complexity to masked Boolean low-rank factorization.

View on arXiv
Comments on this paper