87

Low-Rank Approximation from Communication Complexity

Abstract

In low-rank approximation with missing entries, given ARn×nA\in \mathbb{R}^{n\times n} and binary W{0,1}n×nW \in \{0,1\}^{n\times n}, the goal is to find a rank-kk matrix LL for which: cost(L)=\sum_{i=1}^{n} \sum_{j=1}^{n}W_{i,j}\cdot (A_{i,j} - L_{i,j})^2\le OPT+\epsilon \|A\|_F^2, where OPT=minrankk L^cost(L^)OPT=\min_{rank-k\ \hat{L}}cost(\hat L). This problem is also known as matrix completion and, depending on the choice of WW, captures low-rank plus diagonal decomposition, robust PCA, low-rank recovery from monotone missing data, and a number of other important problems. Many of these problems are NP-hard, and while algorithms with provable guarantees are known in some cases, they either 1) run in time nΩ(k2/ϵ)n^{\Omega(k^2/\epsilon)}, or 2) make strong assumptions, e.g., that AA is incoherent or that WW is random. In this work, we consider bicriteria algorithmsbicriteria\ algorithms, which output LL with rank k>kk' > k. We prove that a common heuristic, which simply sets AA to 00 where WW is 00, and then computes a standard low-rank approximation, achieves the above approximation bound with rank kk' depending on the communication complexitycommunication\ complexity of WW. Namely, interpreting WW as the communication matrix of a Boolean function f(x,y)f(x,y) with x,y{0,1}lognx,y\in \{0,1\}^{\log n}, it suffices to set k=O(k2Rϵ1sided(f))k'=O(k\cdot 2^{R^{1-sided}_{\epsilon}(f)}), where Rϵ1sided(f)R^{1-sided}_{\epsilon}(f) is the randomized communication complexity of ff with 11-sided error probability ϵ\epsilon. For many problems, this yields bicriteria algorithms with k=kpoly((logn)/ϵ)k'=k\cdot poly((\log n)/\epsilon). We prove a similar bound using the randomized communication complexity with 22-sided error. Further, we show that different models of communication yield algorithms for natural variants of the problem. E.g., multi-player communication complexity connects to tensor decomposition and non-deterministic communication complexity to Boolean low-rank factorization.

View on arXiv
Comments on this paper