276
v1v2v3v4 (latest)

Simple Heuristics Yield Provable Algorithms for Masked Low-Rank Approximation

Abstract

In masked lowrank approximationmasked\ low-rank\ approximation, one is given ARn×nA \in \mathbb{R}^{n \times n} and binary mask matrix W{0,1}n×nW \in \{0,1\}^{n \times n}. The goal is to find a rank-kk matrix LL for which: cost(L)=i=1nj=1nWi,j(Ai,jLi,j)2OPT+ϵAF2,cost(L) = \sum_{i=1}^{n} \sum_{j = 1}^{n} W_{i,j} \cdot (A_{i,j} - L_{i,j} )^2 \leq OPT + \epsilon \|A\|_F^2 , where OPT=minrankk L^cost(L^)OPT = \min_{rank-k\ \hat{L}} cost(\hat L) and ϵ\epsilon is a given error parameter. Depending on the choice of WW, this problem captures factor analysis, low-rank plus diagonal decomposition, robust PCA, low-rank matrix completion, low-rank plus block matrix approximation, and many problems. Many of these problems are NP-hard, and while some algorithms with provable guarantees are known, they either 1) run in time nΩ(k2/ϵ)n^{\Omega(k^2/\epsilon)} or 2) make strong assumptions, e.g., that AA is incoherent or that WW is random. In this work, we show that a common polynomial time heuristic, which simply sets AA to 00 where WW is 00, and then finds a standard low-rank approximation, yields bicriteria approximation guarantees for this problem. In particular, for rank k>kk' > k depending on the public coin partition numberpublic\ coin\ partition\ number of WW, the heuristic outputs rank-kk' LL with cost(L)OPT+ϵAF2(L) \leq OPT + \epsilon \|A\|_F^2. This partition number is in turn bounded by the randomized communication complexityrandomized\ communication\ complexity of WW, when interpreted as a two-player communication matrix. For many important examples of masked low-rank approximation, including all those listed above, this result yields bicriteria approximation guarantees with k=kpoly(logn/ϵ)k' = k \cdot poly(\log n/\epsilon). Further, we show that different models of communication yield algorithms for natural variants of masked low-rank approximation. For example, multi-player number-in-hand communication complexity connects to masked tensor decomposition and non-deterministic communication complexity to masked Boolean low-rank factorization.

View on arXiv
Comments on this paper