Single Pass Entrywise-Transformed Low Rank Approximation

In applications such as natural language processing or computer vision, one is given a large matrix and would like to compute a matrix decomposition, e.g., a low rank approximation, of a function applied entrywise to . A very important special case is the likelihood function . A natural way to do this would be to simply apply to each entry of , and then compute the matrix decomposition, but this requires storing all of as well as multiple passes over its entries. Recent work of Liang et al.\ shows how to find a rank- factorization to for an matrix using only words of memory, with overall error , where is the best rank- approximation to and is the square of the sum of Euclidean lengths of rows of . Their algorithm uses three passes over the entries of . The authors pose the open question of obtaining an algorithm with words of memory using only a single pass over the entries of . In this paper we resolve this open question, obtaining the first single-pass algorithm for this problem and for the same class of functions studied by Liang et al. Moreover, our error is , where is the sum of squares of Euclidean lengths of rows of . Thus our error is significantly smaller, as it removes the factor of and also . We also give an algorithm for regression, pointing out an error in previous work, and empirically validate our results.
View on arXiv