234

Fast Exact Matrix Completion: A Unifying Optimization Framework

Abstract

We consider the problem of matrix completion of rank kk on an n×mn\times m matrix. We show that both the general case and the case with side information can be formulated as a combinatorical problem of selecting kk vectors from pp column features. We demonstrate that it is equivalent to a separable optimization problem that is amenable to stochastic gradient descent. We design fastImpute, based on projected stochastic gradient descent, to enable efficient scaling of the algorithm of sizes of 105×10510^5 \times 10^5. We report experiments on both synthetic and real-world datasets that show fastImpute is competitive in both the accuracy of the matrix recovered and the time needed across all cases. Furthermore, when a high number of entries are missing, fastImpute is over 75%75\% lower in MAPE and 1010x faster than current state-of-the-art matrix completion methods in both the case with side information and without.

View on arXiv
Comments on this paper