73

Approaching Optimality for Solving Dense Linear Systems with Low-Rank Structure

Michał Dereziński
Aaron Sidford
Main:34 Pages
Bibliography:5 Pages
1 Tables
Abstract

We provide new high-accuracy randomized algorithms for solving linear systems and regression problems that are well-conditioned except for kk large singular values. For solving such d×dd \times d positive definite system our algorithms succeed whp. and run in time O~(d2+kω)\tilde O(d^2 + k^\omega). For solving such regression problems in a matrix ARn×d\mathbf{A} \in \mathbb{R}^{n \times d} our methods succeed whp. and run in time O~(nnz(A)+d2+kω)\tilde O(\mathrm{nnz}(\mathbf{A}) + d^2 + k^\omega) where ω\omega is the matrix multiplication exponent and nnz(A)\mathrm{nnz}(\mathbf{A}) is the number of non-zeros in A\mathbf{A}. Our methods nearly-match a natural complexity limit under dense inputs for these problems and improve upon a trade-off in prior approaches that obtain running times of either O~(d2.065+kω)\tilde O(d^{2.065}+k^\omega) or O~(d2+dkω1)\tilde O(d^2 + dk^{\omega-1}) for d×dd\times d systems. Moreover, we show how to obtain these running times even under the weaker assumption that all but kk of the singular values have a suitably bounded generalized mean. Consequently, we give the first nearly-linear time algorithm for computing a multiplicative approximation to the nuclear norm of an arbitrary dense matrix. Our algorithms are built on three general recursive preconditioning frameworks, where matrix sketching and low-rank update formulas are carefully tailored to the problems' structure.

View on arXiv
Comments on this paper