150

A Deterministic Streaming Sketch for Ridge Regression

International Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Abstract

We provide a deterministic space-efficient algorithm for estimating ridge regression. For nn data points with dd features and a large enough regularization parameter, we provide a solution within ε\varepsilon L2_2 error using only O(d/ε)O(d/\varepsilon) space. This is the first o(d2)o(d^2) space algorithm for this classic problem. The algorithm sketches the covariance matrix by variants of Frequent Directions, which implies it can operate in insertion-only streams and a variety of distributed data settings. In comparisons to randomized sketching algorithms on synthetic and real-world datasets, our algorithm has less empirical error using less space and similar time.

View on arXiv
Comments on this paper