42
15
v1v2v3v4 (latest)

Parallelizing Spectral Algorithms for Kernel Learning

Abstract

We consider a distributed learning approach in supervised learning for a large class of spectral regularization methods in an RKHS framework. The data set of size n is partitioned into m=O(nα)m=O(n^\alpha) disjoint subsets. On each subset, some spectral regularization method (belonging to a large class, including in particular Kernel Ridge Regression, L2L^2-boosting and spectral cut-off) is applied. The regression function ff is then estimated via simple averaging, leading to a substantial reduction in computation time. We show that minimax optimal rates of convergence are preserved if m grows sufficiently slowly (corresponding to an upper bound for α\alpha) as nn \to \infty, depending on the smoothness assumptions on ff and the intrinsic dimensionality. In spirit, our approach is classical.

View on arXiv
Comments on this paper