ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1903.05594
8
68

Gaussian Process Optimization with Adaptive Sketching: Scalable and No Regret

13 March 2019
Daniele Calandriello
Luigi Carratino
A. Lazaric
Michal Valko
Lorenzo Rosasco
    GP
ArXivPDFHTML
Abstract

Gaussian processes (GP) are a well studied Bayesian approach for the optimization of black-box functions. Despite their effectiveness in simple problems, GP-based algorithms hardly scale to high-dimensional functions, as their per-iteration time and space cost is at least quadratic in the number of dimensions ddd and iterations ttt. Given a set of AAA alternatives to choose from, the overall runtime O(t3A)O(t^3A)O(t3A) is prohibitive. In this paper we introduce BKB (budgeted kernelized bandit), a new approximate GP algorithm for optimization under bandit feedback that achieves near-optimal regret (and hence near-optimal convergence rate) with near-constant per-iteration complexity and remarkably no assumption on the input space or covariance of the GP. We combine a kernelized linear bandit algorithm (GP-UCB) with randomized matrix sketching based on leverage score sampling, and we prove that randomly sampling inducing points based on their posterior variance gives an accurate low-rank approximation of the GP, preserving variance estimates and confidence intervals. As a consequence, BKB does not suffer from variance starvation, an important problem faced by many previous sparse GP approximations. Moreover, we show that our procedure selects at most O~(deff)\tilde{O}(d_{eff})O~(deff​) points, where deffd_{eff}deff​ is the effective dimension of the explored space, which is typically much smaller than both ddd and ttt. This greatly reduces the dimensionality of the problem, thus leading to a O(TAdeff2)O(TAd_{eff}^2)O(TAdeff2​) runtime and O(Adeff)O(A d_{eff})O(Adeff​) space complexity.

View on arXiv
Comments on this paper