321
v1v2v3 (latest)

Bandit optimisation of functions in the Matérn kernel RKHS

International Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Abstract

We consider the problem of optimising functions in the reproducing kernel Hilbert space (RKHS) of a Mat\érn kernel with smoothness parameter ν\nu over the domain [0,1]d[0,1]^d under noisy bandit feedback. Our contribution, the π\pi-GP-UCB algorithm, is the first practical approach with guaranteed sublinear regret for all ν>1\nu>1 and d1d \geq 1. Empirical validation suggests better performance and drastically improved computational scalablity compared with its predecessor, Improved GP-UCB.

View on arXiv
Comments on this paper