120

Tractable Instances of Bilinear Maximization: Implementing LinUCB on Ellipsoids

International Journal of Intelligent Systems and Applications in Engineering (IJISAE), 2025
Main:8 Pages
8 Figures
Bibliography:1 Pages
Appendix:18 Pages
Abstract

We consider the maximization of xθx^\top \theta over (x,θ)X×Θ(x,\theta) \in \mathcal{X} \times \Theta, with XRd\mathcal{X} \subset \mathbb{R}^d convex and ΘRd\Theta \subset \mathbb{R}^d an ellipsoid. This problem is fundamental in linear bandits, as the learner must solve it at every time step using optimistic algorithms. We first show that for some sets X\mathcal{X} e.g. p\ell_p balls with p>2p>2, no efficient algorithms exist unless P=NP\mathcal{P} = \mathcal{NP}. We then provide two novel algorithms solving this problem efficiently when X\mathcal{X} is a centered ellipsoid. Our findings provide the first known method to implement optimistic algorithms for linear bandits in high dimensions.

View on arXiv
Comments on this paper