ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.08376
17
2

Ensemble sampling for linear bandits: small ensembles suffice

14 November 2023
David Janz
A. Litvak
Csaba Szepesvári
ArXivPDFHTML
Abstract

We provide the first useful and rigorous analysis of ensemble sampling for the stochastic linear bandit setting. In particular, we show that, under standard assumptions, for a ddd-dimensional stochastic linear bandit with an interaction horizon TTT, ensemble sampling with an ensemble of size of order dlog⁡Td \log TdlogT incurs regret at most of the order (dlog⁡T)5/2T(d \log T)^{5/2} \sqrt{T}(dlogT)5/2T​. Ours is the first result in any structured setting not to require the size of the ensemble to scale linearly with TTT -- which defeats the purpose of ensemble sampling -- while obtaining near T\smash{\sqrt{T}}T​ order regret. Our result is also the first to allow for infinite action sets.

View on arXiv
Comments on this paper