Upper Confidence Bounds for Combining Stochastic Bandits

24 December 2020

Papers citing "Upper Confidence Bounds for Combining Stochastic Bandits"

6 / 6 papers shown

Title
Offline-to-online hyperparameter transfer for stochastic bandits Dravyansh Sharma Arun Sai Suggala OffRL 103 4 0 06 Jan 2025
Causal Bandits: The Pareto Optimal Frontier of Adaptivity, a Reduction to Linear Bandits, and Limitations around Unknown Marginals Ziyi Liu Idan Attias Daniel M. Roy CML 51 1 0 01 Jul 2024
Linear Bandits with Memory: from Rotting to Rising Giulia Clerici Pierre Laforgue Nicolò Cesa-Bianchi 50 3 0 16 Feb 2023
Universal and data-adaptive algorithms for model selection in linear contextual bandits Vidya Muthukumar A. Krishnamurthy 71 5 0 08 Nov 2021
The Pareto Frontier of model selection for general Contextual Bandits T. V. Marinov Julian Zimmert 98 22 0 25 Oct 2021
Pareto Optimal Model Selection in Linear Bandits Yinglun Zhu Robert D. Nowak 43 14 0 12 Feb 2021