From Optimality to Robustness: Dirichlet Sampling Strategies in Stochastic Bandits

18 November 2021

Papers citing "From Optimality to Robustness: Dirichlet Sampling Strategies in Stochastic Bandits"

4 / 4 papers shown

Title
Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees D. Tiapkin Denis Belomestny Daniele Calandriello Eric Moulines Rémi Munos A. Naumov Mark Rowland Michal Valko Pierre Menard 92 10 0 28 Sep 2022
Top Two Algorithms Revisited Marc Jourdan Rémy Degenne Dorian Baudry R. D. Heide E. Kaufmann 76 42 0 13 Jun 2022
From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses D. Tiapkin Denis Belomestny Eric Moulines A. Naumov S. Samsonov Yunhao Tang Michal Valko Pierre Menard 95 19 0 16 May 2022
A Unifying Theory of Thompson Sampling for Continuous Risk-Averse Bandits Joel Q. L. Chang Vincent Y. F. Tan 108 14 0 25 Aug 2021