Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.09724
Cited By
From Optimality to Robustness: Dirichlet Sampling Strategies in Stochastic Bandits
18 November 2021
Dorian Baudry
Patrick Saux
Odalric-Ambrym Maillard
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"From Optimality to Robustness: Dirichlet Sampling Strategies in Stochastic Bandits"
4 / 4 papers shown
Title
Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees
D. Tiapkin
Denis Belomestny
Daniele Calandriello
Eric Moulines
Rémi Munos
A. Naumov
Mark Rowland
Michal Valko
Pierre Menard
92
10
0
28 Sep 2022
Top Two Algorithms Revisited
Marc Jourdan
Rémy Degenne
Dorian Baudry
R. D. Heide
E. Kaufmann
76
42
0
13 Jun 2022
From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses
D. Tiapkin
Denis Belomestny
Eric Moulines
A. Naumov
S. Samsonov
Yunhao Tang
Michal Valko
Pierre Menard
95
19
0
16 May 2022
A Unifying Theory of Thompson Sampling for Continuous Risk-Averse Bandits
Joel Q. L. Chang
Vincent Y. F. Tan
108
14
0
25 Aug 2021
1