From Optimality to Robustness: Dirichlet Sampling Strategies in Stochastic Bandits

18 November 2021
Dorian Baudry
Patrick Saux
Odalric-Ambrym Maillard

Papers citing "From Optimality to Robustness: Dirichlet Sampling Strategies in Stochastic Bandits"

4 / 4 papers shown
Title
Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees
D. Tiapkin
Denis Belomestny
Daniele Calandriello
Eric Moulines
Rémi Munos
A. Naumov
Mark Rowland
Michal Valko
Pierre Menard
92
10
0
28 Sep 2022
Top Two Algorithms Revisited
Marc Jourdan
Rémy Degenne
Dorian Baudry
R. D. Heide
E. Kaufmann
76
42
0
13 Jun 2022
From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses
D. Tiapkin
Denis Belomestny
Eric Moulines
A. Naumov
S. Samsonov
Yunhao Tang
Michal Valko
Pierre Menard
95
19
0
16 May 2022
A Unifying Theory of Thompson Sampling for Continuous Risk-Averse Bandits
Joel Q. L. Chang
Vincent Y. F. Tan
108
14
0
25 Aug 2021