1902.10089
Perturbed-History Exploration in Stochastic Multi-Armed Bandits
26 February 2019
B. Kveton
Csaba Szepesvári
Mohammad Ghavamzadeh
Craig Boutilier
Papers citing
"Perturbed-History Exploration in Stochastic Multi-Armed Bandits"
6 / 6 papers shown
Title
Graph Neural Thompson Sampling
Shuang Wu
Arash A. Amini
48
0
0
15 Jun 2024
Multiplier Bootstrap-based Exploration
Runzhe Wan
Haoyu Wei
B. Kveton
R. Song
16
2
0
03 Feb 2023
Maillard Sampling: Boltzmann Exploration Done Optimally
Jieming Bian
Kwang-Sung Jun
19
12
0
05 Nov 2021
Anti-Concentrated Confidence Bonuses for Scalable Exploration
Jordan T. Ash
Cyril Zhang
Surbhi Goel
A. Krishnamurthy
Sham Kakade
35
6
0
21 Oct 2021
Policy Optimization as Online Learning with Mediator Feedback
Alberto Maria Metelli
Matteo Papini
P. D'Oro
Marcello Restelli
OffRL
19
10
0
15 Dec 2020
BanditPAM: Almost Linear Time k-Medoids Clustering via Multi-Armed Bandits
Mo Tiwari
Martin Jinye Zhang
James Mayclin
Sebastian Thrun
Chris Piech
Ilan Shomorony
12
11
0
11 Jun 2020