ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.10089
  4. Cited By
Perturbed-History Exploration in Stochastic Multi-Armed Bandits

Perturbed-History Exploration in Stochastic Multi-Armed Bandits

26 February 2019
B. Kveton
Csaba Szepesvári
Mohammad Ghavamzadeh
Craig Boutilier
ArXivPDFHTML

Papers citing "Perturbed-History Exploration in Stochastic Multi-Armed Bandits"

6 / 6 papers shown
Title
Graph Neural Thompson Sampling
Graph Neural Thompson Sampling
Shuang Wu
Arash A. Amini
48
0
0
15 Jun 2024
Multiplier Bootstrap-based Exploration
Multiplier Bootstrap-based Exploration
Runzhe Wan
Haoyu Wei
B. Kveton
R. Song
16
2
0
03 Feb 2023
Maillard Sampling: Boltzmann Exploration Done Optimally
Maillard Sampling: Boltzmann Exploration Done Optimally
Jieming Bian
Kwang-Sung Jun
19
12
0
05 Nov 2021
Anti-Concentrated Confidence Bonuses for Scalable Exploration
Anti-Concentrated Confidence Bonuses for Scalable Exploration
Jordan T. Ash
Cyril Zhang
Surbhi Goel
A. Krishnamurthy
Sham Kakade
35
6
0
21 Oct 2021
Policy Optimization as Online Learning with Mediator Feedback
Policy Optimization as Online Learning with Mediator Feedback
Alberto Maria Metelli
Matteo Papini
P. DÓro
Marcello Restelli
OffRL
19
10
0
15 Dec 2020
BanditPAM: Almost Linear Time $k$-Medoids Clustering via Multi-Armed
  Bandits
BanditPAM: Almost Linear Time kkk-Medoids Clustering via Multi-Armed Bandits
Mo Tiwari
Martin Jinye Zhang
James Mayclin
Sebastian Thrun
Chris Piech
Ilan Shomorony
12
11
0
11 Jun 2020
1