Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.04959
Cited By
Regret Bounds for Batched Bandits
11 October 2019
Hossein Esfandiari
Amin Karbasi
Abbas Mehrabian
Vahab Mirrokni
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Regret Bounds for Batched Bandits"
40 / 40 papers shown
Title
Batched Nonparametric Bandits via k-Nearest Neighbor UCB
Sakshi Arya
OffRL
25
0
0
15 May 2025
A Near-optimal, Scalable and Corruption-tolerant Framework for Stochastic Bandits: From Single-Agent to Multi-Agent and Beyond
Zicheng Hu
Cheng Chen
72
0
0
11 Feb 2025
Adversarial Online Learning with Temporal Feedback Graphs
Khashayar Gatmiry
Jon Schneider
25
0
0
30 Jun 2024
Optimal Batched Linear Bandits
Xuanfei Ren
Tianyuan Jin
Pan Xu
40
2
0
06 Jun 2024
A Batch Sequential Halving Algorithm without Performance Degradation
Sotetsu Koyamada
Soichiro Nishimori
Shin Ishii
35
0
0
01 Jun 2024
Batched Stochastic Bandit for Nondegenerate Functions
Yu Liu
Yunlu Shu
Tianyu Wang
52
0
0
09 May 2024
Replicability is Asymptotically Free in Multi-armed Bandits
Junpei Komiyama
Shinji Ito
Yuichi Yoshida
Souta Koshino
35
1
0
12 Feb 2024
Falcon: Fair Active Learning using Multi-armed Bandits
Ki Hyun Tae
Hantian Zhang
Jaeyoung Park
Kexin Rong
Steven Euijong Whang
FaML
14
2
0
23 Jan 2024
Experiment Planning with Function Approximation
Aldo Pacchiano
Jonathan Lee
Emma Brunskill
OffRL
37
3
0
10 Jan 2024
Best Arm Identification in Batched Multi-armed Bandit Problems
Sheng Cao
Simai He
Ruoqing Jiang
Jin Xu
Hongsong Yuan
17
1
0
21 Dec 2023
Sample-Efficiency in Multi-Batch Reinforcement Learning: The Need for Dimension-Dependent Adaptivity
Emmeran Johnson
Ciara Pike-Burke
Patrick Rebeschini
OffRL
29
2
0
02 Oct 2023
Cooperative Multi-agent Bandits: Distributed Algorithms with Optimal Individual Regret and Constant Communication Costs
L. Yang
Xuchuang Wang
Mohammad Hajiesmaili
Lijun Zhang
John C. S. Lui
Don Towsley
38
5
0
08 Aug 2023
Preferences Evolve And So Should Your Bandits: Bandits with Evolving States for Online Platforms
Khashayar Khosravi
R. Leme
Chara Podimata
Apostolis Tsorvantzis
26
0
0
21 Jul 2023
Robust and differentially private stochastic linear bandits
Vasileios Charisopoulos
Hossein Esfandiari
Vahab Mirrokni
FedML
29
1
0
23 Apr 2023
Adaptive Experimentation at Scale: A Computational Framework for Flexible Batches
Ethan Che
Hongseok Namkoong
OffRL
51
1
0
21 Mar 2023
A Lipschitz Bandits Approach for Continuous Hyperparameter Optimization
Yasong Feng
Weijian Luo
Yimin Huang
Tianyu Wang
26
8
0
03 Feb 2023
Anonymous Bandits for Multi-User Systems
Hossein Esfandiari
Vahab Mirrokni
Jon Schneider
PICV
26
0
0
21 Oct 2022
Reward Imputation with Sketching for Contextual Batched Bandits
Xiao Zhang
Ninglu Shao
Zihua Si
Jun Xu
Wen Wang
Hanjing Su
Jirong Wen
OffRL
25
1
0
13 Oct 2022
Replicable Bandits
Hossein Esfandiari
Alkis Kalavasis
Amin Karbasi
Andreas Krause
Vahab Mirrokni
Grigoris Velegkas
37
14
0
04 Oct 2022
Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning with Linear Function Approximation
Dan Qiao
Yu Wang
OffRL
75
13
0
03 Oct 2022
An Asymptotically Optimal Batched Algorithm for the Dueling Bandit Problem
Arpit Agarwal
R. Ghuge
V. Nagarajan
27
1
0
25 Sep 2022
Differentially Private Stochastic Linear Bandits: (Almost) for Free
Osama A. Hanna
Antonious M. Girgis
Christina Fragouli
Suhas Diggavi
FedML
29
18
0
07 Jul 2022
Better Best of Both Worlds Bounds for Bandits with Switching Costs
I Zaghloul Amir
Guy Azov
Tomer Koren
Roi Livni
15
14
0
07 Jun 2022
Batched Dueling Bandits
Arpit Agarwal
R. Ghuge
V. Nagarajan
120
10
0
22 Feb 2022
Synthetically Controlled Bandits
Vivek Farias
C. Moallemi
Tianyi Peng
Andrew Zheng
33
13
0
14 Feb 2022
The Impact of Batch Learning in Stochastic Linear Bandits
Danil Provodin
Pratik Gajane
Mykola Pechenizkiy
M. Kaptein
21
2
0
14 Feb 2022
Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality
Jiawei Huang
Jinglin Chen
Li Zhao
Tao Qin
Nan Jiang
Tie-Yan Liu
OffRL
35
24
0
14 Feb 2022
Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost
Dan Qiao
Ming Yin
Ming Min
Yu Wang
43
28
0
13 Feb 2022
Solving Multi-Arm Bandit Using a Few Bits of Communication
Osama A. Hanna
Lin F. Yang
Christina Fragouli
26
16
0
11 Nov 2021
The Impact of Batch Learning in Stochastic Bandits
Danil Provodin
Pratik Gajane
Mykola Pechenizkiy
M. Kaptein
OffRL
22
2
0
03 Nov 2021
Lipschitz Bandits with Batched Feedback
Yasong Feng
Zengfeng Huang
Tianyu Wang
18
14
0
19 Oct 2021
Gaussian Process Bandit Optimization with Few Batches
Zihan Li
Jonathan Scarlett
GP
135
47
0
15 Oct 2021
Batched Thompson Sampling
Cem Kalkanli
Ayfer Özgür
OffRL
54
19
0
01 Oct 2021
Design of Experiments for Stochastic Contextual Linear Bandits
Andrea Zanette
Kefan Dong
Jonathan Lee
Emma Brunskill
OffRL
32
17
0
21 Jul 2021
Differentially Private Multi-Armed Bandits in the Shuffle Model
J. Tenenbaum
Haim Kaplan
Yishay Mansour
Uri Stemmer
FedML
19
28
0
05 Jun 2021
Parallelizing Thompson Sampling
Amin Karbasi
Vahab Mirrokni
M. Shadravan
59
23
0
02 Jun 2021
An Algorithm for Stochastic and Adversarial Bandits with Switching Costs
Chloé Rouyer
Yevgeny Seldin
Nicolò Cesa-Bianchi
AAML
23
24
0
19 Feb 2021
Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity Constraints
Chi Jin
Zhuoran Yang
Zhaoran Wang
OffRL
122
167
0
06 Jan 2021
Linear Bandits with Limited Adaptivity and Learning Distributional Optimal Design
Yufei Ruan
Jiaqi Yang
Yuanshuo Zhou
OffRL
102
51
0
04 Jul 2020
Maximal Objectives in the Multi-armed Bandit with Applications
Eren Ozbay
Vijay Kamble
35
0
0
11 Jun 2020
1