Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2011.00330
Cited By
v1
v2 (latest)
Resource Allocation in Multi-armed Bandit Exploration: Overcoming Sublinear Scaling with Adaptive Parallelism
International Conference on Machine Learning (ICML), 2020
31 October 2020
Brijen Thananjeyan
Kirthevasan Kandasamy
Ion Stoica
Sai Li
Ken Goldberg
Joseph E. Gonzalez
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Resource Allocation in Multi-armed Bandit Exploration: Overcoming Sublinear Scaling with Adaptive Parallelism"
4 / 4 papers shown
Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter
Qinghao Hu
S. Yang
Junxian Guo
Xiaozhe Yao
Yujun Lin
Yuxian Gu
Han Cai
Chuang Gan
Ana Klimovic
Song Han
OffRL
LRM
AI4CE
213
6
0
20 Nov 2025
Efficient Learning of POMDPs with Known Observation Model in Average-Reward Setting
Alessio Russo
Alberto Maria Metelli
Marcello Restelli
254
1
0
02 Oct 2024
BORA: Bayesian Optimization for Resource Allocation
Social Science Research Network (SSRN), 2022
Antonio Candelieri
Andrea Ponti
Francesco Archetti
166
0
0
12 Oct 2022
PAC Best Arm Identification Under a Deadline
Brijen Thananjeyan
Kirthevasan Kandasamy
Ion Stoica
Sai Li
Ken Goldberg
Joseph E. Gonzalez
214
4
0
06 Jun 2021
1
Page 1 of 1