2007.01980
Linear Bandits with Limited Adaptivity and Learning Distributional Optimal Design
4 July 2020
Yufei Ruan
Jiaqi Yang
Yuanshuo Zhou
OffRL
Papers citing
"Linear Bandits with Limited Adaptivity and Learning Distributional Optimal Design"
41 / 41 papers shown
The Adaptivity Barrier in Batched Nonparametric Bandits: Sharp Characterization of the Price of Unknown Margin
Rong Jiang
Cong Ma
197
0
0
05 Nov 2025
Robust Batched Bandits
Yunwen Guo
Yunlu Shu
Gongyi Zhuo
Tianyu Wang
125
0
0
04 Oct 2025
Stochastic Matching Bandits with Rare Optimization Updates
Jung-hun Kim
Min-hwan Oh
201
0
0
04 Sep 2025
Achieving Limited Adaptivity for Multinomial Logistic Bandits
Sukruta Prakash Midigeshi
Tanmay Goyal
Gaurav Sinha
151
1
0
05 Aug 2025
Optimal and Practical Batched Linear Bandit Algorithm
Sanghoon Yu
Min-hwan Oh
356
1
0
11 Jul 2025
Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation
Conference on Uncertainty in Artificial Intelligence (UAI), 2025
Runze Zhao
Yue Yu
Adams Yiyue Zhu
Chen Yang
Dongruo Zhou
283
1
0
20 May 2025
Breaking the $\log(1/\Delta_2)$ Barrier: Better Batched Best Arm Identification with Adaptive Grids
Tianyuan Jin
Qin Zhang
Dongruo Zhou
280
0
0
29 Jan 2025
Optimal Batched Linear Bandits
Xuanfei Ren
Tianyuan Jin
Pan Xu
371
6
0
06 Jun 2024
Batched Stochastic Bandit for Nondegenerate Functions
IEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2024
Yu Liu
Yunlu Shu
Tianyu Wang
607
2
0
09 May 2024
Generalized Linear Bandits with Limited Adaptivity
Neural Information Processing Systems (NeurIPS), 2024
Ayush Sawarni
Nirjhar Das
Siddharth Barman
Gaurav Sinha
828
16
0
10 Apr 2024
Batched Nonparametric Contextual Bandits
Rong Jiang
Cong Ma
OffRL
529
4
0
27 Feb 2024
Near-Optimal Reinforcement Learning with Self-Play under Adaptivity Constraints
Dan Qiao
Yu Wang
OffRL
322
4
0
02 Feb 2024
Experiment Planning with Function Approximation
Neural Information Processing Systems (NeurIPS), 2024
Aldo Pacchiano
Jonathan Lee
Emma Brunskill
OffRL
237
6
0
10 Jan 2024
Federated Linear Bandits with Finite Adversarial Actions
Neural Information Processing Systems (NeurIPS), 2023
Li Fan
Ruida Zhou
Chao Tian
Cong Shen
FedML
373
3
0
02 Nov 2023
Sample-Efficiency in Multi-Batch Reinforcement Learning: The Need for Dimension-Dependent Adaptivity
International Conference on Learning Representations (ICLR), 2023
Emmeran Johnson
Ciara Pike-Burke
Patrick Rebeschini
OffRL
371
2
0
02 Oct 2023
Policy Finetuning in Reinforcement Learning via Design of Experiments using Offline Data
Neural Information Processing Systems (NeurIPS), 2023
Ruiqi Zhang
Andrea Zanette
OffRL
OnRL
331
11
0
10 Jul 2023
From Random Search to Bandit Learning in Metric Measure Spaces
Chuying Han
Yasong Feng
Tianyu Wang
488
3
0
19 May 2023
Cooperative Multi-Agent Reinforcement Learning: Asynchronous Communication and Linear Function Approximation
International Conference on Machine Learning (ICML), 2023
Yifei Min
Jiafan He
Tianhao Wang
Quanquan Gu
397
13
0
10 May 2023
CO-BED: Information-Theoretic Contextual Optimization via Bayesian Experimental Design
International Conference on Machine Learning (ICML), 2023
Desi R. Ivanova
Joel Jennings
Tom Rainforth
Cheng Zhang
Adam Foster
351
4
0
27 Feb 2023
Active learning for data streams: a survey
Machine-mediated learning (ML), 2023
Davide Cacciarelli
M. Kulahci
361
95
0
17 Feb 2023
A Lipschitz Bandits Approach for Continuous Hyperparameter Optimization
Yasong Feng
Weijian Luo
Yimin Huang
Tianyu Wang
410
10
0
03 Feb 2023
Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear Bandit Algorithms
Annual Conference Computational Learning Theory (COLT), 2022
Osama A. Hanna
Lin F. Yang
Christina Fragouli
380
20
0
08 Nov 2022
Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Zihan Zhang
Yuhang Jiang
Yuanshuo Zhou
Xiangyang Ji
OffRL
257
14
0
15 Oct 2022
Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning with Linear Function Approximation
International Conference on Learning Representations (ICLR), 2022
Dan Qiao
Yu Wang
OffRL
335
15
0
03 Oct 2022
Contextual Bandits with Large Action Spaces: Made Practical
International Conference on Machine Learning (ICML), 2022
Yinglun Zhu
Dylan J. Foster
John Langford
Paul Mineiro
376
34
0
12 Jul 2022
A Simple and Provably Efficient Algorithm for Asynchronous Federated Contextual Linear Bandits
Neural Information Processing Systems (NeurIPS), 2022
Jiafan He
Tianhao Wang
Yifei Min
Quanquan Gu
FedML
313
41
0
07 Jul 2022
Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost
International Conference on Machine Learning (ICML), 2022
Sanae Amani
Tor Lattimore
András György
Lin F. Yang
FedML
241
14
0
26 May 2022
Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality
International Conference on Learning Representations (ICLR), 2022
Jiawei Huang
Jinglin Chen
Li Zhao
Tao Qin
Nan Jiang
Tie-Yan Liu
OffRL
352
29
0
14 Feb 2022
A Benchmark for Low-Switching-Cost Reinforcement Learning
Shusheng Xu
Yancheng Liang
Yunfei Li
S. Du
Yi Wu
OffRL
179
0
0
13 Dec 2021
Lipschitz Bandits with Batched Feedback
Yasong Feng
Zengfeng Huang
Tianyu Wang
470
18
0
19 Oct 2021
Batched Thompson Sampling
Cem Kalkanli
Ayfer Özgür
OffRL
283
24
0
01 Oct 2021
Design of Experiments for Stochastic Contextual Linear Bandits
Neural Information Processing Systems (NeurIPS), 2021
Andrea Zanette
Kefan Dong
Jonathan Lee
Emma Brunskill
OffRL
225
20
0
21 Jul 2021
Parallelizing Thompson Sampling
Neural Information Processing Systems (NeurIPS), 2021
Amin Karbasi
Vahab Mirrokni
M. Shadravan
233
28
0
02 Jun 2021
Parallelizing Contextual Bandits
Jeffrey Chan
Aldo Pacchiano
Nilesh Tripuraneni
Yun S. Song
Peter L. Bartlett
Michael I. Jordan
290
3
0
21 May 2021
An Exponential Lower Bound for Linearly-Realizable MDPs with Constant Suboptimality Gap
Neural Information Processing Systems (NeurIPS), 2021
Yuanhao Wang
Ruosong Wang
Sham Kakade
OffRL
454
48
0
23 Mar 2021
Online Convex Optimization with Continuous Switching Constraint
Neural Information Processing Systems (NeurIPS), 2021
Guanghui Wang
Yuanyu Wan
Tianbao Yang
Lijun Zhang
176
11
0
21 Mar 2021
Encrypted Linear Contextual Bandit
International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Evrard Garcelon
Vianney Perchet
Matteo Pirotta
FedML
234
2
0
17 Mar 2021
Batched Neural Bandits
ACM / IMS Journal of Data Science (JIDS), 2021
Quanquan Gu
Amin Karbasi
Khashayar Khosravi
Vahab Mirrokni
Dongruo Zhou
BDL
OffRL
140
28
0
25 Feb 2021
Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity Constraints
Neural Information Processing Systems (NeurIPS), 2021
Chi Jin
Zhuoran Yang
Zhaoran Wang
OffRL
559
169
0
06 Jan 2021
Impact of Representation Learning in Linear Bandits
Jiaqi Yang
Wei Hu
Jason D. Lee
S. Du
328
57
0
13 Oct 2020
Double Explore-then-Commit: Asymptotic Optimality and Beyond
Annual Conference Computational Learning Theory (COLT), 2020
Tianyuan Jin
Pan Xu
Xiaokui Xiao
Quanquan Gu
224
31
0
21 Feb 2020