2202.06385
Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost
International Conference on Machine Learning (ICML), 2022
13 February 2022
Dan Qiao
Ming Yin
Ming Min
Yu Wang
Papers citing
"Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost"
16 / 16 papers shown
The Adaptivity Barrier in Batched Nonparametric Bandits: Sharp Characterization of the Price of Unknown Margin
Rong Jiang
Cong Ma
209
0
0
05 Nov 2025
A Tutorial: An Intuitive Explanation of Offline Reinforcement Learning Theory
Fengdi Che
OffRL
184
0
0
11 Aug 2025
Gap-Dependent Bounds for Q-Learning using Reference-Advantage Decomposition
International Conference on Learning Representations (ICLR), 2024
Zhong Zheng
Haochen Zhang
Lingzhou Xue
OffRL
447
9
0
10 Oct 2024
To Switch or Not to Switch? Balanced Policy Switching in Offline Reinforcement Learning
Tao Ma
Xuzhi Yang
Zoltan Szabo
OffRL
415
1
0
01 Jul 2024
Batched Nonparametric Contextual Bandits
Rong Jiang
Cong Ma
OffRL
538
4
0
27 Feb 2024
Policy Finetuning in Reinforcement Learning via Design of Experiments using Offline Data
Neural Information Processing Systems (NeurIPS), 2023
Ruiqi Zhang
Andrea Zanette
OffRL
OnRL
341
11
0
10 Jul 2023
Regret-Optimal Model-Free Reinforcement Learning for Discounted MDPs with Short Burn-In Time
Neural Information Processing Systems (NeurIPS), 2023
Xiang Ji
Gen Li
OffRL
431
9
0
24 May 2023
Minimax-Optimal Reward-Agnostic Exploration in Reinforcement Learning
Annual Conference on Computational Learning Theory (COLT), 2023
Gen Li
Yuling Yan
Yuxin Chen
Jianqing Fan
OffRL
370
16
0
14 Apr 2023
A Reduction-based Framework for Sequential Decision Making with Delayed Feedback
Neural Information Processing Systems (NeurIPS), 2023
Yunchang Yang
Hangshi Zhong
Tianhao Wu
B. Liu
Liwei Wang
S. Du
OffRL
601
10
0
03 Feb 2023
Near-Optimal Differentially Private Reinforcement Learning
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Dan Qiao
Yu Wang
368
17
0
09 Dec 2022
Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Zihan Zhang
Yuhang Jiang
Yuanshuo Zhou
Xiangyang Ji
OffRL
258
14
0
15 Oct 2022
Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient
Ming Yin
Mengdi Wang
Yu Wang
OffRL
418
12
0
03 Oct 2022
Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning with Linear Function Approximation
International Conference on Learning Representations (ICLR), 2022
Dan Qiao
Yu Wang
OffRL
338
15
0
03 Oct 2022
Doubly Fair Dynamic Pricing
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Jianyu Xu
Dan Qiao
Yu Wang
319
11
0
23 Sep 2022
Offline Reinforcement Learning with Differential Privacy
Neural Information Processing Systems (NeurIPS), 2022
Dan Qiao
Yu Wang
OffRL
438
29
0
02 Jun 2022
Online Sub-Sampling for Reinforcement Learning with General Function Approximation
Dingwen Kong
Ruslan Salakhutdinov
Ruosong Wang
Lin F. Yang
OffRL
277
1
0
14 Jun 2021