Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2307.13586
Cited By
Settling the Sample Complexity of Online Reinforcement Learning
25 July 2023
Zihan Zhang
Yuxin Chen
Jason D. Lee
S. Du
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Settling the Sample Complexity of Online Reinforcement Learning"
23 / 23 papers shown
Title
Minimax Optimal Reinforcement Learning with Quasi-Optimism
Harin Lee
Min-hwan Oh
OffRL
38
0
0
02 Mar 2025
A Refined Analysis of UCBVI
Simone Drago
Marco Mussi
Alberto Maria Metelli
58
0
0
24 Feb 2025
Learning in Markov Games with Adaptive Adversaries: Policy Regret, Fundamental Barriers, and Efficient Algorithms
Thanh Nguyen-Tang
Raman Arora
55
1
0
01 Nov 2024
Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents
Safwan Labbi
D. Tiapkin
Lorenzo Mancini
Paul Mangold
Eric Moulines
FedML
50
0
0
30 Oct 2024
Gap-Dependent Bounds for Q-Learning using Reference-Advantage Decomposition
Zhong Zheng
Haochen Zhang
Lingzhou Xue
OffRL
55
2
0
10 Oct 2024
State-free Reinforcement Learning
Mingyu Chen
Aldo Pacchiano
Xuezhou Zhang
38
0
0
27 Sep 2024
HARP: Human-Assisted Regrouping with Permutation Invariant Critic for Multi-Agent Reinforcement Learning
Huawen Hu
Enze Shi
Chenxi Yue
Shuocun Yang
Zihao Wu
Yiwei Li
Tianyang Zhong
Tuo Zhang
Tianming Liu
Shu Zhang
35
1
0
18 Sep 2024
Hybrid Reinforcement Learning Breaks Sample Size Barriers in Linear MDPs
Kevin Tan
Wei Fan
Yuting Wei
OffRL
38
1
0
08 Aug 2024
Narrowing the Gap between Adversarial and Stochastic MDPs via Policy Optimization
D. Tiapkin
Evgenii Chzhen
Gilles Stoltz
41
0
0
08 Jul 2024
Hybrid Reinforcement Learning from Offline Observation Alone
Yuda Song
J. Andrew Bagnell
Aarti Singh
OffRL
40
1
0
11 Jun 2024
Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond
Xutong Liu
Siwei Wang
Jinhang Zuo
Han Zhong
Xuchuang Wang
Zhiyong Wang
Shuai Li
Mohammad Hajiesmaili
J. C. Lui
Wei Chen
54
1
0
03 Jun 2024
Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm
Miao Lu
Han Zhong
Tong Zhang
Jose H. Blanchet
OffRL
OOD
42
1
0
04 Apr 2024
Scale-free Adversarial Reinforcement Learning
Mingyu Chen
Xuezhou Zhang
38
2
0
01 Mar 2024
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning
Kaiwen Wang
Owen Oertell
Alekh Agarwal
Nathan Kallus
Wen Sun
OffRL
46
12
0
11 Feb 2024
Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity
Guhao Feng
Han Zhong
OffRL
34
1
0
28 Dec 2023
Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning
Gen Li
Wenhao Zhan
Jason D. Lee
Yuejie Chi
Yuxin Chen
OffRL
OnRL
38
12
0
17 May 2023
Minimax-Optimal Reward-Agnostic Exploration in Reinforcement Learning
Gen Li
Yuling Yan
Yuxin Chen
Jianqing Fan
OffRL
32
12
0
14 Apr 2023
On Gap-dependent Bounds for Offline Reinforcement Learning
Xinqi Wang
Qiwen Cui
S. Du
OffRL
38
11
0
01 Jun 2022
The Efficacy of Pessimism in Asynchronous Q-Learning
Yuling Yan
Gen Li
Yuxin Chen
Jianqing Fan
OffRL
40
40
0
14 Mar 2022
First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach
Andrew Wagenmaker
Yifang Chen
Max Simchowitz
S. Du
Kevin G. Jamieson
55
31
0
07 Dec 2021
UCB Momentum Q-learning: Correcting the bias without forgetting
Pierre Menard
O. D. Domingues
Xuedong Shang
Michal Valko
42
35
0
01 Mar 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
311
1,662
0
04 May 2020
Reward-Free Exploration for Reinforcement Learning
Chi Jin
A. Krishnamurthy
Max Simchowitz
Tiancheng Yu
OffRL
79
181
0
07 Feb 2020
1