Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2012.15085
Cited By
v1
v2
v3 (latest)
Is Pessimism Provably Efficient for Offline RL?
International Conference on Machine Learning (ICML), 2020
30 December 2020
Ying Jin
Zhuoran Yang
Zhaoran Wang
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Is Pessimism Provably Efficient for Offline RL?"
50 / 290 papers shown
Offline congestion games: How feedback type affects data coverage requirement
International Conference on Learning Representations (ICLR), 2022
Haozhe Jiang
Qiwen Cui
Zhihan Xiong
Maryam Fazel
S. Du
OffRL
172
1
0
24 Oct 2022
A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design
Rui Ai
Boxiang Lyu
Zhaoran Wang
Zhuoran Yang
Michael I. Jordan
253
4
0
19 Oct 2022
Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient
International Conference on Learning Representations (ICLR), 2022
Yuda Song
Yi Zhou
Ayush Sekhari
J. Andrew Bagnell
A. Krishnamurthy
Wen Sun
OffRL
OnRL
413
132
0
13 Oct 2022
The Role of Coverage in Online Reinforcement Learning
International Conference on Learning Representations (ICLR), 2022
Tengyang Xie
Dylan J. Foster
Yu Bai
Nan Jiang
Sham Kakade
OffRL
262
70
0
09 Oct 2022
Conservative Bayesian Model-Based Value Expansion for Offline Policy Optimization
International Conference on Learning Representations (ICLR), 2022
Jihwan Jeong
Xiaoyu Wang
Michael Gimelfarb
Hyunwoo J. Kim
Baher Abdulhai
Scott Sanner
OffRL
196
13
0
07 Oct 2022
Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient
Ming Yin
Mengdi Wang
Yu Wang
OffRL
332
12
0
03 Oct 2022
Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL
Neural Information Processing Systems (NeurIPS), 2022
Fengzhuo Zhang
Boyi Liu
Kaixin Wang
Vincent Y. F. Tan
Zhuoran Yang
Zhaoran Wang
OffRL
LRM
251
14
0
20 Sep 2022
Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation
Xiaoteng Ma
Zhipeng Liang
Jose H. Blanchet
MingWen Liu
Li Xia
Jiheng Zhang
Qianchuan Zhao
Zhengyuan Zhou
OOD
OffRL
325
34
0
14 Sep 2022
Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach
Miao Lu
Wenhao Yang
Liangyu Zhang
Zhihua Zhang
OffRL
216
1
0
12 Sep 2022
Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments
Mengxin Yu
Zhuoran Yang
Jianqing Fan
OffRL
321
9
0
23 Aug 2022
Sampling Through the Lens of Sequential Decision Making
J. Dou
Alvin Pan
Runxue Bao
Haiyi Mao
Lei Luo
Zhi-Hong Mao
383
21
0
17 Aug 2022
Distributionally Robust Model-Based Offline Reinforcement Learning with Near-Optimal Sample Complexity
Journal of machine learning research (JMLR), 2022
Laixi Shi
Yuejie Chi
OOD
OffRL
408
90
0
11 Aug 2022
Online Learning with Off-Policy Feedback
International Conference on Algorithmic Learning Theory (ALT), 2022
Germano Gabbianelli
Matteo Papini
Gergely Neu
OffRL
170
4
0
18 Jul 2022
Offline RL Policies Should be Trained to be Adaptive
International Conference on Machine Learning (ICML), 2022
Dibya Ghosh
Anurag Ajay
Pulkit Agrawal
Sergey Levine
OffRL
166
56
0
05 Jul 2022
An Empirical Study of Implicit Regularization in Deep Offline RL
Çağlar Gülçehre
Srivatsan Srinivasan
Jakub Sygnowski
Georg Ostrovski
Mehrdad Farajtabar
Matt Hoffman
Razvan Pascanu
Arnaud Doucet
OffRL
307
20
0
05 Jul 2022
Provably Efficient Offline Reinforcement Learning with Trajectory-Wise Reward
IEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2022
Tengyu Xu
Yue Wang
Shaofeng Zou
Yingbin Liang
OffRL
239
15
0
13 Jun 2022
Federated Offline Reinforcement Learning
Journal of the American Statistical Association (JASA), 2022
D. Zhou
Yufeng Zhang
Aaron Sonabend-W
Zhaoran Wang
Junwei Lu
Tianxi Cai
OffRL
342
18
0
11 Jun 2022
Offline Stochastic Shortest Path: Learning, Evaluation and Towards Optimality
Conference on Uncertainty in Artificial Intelligence (UAI), 2022
Ming Yin
Wenjing Chen
Mengdi Wang
Yu Wang
OffRL
165
6
0
10 Jun 2022
Mildly Conservative Q-Learning for Offline Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Jiafei Lyu
Xiaoteng Ma
Xiu Li
Zongqing Lu
OffRL
324
134
0
09 Jun 2022
On the Role of Discount Factor in Offline Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Haotian Hu
Yiqin Yang
Qianchuan Zhao
Chongjie Zhang
OffRL
259
21
0
07 Jun 2022
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
Neural Information Processing Systems (NeurIPS), 2022
Rui Yang
Chenjia Bai
Xiaoteng Ma
Zhaoran Wang
Chongjie Zhang
Lei Han
OffRL
424
101
0
06 Jun 2022
Pessimistic Off-Policy Optimization for Learning to Rank
European Conference on Artificial Intelligence (ECAI), 2022
Matej Cief
Branislav Kveton
Michal Kompan
OffRL
336
3
0
06 Jun 2022
Reward Poisoning Attacks on Offline Multi-Agent Reinforcement Learning
AAAI Conference on Artificial Intelligence (AAAI), 2022
Young Wu
Jermey McMahan
Xiaojin Zhu
Qiaomin Xie
AAML
OffRL
455
23
0
04 Jun 2022
Incorporating Explicit Uncertainty Estimates into Deep Offline Reinforcement Learning
David Brandfonbrener
Rémi Tachet des Combes
Romain Laroche
OffRL
186
5
0
02 Jun 2022
Offline Reinforcement Learning with Differential Privacy
Neural Information Processing Systems (NeurIPS), 2022
Dan Qiao
Yu Wang
OffRL
313
28
0
02 Jun 2022
On Gap-dependent Bounds for Offline Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Xinqi Wang
Qiwen Cui
S. Du
OffRL
233
16
0
01 Jun 2022
Byzantine-Robust Online and Offline Distributed Reinforcement Learning
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Yiding Chen
Xuezhou Zhang
Jianchao Tan
Mengdi Wang
Xiaojin Zhu
OffRL
325
22
0
01 Jun 2022
Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus
Neural Information Processing Systems (NeurIPS), 2022
Qiwen Cui
S. Du
OffRL
196
23
0
01 Jun 2022
Robust Anytime Learning of Markov Decision Processes
Neural Information Processing Systems (NeurIPS), 2022
Marnix Suilen
T. D. Simão
David Parker
N. Jansen
236
20
0
31 May 2022
Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters
Neural Information Processing Systems (NeurIPS), 2022
Seyed Kamyar Seyed Ghasemipour
S. Gu
Ofir Nachum
OffRL
220
87
0
27 May 2022
Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
International Conference on Learning Representations (ICLR), 2022
Miao Lu
Yifei Min
Zhaoran Wang
Zhuoran Yang
OffRL
380
26
0
26 May 2022
Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret
Neural Information Processing Systems (NeurIPS), 2022
Jiawei Huang
Li Zhao
Tao Qin
Wei Chen
Nan Jiang
Tie-Yan Liu
OffRL
474
4
0
25 May 2022
When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning
International Conference on Learning Representations (ICLR), 2022
Jianxiong Li
Xianyuan Zhan
Haoran Xu
Xiangyu Zhu
Jingjing Liu
Ya Zhang
OffRL
319
31
0
23 May 2022
Offline Policy Comparison with Confidence: Benchmarks and Baselines
Anurag Koul
Mariano Phielipp
Alan Fern
OffRL
205
0
0
22 May 2022
Pessimism for Offline Linear Contextual Bandits using
ℓ
p
\ell_p
ℓ
p
Confidence Sets
Neural Information Processing Systems (NeurIPS), 2022
Gen Li
Cong Ma
Nathan Srebro
OffRL
297
18
0
21 May 2022
Pessimism meets VCG: Learning Dynamic Mechanism Design via Offline Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Boxiang Lyu
Zhaoran Wang
Mladen Kolar
Zhuoran Yang
OffRL
194
9
0
05 May 2022
When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning?
Aviral Kumar
Joey Hong
Anika Singh
Sergey Levine
OffRL
283
96
0
12 Apr 2022
Offline Reinforcement Learning Under Value and Density-Ratio Realizability: The Power of Gaps
Conference on Uncertainty in Artificial Intelligence (UAI), 2022
Jinglin Chen
Nan Jiang
OffRL
339
37
0
25 Mar 2022
Bellman Residual Orthogonalization for Offline Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Andrea Zanette
Martin J. Wainwright
OffRL
341
8
0
24 Mar 2022
The Efficacy of Pessimism in Asynchronous Q-Learning
IEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2022
Yuling Yan
Gen Li
Yuxin Chen
Jianqing Fan
OffRL
287
43
0
14 Mar 2022
Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism
International Conference on Learning Representations (ICLR), 2022
Ming Yin
Yaqi Duan
Mengdi Wang
Yu Wang
OffRL
253
68
0
11 Mar 2022
Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity
International Conference on Machine Learning (ICML), 2022
Laixi Shi
Gen Li
Yuting Wei
Yuxin Chen
Yuejie Chi
OffRL
296
104
0
28 Feb 2022
LobsDICE: Offline Learning from Observation via Stationary Distribution Correction Estimation
Neural Information Processing Systems (NeurIPS), 2022
Geon-hyeong Kim
Jongmin Lee
Youngsoo Jang
Hongseok Yang
Kyungmin Kim
OffRL
297
25
0
28 Feb 2022
Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite Horizons
Journal of the American Statistical Association (JASA), 2022
C. Shi
Shuang Luo
Yuan Le
Hongtu Zhu
R. Song
OffRL
OnRL
238
16
0
26 Feb 2022
Learning Dynamic Mechanisms in Unknown Environments: A Reinforcement Learning Approach
Delin Qu
Boxiang Lyu
Qing-xin Meng
Zhaoran Wang
Zhuoran Yang
Sai Li
243
9
0
25 Feb 2022
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
International Conference on Learning Representations (ICLR), 2022
Chenjia Bai
Lingxiao Wang
Zhuoran Yang
Zhihong Deng
Animesh Garg
Peng Liu
Zhaoran Wang
OffRL
281
156
0
23 Feb 2022
Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets
International Conference on Machine Learning (ICML), 2022
Han Zhong
Wei Xiong
Jiyuan Tan
Liwei Wang
Tong Zhang
Zhaoran Wang
Zhuoran Yang
OffRL
209
42
0
15 Feb 2022
Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality
International Conference on Learning Representations (ICLR), 2022
Jiawei Huang
Jinglin Chen
Li Zhao
Tao Qin
Nan Jiang
Tie-Yan Liu
OffRL
315
27
0
14 Feb 2022
Offline Reinforcement Learning with Realizability and Single-policy Concentrability
Annual Conference Computational Learning Theory (COLT), 2022
Wenhao Zhan
Baihe Huang
Audrey Huang
Nan Jiang
Jason D. Lee
OffRL
620
120
0
09 Feb 2022
Adversarially Trained Actor Critic for Offline Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Ching-An Cheng
Tengyang Xie
Nan Jiang
Alekh Agarwal
OffRL
296
148
0
05 Feb 2022
Previous
1
2
3
4
5
6
Next