Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2012.15085
Cited By
v1
v2
v3 (latest)
Is Pessimism Provably Efficient for Offline RL?
International Conference on Machine Learning (ICML), 2020
30 December 2020
Ying Jin
Zhuoran Yang
Zhaoran Wang
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Is Pessimism Provably Efficient for Offline RL?"
50 / 290 papers shown
Title
Unsupervised Behavior Extraction via Random Intent Priors
Neural Information Processing Systems (NeurIPS), 2023
Haotian Hu
Yiqin Yang
Jianing Ye
Ziqing Mai
Chongjie Zhang
OffRL
257
14
0
28 Oct 2023
Pessimistic Off-Policy Multi-Objective Optimization
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
S. Alizadeh
Aniruddha Bhargava
Karthick Gopalswamy
Lalit P. Jain
Branislav Kveton
Ge Liu
OffRL
207
2
0
28 Oct 2023
Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage
Conference on Learning for Dynamics & Control (L4DC), 2023
Kishan Panaganti
Zaiyan Xu
D. Kalathil
Mohammad Ghavamzadeh
OOD
OffRL
300
12
0
27 Oct 2023
Corruption-Robust Offline Reinforcement Learning with General Function Approximation
Neural Information Processing Systems (NeurIPS), 2023
Chen Ye
Rui Yang
Quanquan Gu
Tong Zhang
OffRL
380
29
0
23 Oct 2023
Contrastive Preference Learning: Learning from Human Feedback without RL
Joey Hejna
Rafael Rafailov
Harshit S. Sikchi
Chelsea Finn
S. Niekum
W. B. Knox
Dorsa Sadigh
OffRL
516
71
0
20 Oct 2023
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption
Rui Yang
Han Zhong
Jiawei Xu
Amy Zhang
Chong Zhang
Lei Han
Tong Zhang
OffRL
OnRL
388
24
0
19 Oct 2023
Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning
Conference on Robot Learning (CoRL), 2023
Jianlan Luo
Perry Dong
Jeffrey Wu
Aviral Kumar
Xinyang Geng
Sergey Levine
OffRL
243
34
0
18 Oct 2023
Bi-Level Offline Policy Optimization with Limited Exploration
Neural Information Processing Systems (NeurIPS), 2023
Wenzhuo Zhou
OffRL
283
5
0
10 Oct 2023
When is Agnostic Reinforcement Learning Statistically Tractable?
Neural Information Processing Systems (NeurIPS), 2023
Zeyu Jia
Gene Li
Alexander Rakhlin
Ayush Sekhari
Nathan Srebro
OffRL
280
7
0
09 Oct 2023
Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning
International Conference on Learning Representations (ICLR), 2023
Qiwei Di
Heyang Zhao
Jiafan He
Quanquan Gu
OffRL
225
8
0
02 Oct 2023
Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
Zhihan Liu
Hao Hu
Shenao Zhang
Hongyi Guo
Shuqi Ke
Boyi Liu
Zhaoran Wang
LLMAG
LRM
464
45
0
29 Sep 2023
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness
Journal of Artificial Intelligence Research (JAIR), 2023
Xiaoyu Wen
Xudong Yu
Rui Yang
Chenjia Bai
Zhen Wang
OffRL
OnRL
169
13
0
29 Sep 2023
Stackelberg Batch Policy Learning
Wenzhuo Zhou
Annie Qu
OffRL
246
1
0
28 Sep 2023
Importance-Weighted Offline Learning Done Right
International Conference on Algorithmic Learning Theory (ALT), 2023
Germano Gabbianelli
Gergely Neu
Matteo Papini
OffRL
181
12
0
27 Sep 2023
Zero-Shot Reinforcement Learning from Low Quality Data
Neural Information Processing Systems (NeurIPS), 2023
Scott Jeen
Tom Bewley
Jonathan M. Cullen
OffRL
OnRL
276
12
0
26 Sep 2023
Distributional Shift-Aware Off-Policy Interval Estimation: A Unified Error Quantification Framework
Wenzhuo Zhou
Yuhan Li
Ruoqing Zhu
Annie Qu
OffRL
265
7
0
23 Sep 2023
Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
Jianzhun Shao
Yun Qu
Chen Chen
Hongchang Zhang
Xiangyang Ji
OffRL
182
36
0
22 Sep 2023
An Offline Learning Approach to Propagator Models
Social Science Research Network (SSRN), 2023
Eyal Neuman
Wolfgang Stockinger
Yufei Zhang
OffRL
209
7
0
06 Sep 2023
Fast and Regret Optimal Best Arm Identification: Fundamental Limits and Low-Complexity Algorithms
Neural Information Processing Systems (NeurIPS), 2023
Qining Zhang
Lei Ying
509
6
0
01 Sep 2023
Settling the Sample Complexity of Online Reinforcement Learning
Annual Conference Computational Learning Theory (COLT), 2023
Zihan Zhang
Yuxin Chen
Jason D. Lee
S. Du
OffRL
698
34
0
25 Jul 2023
Bayesian Safe Policy Learning with Chance Constrained Optimization: Application to Military Security Assessment during the Vietnam War
Zeyang Jia
Eli Ben-Michael
Kosuke Imai
256
6
0
17 Jul 2023
Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement
Neural Information Processing Systems (NeurIPS), 2023
Hui Yuan
Kaixuan Huang
Chengzhuo Ni
Minshuo Chen
Mengdi Wang
DiffM
238
44
0
13 Jul 2023
Policy Finetuning in Reinforcement Learning via Design of Experiments using Offline Data
Neural Information Processing Systems (NeurIPS), 2023
Ruiqi Zhang
Andrea Zanette
OffRL
OnRL
264
9
0
10 Jul 2023
Provably Efficient UCB-type Algorithms For Learning Predictive State Representations
International Conference on Learning Representations (ICLR), 2023
Ruiquan Huang
Yitao Liang
J. Yang
OffRL
364
6
0
01 Jul 2023
Supervised Pretraining Can Learn In-Context Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
Jonathan Lee
Annie Xie
Aldo Pacchiano
Yash Chandak
Chelsea Finn
Ofir Nachum
Emma Brunskill
OffRL
308
118
0
26 Jun 2023
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization
Neural Information Processing Systems (NeurIPS), 2023
Jinxin Liu
Hongyin Zhang
Zifeng Zhuang
Yachen Kang
Xuetao Zhang
Bin Wang
OffRL
376
12
0
26 Jun 2023
Offline Policy Evaluation for Reinforcement Learning with Adaptively Collected Data
Sunil Madhow
Dan Xiao
Ming Yin
Yu-Xiang Wang
OffRL
243
0
0
24 Jun 2023
Active Coverage for PAC Reinforcement Learning
Annual Conference Computational Learning Theory (COLT), 2023
Aymen Al Marjani
Andrea Tirinzoni
E. Kaufmann
OffRL
171
5
0
23 Jun 2023
Deep Generative Models for Decision-Making and Control
Michael Janner
274
3
0
15 Jun 2023
Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources
International Conference on Machine Learning (ICML), 2023
Chengshuai Shi
Wei Xiong
Cong Shen
Jing Yang
OffRL
184
5
0
14 Jun 2023
Oracle-Efficient Pessimism: Offline Policy Optimization in Contextual Bandits
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Lequn Wang
A. Krishnamurthy
Aleksandrs Slivkins
OffRL
290
12
0
13 Jun 2023
Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective
Neural Information Processing Systems (NeurIPS), 2023
Zeyu Zhang
Yi-Hsun Su
Hui Yuan
Yiran Wu
R. Balasubramanian
Qingyun Wu
Huazheng Wang
Mengdi Wang
OffRL
CML
360
7
0
13 Jun 2023
Policy Regularization with Dataset Constraint for Offline Reinforcement Learning
International Conference on Machine Learning (ICML), 2023
Yuhang Ran
Yi-Chen Li
Fuxiang Zhang
Zongzhang Zhang
Yang Yu
OffRL
209
41
0
11 Jun 2023
Survival Instinct in Offline Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
Anqi Li
Dipendra Kumar Misra
Andrey Kolobov
Ching-An Cheng
OffRL
253
20
0
05 Jun 2023
On Optimal Caching and Model Multiplexing for Large Model Inference
Banghua Zhu
Ying Sheng
Lianmin Zheng
Clark W. Barrett
Sai Li
Jiantao Jiao
282
27
0
03 Jun 2023
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding
International Conference on Learning Representations (ICLR), 2023
Alizée Pace
Hugo Yèche
Bernhard Schölkopf
Gunnar Rätsch
Guy Tennenholtz
OffRL
185
8
0
01 Jun 2023
Achieving Fairness in Multi-Agent Markov Decision Processes Using Reinforcement Learning
Peizhong Ju
A. Ghosh
Ness B. Shroff
252
5
0
01 Jun 2023
Improving Offline RL by Blending Heuristics
International Conference on Learning Representations (ICLR), 2023
Sinong Geng
Aldo Pacchiano
Andrey Kolobov
Ching-An Cheng
OffRL
209
10
0
01 Jun 2023
Offline Meta Reinforcement Learning with In-Distribution Online Adaptation
International Conference on Machine Learning (ICML), 2023
Jianhao Wang
Jin Zhang
Haozhe Jiang
Junyu Zhang
Liwei Wang
Chongjie Zhang
OffRL
246
12
0
31 May 2023
High-probability sample complexities for policy evaluation with linear function approximation
IEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2023
Gen Li
Weichen Wu
Yuejie Chi
Cong Ma
Alessandro Rinaldo
Yuting Wei
OffRL
393
9
0
30 May 2023
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?
International Conference on Machine Learning (ICML), 2023
Rui Yang
Yong Lin
Xiaoteng Ma
Haotian Hu
Chongjie Zhang
Tong Zhang
OffRL
205
33
0
30 May 2023
Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration
Neural Information Processing Systems (NeurIPS), 2023
Zhihan Liu
Miao Lu
Wei Xiong
Han Zhong
Haotian Hu
Shenao Zhang
Sirui Zheng
Zhuoran Yang
Zhaoran Wang
OffRL
328
24
0
29 May 2023
Reinforcement Learning with Human Feedback: Learning Dynamic Choices via Pessimism
Zihao Li
Zhuoran Yang
Mengdi Wang
OffRL
433
80
0
29 May 2023
The Curious Price of Distributional Robustness in Reinforcement Learning with a Generative Model
Neural Information Processing Systems (NeurIPS), 2023
Laixi Shi
Gen Li
Yuting Wei
Yuxin Chen
Matthieu Geist
Yuejie Chi
OOD
398
50
0
26 May 2023
The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
Kaiwen Wang
Kevin Zhou
Runzhe Wu
Nathan Kallus
Wen Sun
OffRL
442
23
0
25 May 2023
PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning
Jianxiong Li
Xiao Hu
Haoran Xu
Jingjing Liu
Xianyuan Zhan
Ya Zhang
OffRL
OnRL
267
25
0
25 May 2023
Provable Offline Preference-Based Reinforcement Learning
International Conference on Learning Representations (ICLR), 2023
Wenhao Zhan
Masatoshi Uehara
Nathan Kallus
Jason D. Lee
Wen Sun
OffRL
329
38
0
24 May 2023
Offline Primal-Dual Reinforcement Learning for Linear MDPs
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Germano Gabbianelli
Gergely Neu
Nneka Okolo
Matteo Papini
OffRL
232
12
0
22 May 2023
Offline Reinforcement Learning with Additional Covering Distributions
Chenjie Mao
OffRL
227
0
0
22 May 2023
Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
Gen Li
Wenhao Zhan
Jason D. Lee
Yuejie Chi
Yuxin Chen
OffRL
OnRL
248
16
0
17 May 2023
Previous
1
2
3
4
5
6
Next