Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.09659
Cited By
Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage
16 May 2023
Jose H. Blanchet
Miao Lu
Tong Zhang
Han Zhong
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage"
29 / 29 papers shown
Title
Learning an Optimal Assortment Policy under Observational Data
Yuxuan Han
Han Zhong
Miao Lu
Jose H. Blanchet
Zhengyuan Zhou
OffRL
55
0
0
10 Feb 2025
Model-Based Offline Reinforcement Learning with Reliability-Guaranteed Sequence Modeling
Shenghong He
OffRL
73
0
0
10 Feb 2025
Preference-Based Multi-Agent Reinforcement Learning: Data Coverage and Algorithmic Techniques
Natalia Zhang
X. Wang
Qiwen Cui
Runlong Zhou
Sham Kakade
Simon S. Du
OffRL
35
0
0
10 Jan 2025
Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data Corruptions
Rui Yang
Jie Wang
Guoping Wu
B. Li
AAML
OffRL
29
1
0
01 Nov 2024
Upper and Lower Bounds for Distributionally Robust Off-Dynamics Reinforcement Learning
Zhishuai Liu
Weixin Wang
Pan Xu
16
1
0
30 Sep 2024
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Yen-Ru Lai
Fu-Chieh Chang
Pei-Yuan Wu
OffRL
58
1
0
22 Aug 2024
Tractable Equilibrium Computation in Markov Games through Risk Aversion
Eric Mazumdar
Kishan Panaganti
Laixi Shi
16
1
0
20 Jun 2024
Statistical Learning of Distributionally Robust Stochastic Control in Continuous State Spaces
Shengbo Wang
Nian Si
Jose H. Blanchet
Zhengyuan Zhou
19
0
0
17 Jun 2024
Roping in Uncertainty: Robustness and Regularization in Markov Games
Jeremy McMahan
Giovanni Artiglio
Qiaomin Xie
26
2
0
13 Jun 2024
Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer
Zhihan Liu
Miao Lu
Shenao Zhang
Boyi Liu
Hongyi Guo
Yingxiang Yang
Jose H. Blanchet
Zhaoran Wang
25
41
0
26 May 2024
Efficient Duple Perturbation Robustness in Low-rank MDPs
Yang Hu
Haitong Ma
Bo Dai
Na Li
23
0
0
11 Apr 2024
Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm
Miao Lu
Han Zhong
Tong Zhang
Jose H. Blanchet
OffRL
OOD
61
4
0
04 Apr 2024
Sample Complexity of Offline Distributionally Robust Linear Markov Decision Processes
He Wang
Laixi Shi
Yuejie Chi
OffRL
12
6
0
19 Mar 2024
Distributionally Robust Off-Dynamics Reinforcement Learning: Provable Efficiency with Linear Function Approximation
Zhishuai Liu
Pan Xu
OOD
OffRL
21
8
0
23 Feb 2024
On the Foundation of Distributionally Robust Reinforcement Learning
Shengbo Wang
Nian Si
Jose H. Blanchet
Zhengyuan Zhou
OffRL
11
16
0
15 Nov 2023
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption
Rui Yang
Han Zhong
Jiawei Xu
Amy Zhang
Chong Zhang
Lei Han
Tong Zhang
OffRL
OnRL
25
15
0
19 Oct 2023
Distributionally Robust Model-based Reinforcement Learning with Large State Spaces
Shyam Sundhar Ramesh
Pier Giuseppe Sessa
Yifan Hu
Andreas Krause
Ilija Bogunovic
OOD
16
10
0
05 Sep 2023
Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation
Ruida Zhou
Tao-Wen Liu
Min Cheng
D. Kalathil
P. R. Kumar
Chao Tian
20
19
0
17 Jul 2023
Seeing is not Believing: Robust Reinforcement Learning against Spurious Correlation
Wenhao Ding
Laixi Shi
Yuejie Chi
Ding Zhao
OOD
19
18
0
15 Jul 2023
Policy Gradient Algorithms for Robust MDPs with Non-Rectangular Uncertainty Sets
Mengmeng Li
Daniel Kuhn
Tobias Sutter
19
9
0
30 May 2023
Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration
Zhihan Liu
Miao Lu
Wei Xiong
Han Zhong
Haotian Hu
Shenao Zhang
Sirui Zheng
Zhuoran Yang
Zhaoran Wang
OffRL
17
22
0
29 May 2023
The Curious Price of Distributional Robustness in Reinforcement Learning with a Generative Model
Laixi Shi
Gen Li
Yuting Wei
Yuxin Chen
M. Geist
Yuejie Chi
OOD
10
23
0
26 May 2023
Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian
Paria Rashidinejad
Hanlin Zhu
Kunhe Yang
Stuart J. Russell
Jiantao Jiao
OffRL
22
23
0
01 Nov 2022
Robust
Q
Q
Q
-learning Algorithm for Markov Decision Processes under Wasserstein Uncertainty
Ariel Neufeld
J. Sester
OOD
16
14
0
30 Sep 2022
Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
Miao Lu
Yifei Min
Zhaoran Wang
Zhuoran Yang
OffRL
31
22
0
26 May 2022
Policy Gradient Method For Robust Reinforcement Learning
Yue Wang
Shaofeng Zou
79
67
0
15 May 2022
Online Robust Reinforcement Learning with Model Uncertainty
Yue Wang
Shaofeng Zou
OOD
OffRL
63
96
0
29 Sep 2021
Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage
Masatoshi Uehara
Wen Sun
OffRL
89
144
0
13 Jul 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
321
1,662
0
04 May 2020
1