ResearchTrend.AI
arXiv:2305.09659
Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage

16 May 2023
Jose H. Blanchet
Miao Lu
Tong Zhang
Han Zhong
    OffRL

Papers citing "Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage"

29 / 29 papers shown
Learning an Optimal Assortment Policy under Observational Data
Yuxuan Han
Han Zhong
Miao Lu
Jose H. Blanchet
Zhengyuan Zhou
OffRL
55
0
0
10 Feb 2025
Model-Based Offline Reinforcement Learning with Reliability-Guaranteed Sequence Modeling
Shenghong He
OffRL
73
0
0
10 Feb 2025
Preference-Based Multi-Agent Reinforcement Learning: Data Coverage and Algorithmic Techniques
Natalia Zhang
X. Wang
Qiwen Cui
Runlong Zhou
Sham Kakade
Simon S. Du
OffRL
35
0
0
10 Jan 2025
Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data Corruptions
Rui Yang
Jie Wang
Guoping Wu
B. Li
AAML
OffRL
29
1
0
01 Nov 2024
Upper and Lower Bounds for Distributionally Robust Off-Dynamics Reinforcement Learning
Zhishuai Liu
Weixin Wang
Pan Xu
16
1
0
30 Sep 2024
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Yen-Ru Lai
Fu-Chieh Chang
Pei-Yuan Wu
OffRL
58
1
0
22 Aug 2024
Tractable Equilibrium Computation in Markov Games through Risk Aversion
Eric Mazumdar
Kishan Panaganti
Laixi Shi
16
1
0
20 Jun 2024
Statistical Learning of Distributionally Robust Stochastic Control in Continuous State Spaces
Shengbo Wang
Nian Si
Jose H. Blanchet
Zhengyuan Zhou
19
0
0
17 Jun 2024
Roping in Uncertainty: Robustness and Regularization in Markov Games
Jeremy McMahan
Giovanni Artiglio
Qiaomin Xie
26
2
0
13 Jun 2024
Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer
Zhihan Liu
Miao Lu
Shenao Zhang
Boyi Liu
Hongyi Guo
Yingxiang Yang
Jose H. Blanchet
Zhaoran Wang
25
41
0
26 May 2024
Efficient Duple Perturbation Robustness in Low-rank MDPs
Yang Hu
Haitong Ma
Bo Dai
Na Li
23
0
0
11 Apr 2024
Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm
Miao Lu
Han Zhong
Tong Zhang
Jose H. Blanchet
OffRL
OOD
61
4
0
04 Apr 2024
Sample Complexity of Offline Distributionally Robust Linear Markov Decision Processes
He Wang
Laixi Shi
Yuejie Chi
OffRL
12
6
0
19 Mar 2024
Distributionally Robust Off-Dynamics Reinforcement Learning: Provable Efficiency with Linear Function Approximation
Zhishuai Liu
Pan Xu
OOD
OffRL
21
8
0
23 Feb 2024
On the Foundation of Distributionally Robust Reinforcement Learning
Shengbo Wang
Nian Si
Jose H. Blanchet
Zhengyuan Zhou
OffRL
11
16
0
15 Nov 2023
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption
Rui Yang
Han Zhong
Jiawei Xu
Amy Zhang
Chong Zhang
Lei Han
Tong Zhang
OffRL
OnRL
25
15
0
19 Oct 2023
Distributionally Robust Model-based Reinforcement Learning with Large State Spaces
Shyam Sundhar Ramesh
Pier Giuseppe Sessa
Yifan Hu
Andreas Krause
Ilija Bogunovic
OOD
16
10
0
05 Sep 2023
Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation
Ruida Zhou
Tao-Wen Liu
Min Cheng
D. Kalathil
P. R. Kumar
Chao Tian
20
19
0
17 Jul 2023
Seeing is not Believing: Robust Reinforcement Learning against Spurious Correlation
Wenhao Ding
Laixi Shi
Yuejie Chi
Ding Zhao
OOD
19
18
0
15 Jul 2023
Policy Gradient Algorithms for Robust MDPs with Non-Rectangular Uncertainty Sets
Mengmeng Li
Daniel Kuhn
Tobias Sutter
19
9
0
30 May 2023
Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration
Zhihan Liu
Miao Lu
Wei Xiong
Han Zhong
Haotian Hu
Shenao Zhang
Sirui Zheng
Zhuoran Yang
Zhaoran Wang
OffRL
17
22
0
29 May 2023
The Curious Price of Distributional Robustness in Reinforcement Learning with a Generative Model
Laixi Shi
Gen Li
Yuting Wei
Yuxin Chen
M. Geist
Yuejie Chi
OOD
10
23
0
26 May 2023
Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian
Paria Rashidinejad
Hanlin Zhu
Kunhe Yang
Stuart J. Russell
Jiantao Jiao
OffRL
22
23
0
01 Nov 2022
Robust $Q$-learning Algorithm for Markov Decision Processes under Wasserstein Uncertainty
Ariel Neufeld
J. Sester
OOD
16
14
0
30 Sep 2022
Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
Miao Lu
Yifei Min
Zhaoran Wang
Zhuoran Yang
OffRL
31
22
0
26 May 2022
Policy Gradient Method For Robust Reinforcement Learning
Yue Wang
Shaofeng Zou
79
67
0
15 May 2022
Online Robust Reinforcement Learning with Model Uncertainty
Yue Wang
Shaofeng Zou
OOD
OffRL
63
96
0
29 Sep 2021
Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage
Masatoshi Uehara
Wen Sun
OffRL
89
144
0
13 Jul 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
321
1,662
0
04 May 2020