Pessimism Principle Can Be Effective: Towards a Framework for Zero-Shot Transfer Reinforcement Learning

24 May 2025
Chi Zhang
Ziying Jia
George Atia
Sihong He
Yue Wang

Papers citing "Pessimism Principle Can Be Effective: Towards a Framework for Zero-Shot Transfer Reinforcement Learning"

50 / 65 papers shown
Title
Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data
Chengrui Qu
Laixi Shi
Kishan Panaganti
Pengcheng You
Adam Wierman
OffRL
OnRL
76
2
0
06 Nov 2024
The Limits of Transfer Reinforcement Learning with Latent Low-rank Structure
Tyler Sam
Yudong Chen
Chao Yu
OffRL
52
1
0
28 Oct 2024
On the Convergence Rates of Federated Q-Learning across Heterogeneous Environments
Muxing Wang
Pengkun Yang
Lili Su
FedML
54
2
0
05 Sep 2024
Model-Free Robust Reinforcement Learning with Sample Complexity Analysis
Yudan Wang
Shaofeng Zou
Yue Wang
OOD
38
4
0
24 Jun 2024
Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm
Miao Lu
Han Zhong
Tong Zhang
Jose H. Blanchet
OffRL
OOD
86
8
0
04 Apr 2024
Sample Complexity of Offline Distributionally Robust Linear Markov Decision Processes
He Wang
Laixi Shi
Yuejie Chi
OffRL
55
9
0
19 Mar 2024
Distributionally Robust Off-Dynamics Reinforcement Learning: Provable Efficiency with Linear Function Approximation
Zhishuai Liu
Pan Xu
OOD
OffRL
68
10
0
23 Feb 2024
MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator
Xiao-Yin Liu
Xiao-Hu Zhou
Guo-Tao Li
Hao Li
Mei-Jiang Gui
Tian-Yu Xiang
De-Xing Huang
Zeng-Guang Hou
OffRL
58
5
0
07 Dec 2023
Domain Randomization via Entropy Maximization
Gabriele Tiboni
Pascal Klink
Jan Peters
Tatiana Tommasi
Carlo D'Eramo
Georgia Chalvatzaki
58
16
0
03 Nov 2023
Dynamics Generalisation in Reinforcement Learning via Adaptive Context-Aware Policies
Michael Beukman
Devon Jarvis
Richard Klein
Steven D. James
Benjamin Rosman
56
11
0
25 Oct 2023
Understanding the Effects of RLHF on LLM Generalisation and Diversity
Robert Kirk
Ishita Mediratta
Christoforos Nalmpantis
Jelena Luketina
Eric Hambro
Edward Grefenstette
Roberta Raileanu
AI4CE
ALM
141
135
0
10 Oct 2023
A Survey of Imitation Learning: Algorithms, Recent Developments, and Challenges
Maryam Zare
P. Kebria
Abbas Khosravi
Saeid Nahavandi
36
87
0
05 Sep 2023
Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation
Ruida Zhou
Tao-Wen Liu
Min Cheng
D. Kalathil
P. R. Kumar
Chao Tian
69
23
0
17 Jul 2023
The Curious Price of Distributional Robustness in Reinforcement Learning with a Generative Model
Laixi Shi
Gen Li
Yuting Wei
Yuxin Chen
Matthieu Geist
Yuejie Chi
OOD
57
35
0
26 May 2023
The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond
Jiin Woo
Gauri Joshi
Yuejie Chi
FedML
40
21
0
18 May 2023
Model-Free Robust Average-Reward Reinforcement Learning
Yue Wang
Alvaro Velasquez
George Atia
Ashley Prater-Bennette
Shaofeng Zou
37
12
0
17 May 2023
Improved Sample Complexity Bounds for Distributionally Robust Reinforcement Learning
Zaiyan Xu
Kishan Panaganti
D. Kalathil
OOD
OffRL
34
36
0
05 Mar 2023
Robust Markov Decision Processes without Model Estimation
Wenhao Yang
Hanfengzhai Wang
Tadashi Kozuno
S. Jordan
Zhihua Zhang
61
4
0
02 Feb 2023
An Efficient Solution to s-Rectangular Robust Markov Decision Processes
Navdeep Kumar
Kfir Y. Levy
Kaixin Wang
Shie Mannor
44
4
0
31 Jan 2023
Policy Gradient for Rectangular Robust Markov Decision Processes
Navdeep Kumar
E. Derman
Matthieu Geist
Kfir Y. Levy
Shie Mannor
50
21
0
31 Jan 2023
Policy Gradient in Robust MDPs with Global Convergence Guarantee
Qiuhao Wang
C. Ho
Marek Petrik
63
27
0
20 Dec 2022
Online Policy Optimization for Robust MDP
Jing Dong
Jingwei Li
Baoxiang Wang
J.N. Zhang
OffRL
61
14
0
28 Sep 2022
Robust Reinforcement Learning using Offline Data
Kishan Panaganti
Zaiyan Xu
D. Kalathil
Mohammad Ghavamzadeh
OffRL
62
72
0
10 Aug 2022
Online vs. Offline Adaptive Domain Randomization Benchmark
Gabriele Tiboni
Karol Arndt
Giuseppe Averta
Ville Kyrki
Tatiana Tommasi
OffRL
24
5
0
29 Jun 2022
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
Rui Yang
Chenjia Bai
Xiaoteng Ma
Zhaoran Wang
Chongjie Zhang
Lei Han
OffRL
62
77
0
06 Jun 2022
Provable Benefits of Representational Transfer in Reinforcement Learning
Alekh Agarwal
Yuda Song
Wen Sun
Kaiwen Wang
Mengdi Wang
Xuezhou Zhang
OffRL
75
35
0
29 May 2022
Policy Gradient Method For Robust Reinforcement Learning
Yue Wang
Shaofeng Zou
86
73
0
15 May 2022
Unbiased Multilevel Monte Carlo methods for intractable distributions: MLMC meets MCMC
Guanyang Wang
T. Wang
52
15
0
11 Apr 2022
Federated Reinforcement Learning with Environment Heterogeneity
Hao Jin
Yang Peng
Wenhao Yang
Shusen Wang
Zhihua Zhang
79
70
0
06 Apr 2022
Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity
Laixi Shi
Gen Li
Yuting Wei
Yuxin Chen
Yuejie Chi
OffRL
57
93
0
28 Feb 2022
All You Need Is Supervised Learning: From Imitation Learning to Meta-RL With Upside Down RL
Kai Arulkumaran
Dylan R. Ashley
Jürgen Schmidhuber
R. Srivastava
OffRL
75
7
0
24 Feb 2022
Training Robots without Robots: Deep Imitation Learning for Master-to-Robot Policy Transfer
Heecheol Kim
Yoshiyuki Ohmura
Akihiko Nagakubo
Yasuo Kuniyoshi
40
24
0
19 Feb 2022
Contextualize Me -- The Case for Context in Reinforcement Learning
C. Benjamins
Theresa Eimer
Frederik Schubert
Aditya Mohan
Sebastian Dohler
André Biedenkapp
Bodo Rosenhahn
Frank Hutter
Marius Lindauer
OffRL
52
30
0
09 Feb 2022
Sample Complexity of Robust Reinforcement Learning with a Generative Model
Kishan Panaganti
D. Kalathil
98
74
0
02 Dec 2021
A Survey of Zero-shot Generalisation in Deep Reinforcement Learning
Robert Kirk
Amy Zhang
Edward Grefenstette
Tim Rocktäschel
OffRL
61
161
0
18 Nov 2021
Twice regularized MDPs and the equivalence between robustness and regularization
E. Derman
Matthieu Geist
Shie Mannor
64
55
0
12 Oct 2021
Understanding Domain Randomization for Sim-to-real Transfer
Xiaoyu Chen
Jiachen Hu
Chi Jin
Lihong Li
Liwei Wang
94
115
0
07 Oct 2021
Online Robust Reinforcement Learning with Model Uncertainty
Yue Wang
Shaofeng Zou
OOD
OffRL
95
102
0
29 Sep 2021
DR2L: Surfacing Corner Cases to Robustify Autonomous Driving via Domain Randomization Reinforcement Learning
Haoyi Niu
Jianming Hu
Zheyu Cui
Jianming Hu
103
17
0
25 Jul 2021
Towards Theoretical Understandings of Robust Markov Decision Processes: Sample Complexity and Asymptotics
Wenhao Yang
Liangyu Zhang
Zhihua Zhang
42
34
0
09 May 2021
Survey on reinforcement learning for language processing
Víctor Uc Cetina
Nicolás Navarro-Guerrero
A. Martín-González
C. Weber
S. Wermter
OffRL
41
103
0
12 Apr 2021
Towards Personalized Federated Learning
A. Tan
Han Yu
Li-zhen Cui
Qiang Yang
FedML
AI4CE
282
855
0
01 Mar 2021
Multi-Task Reinforcement Learning with Context-based Representations
Shagun Sodhani
Amy Zhang
Joelle Pineau
46
185
0
11 Feb 2021
Deep Reinforcement Learning for the Control of Robotic Manipulation: A Focussed Mini-Review
Rongrong Liu
F. Nageotte
P. Zanne
M. de Mathelin
Birgitta Dresp
74
146
0
08 Feb 2021
Is Pessimism Provably Efficient for Offline RL?
Ying Jin
Zhuoran Yang
Zhaoran Wang
OffRL
88
352
0
30 Dec 2020
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning
Younggyo Seo
Kimin Lee
I. Clavera
Thanard Kurutach
Jinwoo Shin
Pieter Abbeel
51
37
0
26 Oct 2020
Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey
Wenshuai Zhao
Jorge Peña Queralta
Tomi Westerlund
OffRL
104
724
0
24 Sep 2020
Probabilistic Active Meta-Learning
Jean Kaddour
Steindór Sæmundsson
M. Deisenroth
49
35
0
17 Jul 2020
Multi-Task Reinforcement Learning with Soft Modularization
Ruihan Yang
Huazhe Xu
Yi Wu
Xiaolong Wang
44
179
0
30 Mar 2020
Adaptive Personalized Federated Learning
Yuyang Deng
Mohammad Mahdi Kamani
M. Mahdavi
FedML
278
549
0
30 Mar 2020