Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.18447
Cited By
Pessimism Principle Can Be Effective: Towards a Framework for Zero-Shot Transfer Reinforcement Learning
24 May 2025
Chi Zhang
Ziying Jia
George Atia
Sihong He
Yue Wang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Pessimism Principle Can Be Effective: Towards a Framework for Zero-Shot Transfer Reinforcement Learning"
50 / 65 papers shown
Title
Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data
Chengrui Qu
Laixi Shi
Kishan Panaganti
Pengcheng You
Adam Wierman
OffRL
OnRL
76
2
0
06 Nov 2024
The Limits of Transfer Reinforcement Learning with Latent Low-rank Structure
Tyler Sam
Yudong Chen
Chao Yu
OffRL
52
1
0
28 Oct 2024
On the Convergence Rates of Federated Q-Learning across Heterogeneous Environments
Muxing Wang
Pengkun Yang
Lili Su
FedML
54
2
0
05 Sep 2024
Model-Free Robust Reinforcement Learning with Sample Complexity Analysis
Yudan Wang
Shaofeng Zou
Yue Wang
OOD
38
4
0
24 Jun 2024
Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm
Miao Lu
Han Zhong
Tong Zhang
Jose H. Blanchet
OffRL
OOD
86
8
0
04 Apr 2024
Sample Complexity of Offline Distributionally Robust Linear Markov Decision Processes
He Wang
Laixi Shi
Yuejie Chi
OffRL
55
9
0
19 Mar 2024
Distributionally Robust Off-Dynamics Reinforcement Learning: Provable Efficiency with Linear Function Approximation
Zhishuai Liu
Pan Xu
OOD
OffRL
68
10
0
23 Feb 2024
MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator
Xiao-Yin Liu
Xiao-Hu Zhou
Guo-Tao Li
Hao Li
Mei-Jiang Gui
Tian-Yu Xiang
De-Xing Huang
Zeng-Guang Hou
OffRL
58
5
0
07 Dec 2023
Domain Randomization via Entropy Maximization
Gabriele Tiboni
Pascal Klink
Jan Peters
Tatiana Tommasi
Carlo DÉramo
Georgia Chalvatzaki
58
16
0
03 Nov 2023
Dynamics Generalisation in Reinforcement Learning via Adaptive Context-Aware Policies
Michael Beukman
Devon Jarvis
Richard Klein
Steven D. James
Benjamin Rosman
56
11
0
25 Oct 2023
Understanding the Effects of RLHF on LLM Generalisation and Diversity
Robert Kirk
Ishita Mediratta
Christoforos Nalmpantis
Jelena Luketina
Eric Hambro
Edward Grefenstette
Roberta Raileanu
AI4CE
ALM
141
135
0
10 Oct 2023
A Survey of Imitation Learning: Algorithms, Recent Developments, and Challenges
Maryam Zare
P. Kebria
Abbas Khosravi
Saeid Nahavandi
36
87
0
05 Sep 2023
Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation
Ruida Zhou
Tao-Wen Liu
Min Cheng
D. Kalathil
P. R. Kumar
Chao Tian
69
23
0
17 Jul 2023
The Curious Price of Distributional Robustness in Reinforcement Learning with a Generative Model
Laixi Shi
Gen Li
Yuting Wei
Yuxin Chen
Matthieu Geist
Yuejie Chi
OOD
57
35
0
26 May 2023
The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond
Jiin Woo
Gauri Joshi
Yuejie Chi
FedML
40
21
0
18 May 2023
Model-Free Robust Average-Reward Reinforcement Learning
Yue Wang
Alvaro Velasquez
George Atia
Ashley Prater-Bennette
Shaofeng Zou
37
12
0
17 May 2023
Improved Sample Complexity Bounds for Distributionally Robust Reinforcement Learning
Zaiyan Xu
Kishan Panaganti
D. Kalathil
OOD
OffRL
34
36
0
05 Mar 2023
Robust Markov Decision Processes without Model Estimation
Wenhao Yang
Hanfengzhai Wang
Tadashi Kozuno
S. Jordan
Zhihua Zhang
61
4
0
02 Feb 2023
An Efficient Solution to s-Rectangular Robust Markov Decision Processes
Navdeep Kumar
Kfir Y. Levy
Kaixin Wang
Shie Mannor
44
4
0
31 Jan 2023
Policy Gradient for Rectangular Robust Markov Decision Processes
Navdeep Kumar
E. Derman
Matthieu Geist
Kfir Y. Levy
Shie Mannor
50
21
0
31 Jan 2023
Policy Gradient in Robust MDPs with Global Convergence Guarantee
Qiuhao Wang
C. Ho
Marek Petrik
63
27
0
20 Dec 2022
Online Policy Optimization for Robust MDP
Jing Dong
Jingwei Li
Baoxiang Wang
J.N. Zhang
OffRL
61
14
0
28 Sep 2022
Robust Reinforcement Learning using Offline Data
Kishan Panaganti
Zaiyan Xu
D. Kalathil
Mohammad Ghavamzadeh
OffRL
62
72
0
10 Aug 2022
Online vs. Offline Adaptive Domain Randomization Benchmark
Gabriele Tiboni
Karol Arndt
Giuseppe Averta
Ville Kyrki
Tatiana Tommasi
OffRL
24
5
0
29 Jun 2022
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
Rui Yang
Chenjia Bai
Xiaoteng Ma
Zhaoran Wang
Chongjie Zhang
Lei Han
OffRL
62
77
0
06 Jun 2022
Provable Benefits of Representational Transfer in Reinforcement Learning
Alekh Agarwal
Yuda Song
Wen Sun
Kaiwen Wang
Mengdi Wang
Xuezhou Zhang
OffRL
75
35
0
29 May 2022
Policy Gradient Method For Robust Reinforcement Learning
Yue Wang
Shaofeng Zou
86
73
0
15 May 2022
Unbiased Multilevel Monte Carlo methods for intractable distributions: MLMC meets MCMC
Guanyang Wang
T. Wang
52
15
0
11 Apr 2022
Federated Reinforcement Learning with Environment Heterogeneity
Hao Jin
Yang Peng
Wenhao Yang
Shusen Wang
Zhihua Zhang
79
70
0
06 Apr 2022
Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity
Laixi Shi
Gen Li
Yuting Wei
Yuxin Chen
Yuejie Chi
OffRL
57
93
0
28 Feb 2022
All You Need Is Supervised Learning: From Imitation Learning to Meta-RL With Upside Down RL
Kai Arulkumaran
Dylan R. Ashley
Jürgen Schmidhuber
R. Srivastava
OffRL
75
7
0
24 Feb 2022
Training Robots without Robots: Deep Imitation Learning for Master-to-Robot Policy Transfer
Heecheol Kim
Yoshiyuki Ohmura
Akihiko Nagakubo
Yasuo Kuniyoshi
40
24
0
19 Feb 2022
Contextualize Me -- The Case for Context in Reinforcement Learning
C. Benjamins
Theresa Eimer
Frederik Schubert
Aditya Mohan
Sebastian Dohler
André Biedenkapp
Bodo Rosenhahn
Frank Hutter
Marius Lindauer
OffRL
52
30
0
09 Feb 2022
Sample Complexity of Robust Reinforcement Learning with a Generative Model
Kishan Panaganti
D. Kalathil
98
74
0
02 Dec 2021
A Survey of Zero-shot Generalisation in Deep Reinforcement Learning
Robert Kirk
Amy Zhang
Edward Grefenstette
Tim Rocktaschel
OffRL
61
161
0
18 Nov 2021
Twice regularized MDPs and the equivalence between robustness and regularization
E. Derman
Matthieu Geist
Shie Mannor
64
55
0
12 Oct 2021
Understanding Domain Randomization for Sim-to-real Transfer
Xiaoyu Chen
Jiachen Hu
Chi Jin
Lihong Li
Liwei Wang
94
115
0
07 Oct 2021
Online Robust Reinforcement Learning with Model Uncertainty
Yue Wang
Shaofeng Zou
OOD
OffRL
95
102
0
29 Sep 2021
DR2L: Surfacing Corner Cases to Robustify Autonomous Driving via Domain Randomization Reinforcement Learning
Haoyi Niu
Jianming Hu
Zheyu Cui
Jianming Hu
103
17
0
25 Jul 2021
Towards Theoretical Understandings of Robust Markov Decision Processes: Sample Complexity and Asymptotics
Wenhao Yang
Liangyu Zhang
Zhihua Zhang
42
34
0
09 May 2021
Survey on reinforcement learning for language processing
Víctor Uc Cetina
Nicolás Navarro-Guerrero
A. Martín-González
C. Weber
S. Wermter
OffRL
41
103
0
12 Apr 2021
Towards Personalized Federated Learning
A. Tan
Han Yu
Li-zhen Cui
Qiang Yang
FedML
AI4CE
282
855
0
01 Mar 2021
Multi-Task Reinforcement Learning with Context-based Representations
Shagun Sodhani
Amy Zhang
Joelle Pineau
46
185
0
11 Feb 2021
Deep Reinforcement Learning for the Control of Robotic Manipulation: A Focussed Mini-Review
Rongrong Liu
F. Nageotte
P. Zanne
M. de Mathelin
Birgitta Dresp
74
146
0
08 Feb 2021
Is Pessimism Provably Efficient for Offline RL?
Ying Jin
Zhuoran Yang
Zhaoran Wang
OffRL
88
352
0
30 Dec 2020
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning
Younggyo Seo
Kimin Lee
I. Clavera
Thanard Kurutach
Jinwoo Shin
Pieter Abbeel
51
37
0
26 Oct 2020
Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey
Wenshuai Zhao
Jorge Peña Queralta
Tomi Westerlund
OffRL
104
724
0
24 Sep 2020
Probabilistic Active Meta-Learning
Jean Kaddour
Steindór Sæmundsson
M. Deisenroth
49
35
0
17 Jul 2020
Multi-Task Reinforcement Learning with Soft Modularization
Ruihan Yang
Huazhe Xu
Yi Wu
Xiaolong Wang
44
179
0
30 Mar 2020
Adaptive Personalized Federated Learning
Yuyang Deng
Mohammad Mahdi Kamani
M. Mahdavi
FedML
278
549
0
30 Mar 2020
1
2
Next