Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2207.02200
Cited By
Offline RL Policies Should be Trained to be Adaptive
5 July 2022
Dibya Ghosh
Anurag Ajay
Pulkit Agrawal
Sergey Levine
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Offline RL Policies Should be Trained to be Adaptive"
38 / 38 papers shown
Title
Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning
Jifeng Hu
Sili Huang
Z. Yang
Shengchao Hu
Li Shen
H. Chen
Lichao Sun
Yi-Ju Chang
Dacheng Tao
OffRL
140
0
0
03 May 2025
State Combinatorial Generalization In Decision Making With Conditional Diffusion Models
Xintong Duan
Yutong He
Fahim Tajwar
Wen-Tse Chen
Ruslan Salakhutdinov
Jeff Schneider
OffRL
AI4CE
99
0
0
22 Jan 2025
Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline Policy
Keru Chen
Honghao Wei
Zhigang Deng
Sen Lin
OffRL
OnRL
91
0
0
31 Dec 2024
Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data Corruptions
Rui Yang
Jie Wang
Guoping Wu
B. Li
AAML
OffRL
31
1
0
01 Nov 2024
Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces
Jifeng Hu
Sili Huang
Li Shen
Zhejian Yang
Shengchao Hu
Shisong Tang
H. Chen
Yi-Ju Chang
Dacheng Tao
Lichao Sun
OffRL
36
0
0
21 Oct 2024
Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning
Jiayu Chen
Wentse Chen
Jeff Schneider
OffRL
29
1
0
15 Oct 2024
An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning
Qian Lin
Zongkai Liu
Danying Mo
Chao Yu
OffRL
26
0
0
16 Sep 2024
Disentangled Representations for Causal Cognition
Filippo Torresan
Manuel Baltieri
CML
35
1
0
30 Jun 2024
Augmenting Offline RL with Unlabeled Data
Zhao Wang
Briti Gangopadhyay
Jia-Fong Yeh
Shingo Takamatsu
OffRL
26
0
0
11 Jun 2024
Bayesian Design Principles for Offline-to-Online Reinforcement Learning
Haotian Hu
Yiqin Yang
Jianing Ye
Chengjie Wu
Ziqing Mai
Yujing Hu
Tangjie Lv
Changjie Fan
Qianchuan Zhao
Chongjie Zhang
OffRL
OnRL
37
3
0
31 May 2024
Exclusively Penalized Q-learning for Offline Reinforcement Learning
Junghyuk Yeom
Yonghyeon Jo
Jungmo Kim
Sanghyeon Lee
Seungyul Han
OffRL
40
2
0
23 May 2024
Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
Xinyu Zhang
Wenjie Qiu
Yi-Chen Li
Lei Yuan
Chengxing Jia
Zongzhang Zhang
Yang Yu
OffRL
30
1
0
17 Feb 2024
Unsupervised Behavior Extraction via Random Intent Priors
Haotian Hu
Yiqin Yang
Jianing Ye
Ziqing Mai
Chongjie Zhang
OffRL
32
6
0
28 Oct 2023
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning
Shenzhi Wang
Qisen Yang
Jiawei Gao
Matthieu Lin
Hao Chen
Liwei Wu
Ning Jia
Shiji Song
Gao Huang
OffRL
21
12
0
27 Oct 2023
Learning to Discern: Imitating Heterogeneous Human Demonstrations with Preference and Representation Learning
Sachit Kuhar
Shuo Cheng
Shivang Chopra
Matthew Bronars
Danfei Xu
43
8
0
22 Oct 2023
Efficient Planning with Latent Diffusion
Wenhao Li
DiffM
38
4
0
30 Sep 2023
Budgeting Counterfactual for Offline RL
Yao Liu
Pratik Chaudhari
Rasool Fakoor
OffRL
25
2
0
12 Jul 2023
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization
Jinxin Liu
Hongyin Zhang
Zifeng Zhuang
Yachen Kang
Donglin Wang
Bin Wang
OffRL
42
8
0
26 Jun 2023
Automatic Trade-off Adaptation in Offline RL
Phillip Swazinna
Steffen Udluft
Thomas Runkler
OffRL
29
0
0
16 Jun 2023
Katakomba: Tools and Benchmarks for Data-Driven NetHack
Vladislav Kurenkov
Alexander Nikulin
Denis Tarasov
Sergey Kolesnikov
OffRL
30
5
0
14 Jun 2023
Instructed Diffuser with Temporal Condition Guidance for Offline Reinforcement Learning
Jifeng Hu
Yan Sun
Sili Huang
Siyuan Guo
Hechang Chen
Li Shen
Lichao Sun
Yi-Ju Chang
Dacheng Tao
DiffM
OffRL
40
13
0
08 Jun 2023
Bridging Imitation and Online Reinforcement Learning: An Optimistic Tale
Botao Hao
Rahul Jain
Dengwang Tang
Zheng Wen
OffRL
24
2
0
20 Mar 2023
A Survey of Demonstration Learning
André Rosa de Sousa Porfírio Correia
Luís A. Alexandre
OffRL
28
17
0
20 Mar 2023
Adaptive Policy Learning for Offline-to-Online Reinforcement Learning
Han Zheng
Xufang Luo
Pengfei Wei
Xuan Song
Dongsheng Li
Jing Jiang
OffRL
OnRL
8
21
0
14 Mar 2023
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning
Haichao Zhang
Weiwen Xu
Haonan Yu
CLL
OffRL
OnRL
34
62
0
02 Feb 2023
Anti-Exploration by Random Network Distillation
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Sergey Kolesnikov
30
24
0
31 Jan 2023
Confidence-Conditioned Value Functions for Offline Reinforcement Learning
Joey Hong
Aviral Kumar
Sergey Levine
OffRL
28
20
0
08 Dec 2022
One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement Learning
Marc Rigter
Bruno Lacerda
Nick Hawes
OffRL
11
6
0
30 Nov 2022
Is Conditional Generative Modeling all you need for Decision-Making?
Anurag Ajay
Yilun Du
Abhi Gupta
J. Tenenbaum
Tommi Jaakkola
Pulkit Agrawal
DiffM
47
360
0
28 Nov 2022
Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories
Qinqing Zheng
Mikael Henaff
Brandon Amos
Aditya Grover
OffRL
18
20
0
12 Oct 2022
Law Informs Code: A Legal Informatics Approach to Aligning Artificial Intelligence with Humans
John J. Nay
ELM
AILaw
88
27
0
14 Sep 2022
Mildly Conservative Q-Learning for Offline Reinforcement Learning
Jiafei Lyu
Xiaoteng Ma
Xiu Li
Zongqing Lu
OffRL
29
101
0
09 Jun 2022
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
214
838
0
12 Oct 2021
A Workflow for Offline Model-Free Robotic Reinforcement Learning
Aviral Kumar
Anika Singh
Stephen Tian
Chelsea Finn
Sergey Levine
OffRL
138
84
0
22 Sep 2021
What Matters in Learning from Offline Human Demonstrations for Robot Manipulation
Ajay Mandlekar
Danfei Xu
J. Wong
Soroush Nasiriany
Chen Wang
Rohun Kulkarni
Li Fei-Fei
Silvio Savarese
Yuke Zhu
Roberto Martín-Martín
OffRL
147
471
0
06 Aug 2021
Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability
Dibya Ghosh
Jad Rahme
Aviral Kumar
Amy Zhang
Ryan P. Adams
Sergey Levine
OffRL
272
109
0
13 Jul 2021
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
Seyed Kamyar Seyed Ghasemipour
Dale Schuurmans
S. Gu
OffRL
209
119
0
21 Jul 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
334
1,951
0
04 May 2020
1