ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2206.13464
  4. Cited By
When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning
v1v2v3v4 (latest)

When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2022
27 June 2022
Haoyi Niu
Sanjay Kariyappa
Yiwen Qiu
Ming Li
Guyue Zhou
Jianming Hu
Xianyuan Zhan
    OffRLOnRL
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)Github (57★)

Papers citing "When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning"

47 / 47 papers shown
Title
Clustering-Based Weight Orthogonalization for Stabilizing Deep Reinforcement Learning
Clustering-Based Weight Orthogonalization for Stabilizing Deep Reinforcement LearningIEEE International Joint Conference on Neural Network (IJCNN), 2025
Guoqing Ma
Y. Zhang
Yuming Dai
Guangfu Hao
Yang Chen
S. Yu
OffRL
64
0
0
02 Nov 2025
Guardian: Decoupling Exploration from Safety in Reinforcement Learning
Guardian: Decoupling Exploration from Safety in Reinforcement Learning
Kaitong Cai
Jusheng Zhang
Jing Yang
Keze Wang
OffRLOnRL
196
0
0
26 Oct 2025
Towards Robust Zero-Shot Reinforcement Learning
Towards Robust Zero-Shot Reinforcement Learning
Kexin Zheng
Lauriane Teyssier
Yinan Zheng
Yu Luo
Xiayuan Zhan
OffRL
263
0
0
17 Oct 2025
Multi-Fidelity Hybrid Reinforcement Learning via Information Gain Maximization
Multi-Fidelity Hybrid Reinforcement Learning via Information Gain Maximization
Houssem Sifaou
Osvaldo Simeone
OffRL
120
0
0
18 Sep 2025
SLA-MORL: SLA-Aware Multi-Objective Reinforcement Learning for HPC Resource Optimization
SLA-MORL: SLA-Aware Multi-Objective Reinforcement Learning for HPC Resource Optimization
S. A. Mostafa
Aravind Mohan
Jianwu Wang
96
1
0
05 Aug 2025
UniLegs: Universal Multi-Legged Robot Control through Morphology-Agnostic Policy Distillation
UniLegs: Universal Multi-Legged Robot Control through Morphology-Agnostic Policy Distillation
Weijie Xi
Zhanxiang Cao
Chenlin Ming
Jianying Zheng
Guyue Zhou
146
0
0
30 Jul 2025
DmC: Nearest Neighbor Guidance Diffusion Model for Offline Cross-domain Reinforcement Learning
DmC: Nearest Neighbor Guidance Diffusion Model for Offline Cross-domain Reinforcement Learning
Linh Le Pham Van
Minh Hoang Nguyen
D. Kieu
Hung Le
Hung The Tran
Sunil R. Gupta
DiffM
178
2
0
28 Jul 2025
Composite Flow Matching for Reinforcement Learning with Shifted-Dynamics Data
Composite Flow Matching for Reinforcement Learning with Shifted-Dynamics Data
Lingkai Kong
Haichuan Wang
Tonghan Wang
Guojun Xiong
Milind Tambe
OffRL
298
6
0
29 May 2025
Hybrid Cross-domain Robust Reinforcement Learning
Hybrid Cross-domain Robust Reinforcement Learning
Linh Le Pham Van
Minh Hoang Nguyen
Hung Le
H. Tran
Sunil R. Gupta
OffRL
182
2
0
29 May 2025
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2025
Haoran Xu
Shuozhe Li
Harshit S. Sikchi
S. Niekum
Amy Zhang
OffRL
291
1
0
17 Apr 2025
Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning
Zhenghai Xue
Lang Feng
Jiacheng Xu
Kang Kang
Xiang Wen
Jingyi Wang
Shuicheng Yan
OffRL
171
0
0
10 Mar 2025
VEM: Environment-Free Exploration for Training GUI Agent with Value Environment Model
VEM: Environment-Free Exploration for Training GUI Agent with Value Environment Model
Jiani Zheng
Lu Wang
Fangkai Yang
Chen Zhang
Shansong Liu
Wenjie Yin
Qingwei Lin
Dongmei Zhang
Saravan Rajmohan
Qi Zhang
OffRL
250
13
0
26 Feb 2025
Data Center Cooling System Optimization Using Offline Reinforcement Learning
Data Center Cooling System Optimization Using Offline Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2025
Xianyuan Zhan
Xiangyu Zhu
Peng Cheng
Xiao Hu
Ziteng He
...
Chenhui Liu
Tianshun Hong
Huiwen Zheng
Yunxin Liu
Feng Zhao
AI4CE
355
1
0
17 Feb 2025
Rapidly Adapting Policies to the Real World via Simulation-Guided Fine-Tuning
Rapidly Adapting Policies to the Real World via Simulation-Guided Fine-TuningInternational Conference on Learning Representations (ICLR), 2025
Patrick Yin
Tyler Westenbroek
Simran Bagaria
Kevin Huang
Ching-an Cheng
Andrey Kobolov
Abhishek Gupta
363
8
0
04 Feb 2025
Dual Alignment Maximin Optimization for Offline Model-based RL
Dual Alignment Maximin Optimization for Offline Model-based RL
Chi Zhou
Wang Luo
Haoran Li
Congying Han
Tiande Guo
Zicheng Zhang
OffRL
416
0
0
02 Feb 2025
Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from
  Shifted-Dynamics Data
Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics DataInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Chengrui Qu
Laixi Shi
Kishan Panaganti
Pengcheng You
Adam Wierman
OffRLOnRL
217
4
0
06 Nov 2024
Dual Action Policy for Robust Sim-to-Real Reinforcement Learning
Dual Action Policy for Robust Sim-to-Real Reinforcement LearningInternational Conference on Artificial Neural Networks (ICANN), 2024
Ng Wen Zheng Terence
Chen Jianda
133
0
0
16 Oct 2024
Off-dynamics Conditional Diffusion Planners
Off-dynamics Conditional Diffusion PlannersIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
Wen Zheng Terence Ng
Jianda Chen
Tianwei Zhang
DiffMOffRL
280
0
0
16 Oct 2024
ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with
  Stationary Distribution Shift Regularization
ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift RegularizationInternational Conference on Learning Representations (ICLR), 2024
The Viet Bui
Thanh Hong Nguyen
Tien Mai
OffRL
276
4
0
02 Oct 2024
xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing
xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing
Haoyi Niu
Qimao Chen
Tenglong Liu
Jianxiong Li
Guyue Zhou
Yi Zhang
Jianming Hu
Xianyuan Zhan
250
1
0
13 Sep 2024
Provable Domain Adaptation for Offline Reinforcement Learning with Limited Samples
Provable Domain Adaptation for Offline Reinforcement Learning with Limited Samples
Weiqin Chen
Xinjie Zhang
Sandipan Mishra
Santiago Paternain
OffRL
343
5
0
22 Aug 2024
Solving Motion Planning Tasks with a Scalable Generative Model
Solving Motion Planning Tasks with a Scalable Generative Model
Yihan Hu
Siqi Chai
Zhening Yang
Jingyu Qian
Kun Li
Wenxin Shao
Haichao Zhang
Wei Xu
Qiang Liu
174
36
0
03 Jul 2024
Benchmarks for Reinforcement Learning with Biased Offline Data and
  Imperfect Simulators
Benchmarks for Reinforcement Learning with Biased Offline Data and Imperfect Simulators
Ori Linial
Guy Tennenholtz
Uri Shalit
OffRL
210
1
0
30 Jun 2024
Residual Learning and Context Encoding for Adaptive Offline-to-Online
  Reinforcement Learning
Residual Learning and Context Encoding for Adaptive Offline-to-Online Reinforcement Learning
Mohammadreza Nakhaei
Aidan Scannell
Joni Pajarinen
OffRL
224
1
0
12 Jun 2024
Hybrid Reinforcement Learning from Offline Observation Alone
Hybrid Reinforcement Learning from Offline Observation Alone
Yuda Song
J. Andrew Bagnell
Aarti Singh
OffRL
239
4
0
11 Jun 2024
OMPO: A Unified Framework for RL under Policy and Dynamics Shifts
OMPO: A Unified Framework for RL under Policy and Dynamics Shifts
Yu-Juan Luo
Tianying Ji
Gang Hua
Jianwei Zhang
Huazhe Xu
Xianyuan Zhan
OffRL
268
4
0
29 May 2024
Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical
  Behaviors in Deep Off-Policy RL
Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL
Yu-Juan Luo
Tianying Ji
Gang Hua
Jianwei Zhang
Huazhe Xu
Xianyuan Zhan
OffRLOnRL
220
7
0
28 May 2024
Cross-Domain Policy Adaptation by Capturing Representation Mismatch
Cross-Domain Policy Adaptation by Capturing Representation Mismatch
Jiafei Lyu
Fuchun Sun
Jingwen Yang
Zongqing Lu
Xiu Li
238
21
0
24 May 2024
Contrastive Representation for Data Filtering in Cross-Domain Offline
  Reinforcement Learning
Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement LearningInternational Conference on Machine Learning (ICML), 2024
Xiaoyu Wen
Chenjia Bai
Kang Xu
Xudong Yu
Yang Zhang
Xuelong Li
Zhen Wang
288
7
0
10 May 2024
Improving Offline Reinforcement Learning with Inaccurate Simulators
Improving Offline Reinforcement Learning with Inaccurate Simulators
Yiwen Hou
Haoyuan Sun
Jinming Ma
Feng Wu
OffRL
123
8
0
07 May 2024
AD4RL: Autonomous Driving Benchmarks for Offline Reinforcement Learning
  with Value-based Dataset
AD4RL: Autonomous Driving Benchmarks for Offline Reinforcement Learning with Value-based DatasetIEEE International Conference on Robotics and Automation (ICRA), 2024
Dongsu Lee
Chanin Eom
Minhae Kwon
GPOffRL
100
14
0
03 Apr 2024
A Comprehensive Survey of Cross-Domain Policy Transfer for Embodied
  Agents
A Comprehensive Survey of Cross-Domain Policy Transfer for Embodied Agents
Haoyi Niu
Jianming Hu
Guyue Zhou
Xianyuan Zhan
129
20
0
07 Feb 2024
ODICE: Revealing the Mystery of Distribution Correction Estimation via
  Orthogonal-gradient Update
ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update
Liyuan Mao
Haoran Xu
Weinan Zhang
Xianyuan Zhan
292
20
0
01 Feb 2024
Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion
  Model
Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model
Yinan Zheng
Jianxiong Li
Dongjie Yu
Yujie Yang
Shengbo Eben Li
Xianyuan Zhan
Jingjing Liu
OffRL
209
48
0
19 Jan 2024
A Conservative Approach for Few-Shot Transfer in Off-Dynamics
  Reinforcement Learning
A Conservative Approach for Few-Shot Transfer in Off-Dynamics Reinforcement Learning
Paul Daoudi
Christophe Prieur
Bogdan Robu
M. Barlier
Ludovic Dos Santos
OffRL
221
1
0
24 Dec 2023
MICRO: Model-Based Offline Reinforcement Learning with a Conservative
  Bellman Operator
MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator
Xiao-Yin Liu
Xiao-Hu Zhou
Guo-Tao Li
Hao Li
Mei-Jiang Gui
Tian-Yu Xiang
De-Xing Huang
Zeng-Guang Hou
OffRL
259
10
0
07 Dec 2023
Uni-O4: Unifying Online and Offline Deep Reinforcement Learning with
  Multi-Step On-Policy Optimization
Uni-O4: Unifying Online and Offline Deep Reinforcement Learning with Multi-Step On-Policy OptimizationInternational Conference on Learning Representations (ICLR), 2023
Kun Lei
Zhengmao He
Chenhao Lu
Kaizhe Hu
Yang Gao
Huazhe Xu
OffRLOnRL
278
23
0
06 Nov 2023
H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps
H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics GapsIEEE International Conference on Robotics and Automation (ICRA), 2023
Haoyi Niu
Tianying Ji
Bingqi Liu
Haocheng Zhao
Xiangyu Zhu
Jianying Zheng
Pengfei Huang
Guyue Zhou
Jianming Hu
Xianyuan Zhan
OffRLOnRLAI4CE
335
17
0
22 Sep 2023
DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning
DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning
Xiao-Yin Liu
Xiao-Hu Zhou
Mei-Jiang Gui
Shiqi Liu
Zhen-Qiu Feng
Hao Li
Mei-Jiang Gui
Tian-Yu Xiang
De-Xing Huang
Zeng-Guang Hou
OffRLOOD
482
7
0
16 Sep 2023
Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local
  Value Regularization
Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value RegularizationNeural Information Processing Systems (NeurIPS), 2023
Xiangsen Wang
Haoran Xu
Yinan Zheng
Xianyuan Zhan
OffRL
306
32
0
21 Jul 2023
Look Beneath the Surface: Exploiting Fundamental Symmetry for
  Sample-Efficient Offline RL
Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RLNeural Information Processing Systems (NeurIPS), 2023
Peng Cheng
Xianyuan Zhan
Zhihao Wu
Wenjia Zhang
Shoucheng Song
Han Wang
Youfang Lin
Li Jiang
OffRL
558
15
0
07 Jun 2023
State Regularized Policy Optimization on Data with Dynamics Shift
State Regularized Policy Optimization on Data with Dynamics ShiftNeural Information Processing Systems (NeurIPS), 2023
Zhenghai Xue
Qingpeng Cai
Shuchang Liu
Dong Zheng
Peng Jiang
Kun Gai
Bo An
OffRL
319
24
0
06 Jun 2023
Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy
  Actor-Critic
Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-CriticInternational Conference on Machine Learning (ICML), 2023
Tianying Ji
Yuping Luo
Gang Hua
Xianyuan Zhan
Jianwei Zhang
Huazhe Xu
OffRLOnRL
346
20
0
05 Jun 2023
Cross-Domain Policy Adaptation via Value-Guided Data Filtering
Cross-Domain Policy Adaptation via Value-Guided Data FilteringNeural Information Processing Systems (NeurIPS), 2023
Kang Xu
Chenjia Bai
Xiaoteng Ma
Dong Wang
Bingyan Zhao
Zhen Wang
Xuelong Li
Wei Li
255
26
0
28 May 2023
Making Offline RL Online: Collaborative World Models for Offline Visual
  Reinforcement Learning
Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Q. Wang
Jun Yang
Yunbo Wang
Xin Jin
Wenjun Zeng
Xiaokang Yang
OffRLOnRL
279
4
0
24 May 2023
(Re)$^2$H2O: Autonomous Driving Scenario Generation via Reversely
  Regularized Hybrid Offline-and-Online Reinforcement Learning
(Re)2^22H2O: Autonomous Driving Scenario Generation via Reversely Regularized Hybrid Offline-and-Online Reinforcement Learning
Haoyi Niu
Kun Ren
Yi Tian Xu
Ziyuan Yang
Yi-Hsin Lin
Yan Zhang
Jianming Hu
OffRL
163
9
0
27 Feb 2023
Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient
Hybrid RL: Using Both Offline and Online Data Can Make RL EfficientInternational Conference on Learning Representations (ICLR), 2022
Yuda Song
Yi Zhou
Ayush Sekhari
J. Andrew Bagnell
A. Krishnamurthy
Wen Sun
OffRLOnRL
279
131
0
13 Oct 2022
1