Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2401.10700
Cited By
Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model
19 January 2024
Yinan Zheng
Jianxiong Li
Dongjie Yu
Yujie Yang
Shengbo Eben Li
Xianyuan Zhan
Jingjing Liu
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"
31 / 31 papers shown
Title
Token Is All You Need: Cognitive Planning through Belief-Intent Co-Evolution
Shiyao Sang
51
0
0
30 Oct 2025
Towards Robust Zero-Shot Reinforcement Learning
Kexin Zheng
Lauriane Teyssier
Yinan Zheng
Yu Luo
Xiayuan Zhan
OffRL
315
0
0
17 Oct 2025
Flow Matching-Based Autonomous Driving Planning with Advanced Interactive Behavior Modeling
Tianyi Tan
Yinan Zheng
Ruiming Liang
Zexu Wang
Kexin Zheng
Jinliang Zheng
Jianxiong Li
Xianyuan Zhan
Jingjing Liu
80
3
0
13 Oct 2025
Boundary-to-Region Supervision for Offline Safe Reinforcement Learning
HuiKang Su
Dengyun Peng
Zifeng Zhuang
Yuhan Liu
Qiguang Chen
Donglin Wang
Qinghe Liu
OffRL
112
0
0
30 Sep 2025
Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving
Pengxiang Li
Yinan Zheng
Y. Wang
Huimin Wang
Hang Zhao
Jingjing Liu
Xianyuan Zhan
Kun Zhan
Xianpeng Lang
88
5
0
24 Sep 2025
Drive As You Like: Strategy-Level Motion Planning Based on A Multi-Head Diffusion Model
Fan Ding
Xuewen Luo
Hwa Hui Tew
Ruturaj Reddy
Xikun Wang
Junn Yong Loo
DiffM
101
0
0
23 Aug 2025
One Subgoal at a Time: Zero-Shot Generalization to Arbitrary Linear Temporal Logic Requirements in Multi-Task Reinforcement Learning
Zijian Guo
İlker Işık
Hijaz Ahmad
Wenchao Li
OffRL
AI4CE
319
2
0
03 Aug 2025
Semi-gradient DICE for Offline Constrained Reinforcement Learning
Woosung Kim
JunHo Seo
Jongmin Lee
Byung-Jun Lee
OffRL
104
0
0
10 Jun 2025
STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation
Hossein Goli
Michael Gimelfarb
Nathan Samuel de Lara
Haruki Nishimura
Masha Itkina
Florian Shkurti
OffRL
203
1
0
27 May 2025
Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL
Zhikun Tao
Gang Xiong
He Fang
Zhen Shen
Yunjun Han
Qing-Shan Jia
OffRL
350
0
0
13 May 2025
Learning Conservative Neural Control Barrier Functions from Offline Data
Ihab Tabbara
Hussein Sibai
OffRL
279
0
0
01 May 2025
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
International Conference on Learning Representations (ICLR), 2025
Haoran Xu
Shuozhe Li
Harshit S. Sikchi
S. Niekum
Amy Zhang
OffRL
315
1
0
17 Apr 2025
SACA: A Scenario-Aware Collision Avoidance Framework for Autonomous Vehicles Integrating LLMs-Driven Reasoning
Shiyue Zhao
Junzhi Zhang
Neda Masoud
Heye Huang
Xingpeng Xia
Chengkun He
LRM
352
2
0
31 Mar 2025
Don't Trade Off Safety: Diffusion Regularization for Constrained Offline RL
Junyu Guo
Zhi Zheng
Donghao Ying
Ming Jin
Shangding Gu
C. Spanos
Javad Lavaei
OffRL
502
0
0
18 Feb 2025
Data Center Cooling System Optimization Using Offline Reinforcement Learning
International Conference on Learning Representations (ICLR), 2025
Xianyuan Zhan
Xiangyu Zhu
Peng Cheng
Xiao Hu
Ziteng He
...
Chenhui Liu
Tianshun Hong
Huiwen Zheng
Yunxin Liu
Feng Zhao
AI4CE
363
2
0
17 Feb 2025
Skill Expansion and Composition in Parameter Space
International Conference on Learning Representations (ICLR), 2025
Tenglong Liu
Junjie Li
Yinan Zheng
Haoyi Niu
Yixing Lan
Xin Xu
Xianyuan Zhan
292
10
0
09 Feb 2025
From Uncertain to Safe: Conformal Fine-Tuning of Diffusion Models for Safe PDE Control
Tailin Wu
Xiaowei Qian
Wenhao Deng
Rui Wang
Haodong Feng
...
Tao Zhang
Long Wei
Yue Wang
Zhi-Ming Ma
Tailin Wu
AI4CE
450
1
0
04 Feb 2025
Diffusion-Based Planning for Autonomous Driving with Flexible Guidance
International Conference on Learning Representations (ICLR), 2025
Yinan Zheng
Ruiming Liang
Kexin Zheng
Jinliang Zheng
Liyuan Mao
...
Weihao Gu
Rui Ai
Shengbo Eben Li
Xianyuan Zhan
Jingjing Liu
282
62
0
26 Jan 2025
Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline Policy
Keru Chen
Honghao Wei
Zhigang Deng
Sen Lin
OffRL
OnRL
313
1
0
31 Dec 2024
Off-dynamics Conditional Diffusion Planners
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
Wen Zheng Terence Ng
Jianda Chen
Tianwei Zhang
DiffM
OffRL
280
0
0
16 Oct 2024
ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization
International Conference on Learning Representations (ICLR), 2024
The Viet Bui
Thanh Hong Nguyen
Tien Mai
OffRL
284
4
0
02 Oct 2024
Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning
IEEE International Conference on Robotics and Automation (ICRA), 2024
Jianxiong Li
Zhihao Wang
Jinliang Zheng
Xiaoai Zhou
Guanming Wang
...
Yu Liu
Jingjing Liu
Ya-Qin Zhang
Junzhi Yu
Xianyuan Zhan
199
4
0
02 Oct 2024
Handling Long-Term Safety and Uncertainty in Safe Reinforcement Learning
Conference on Robot Learning (CoRL), 2024
Jonas Günster
Puze Liu
Jan Peters
Davide Tateo
OffRL
217
3
0
18 Sep 2024
Bridging the gap between Learning-to-plan, Motion Primitives and Safe Reinforcement Learning
Conference on Robot Learning (CoRL), 2024
Piotr Kicki
Davide Tateo
Puze Liu
Jonas Guenster
Jan Peters
Krzysztof Walas
204
5
0
26 Aug 2024
OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning
Yi-Fan Yao
Zhepeng Cen
Wenhao Ding
Hao-ming Lin
Shiqi Liu
Tingnan Zhang
Wenhao Yu
Ding Zhao
OffRL
OnRL
208
8
0
19 Jul 2024
FREA: Feasibility-Guided Generation of Safety-Critical Scenarios with Reasonable Adversariality
Keyu Chen
Yuheng Lei
Hao Cheng
Haoran Wu
Wenchao Sun
Sifa Zheng
AAML
254
6
0
05 Jun 2024
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
Subhojyoti Mukherjee
Josiah P. Hanna
Robert Nowak
OffRL
189
0
0
04 Jun 2024
Instruction-Guided Visual Masking
Jinliang Zheng
Jianxiong Li
Si Cheng
Yinan Zheng
Jiaming Li
Jihao Liu
Yu Liu
Jingjing Liu
Xianyuan Zhan
236
16
0
30 May 2024
Safe Reinforcement Learning on the Constraint Manifold: Theory and Applications
Puze Liu
Haitham Bou-Ammar
Jan Peters
Davide Tateo
171
14
0
13 Apr 2024
Policy Bifurcation in Safe Reinforcement Learning
Wenjun Zou
Yao Lyu
Jie Li
Yujie Yang
Shengbo Eben Li
Jingliang Duan
Xianyuan Zhan
Jingjing Liu
Yaqin Zhang
Keqiang Li
303
2
0
19 Mar 2024
ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update
Liyuan Mao
Haoran Xu
Weinan Zhang
Xianyuan Zhan
304
20
0
01 Feb 2024
1