ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2401.10700
  4. Cited By
Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion
  Model

Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model

19 January 2024
Yinan Zheng
Jianxiong Li
Dongjie Yu
Yujie Yang
Shengbo Eben Li
Xianyuan Zhan
Jingjing Liu
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"

31 / 31 papers shown
Title
Token Is All You Need: Cognitive Planning through Belief-Intent Co-Evolution
Token Is All You Need: Cognitive Planning through Belief-Intent Co-Evolution
Shiyao Sang
51
0
0
30 Oct 2025
Towards Robust Zero-Shot Reinforcement Learning
Towards Robust Zero-Shot Reinforcement Learning
Kexin Zheng
Lauriane Teyssier
Yinan Zheng
Yu Luo
Xiayuan Zhan
OffRL
315
0
0
17 Oct 2025
Flow Matching-Based Autonomous Driving Planning with Advanced Interactive Behavior Modeling
Flow Matching-Based Autonomous Driving Planning with Advanced Interactive Behavior Modeling
Tianyi Tan
Yinan Zheng
Ruiming Liang
Zexu Wang
Kexin Zheng
Jinliang Zheng
Jianxiong Li
Xianyuan Zhan
Jingjing Liu
80
3
0
13 Oct 2025
Boundary-to-Region Supervision for Offline Safe Reinforcement Learning
Boundary-to-Region Supervision for Offline Safe Reinforcement Learning
HuiKang Su
Dengyun Peng
Zifeng Zhuang
Yuhan Liu
Qiguang Chen
Donglin Wang
Qinghe Liu
OffRL
112
0
0
30 Sep 2025
Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving
Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving
Pengxiang Li
Yinan Zheng
Y. Wang
Huimin Wang
Hang Zhao
Jingjing Liu
Xianyuan Zhan
Kun Zhan
Xianpeng Lang
88
5
0
24 Sep 2025
Drive As You Like: Strategy-Level Motion Planning Based on A Multi-Head Diffusion Model
Drive As You Like: Strategy-Level Motion Planning Based on A Multi-Head Diffusion Model
Fan Ding
Xuewen Luo
Hwa Hui Tew
Ruturaj Reddy
Xikun Wang
Junn Yong Loo
DiffM
101
0
0
23 Aug 2025
One Subgoal at a Time: Zero-Shot Generalization to Arbitrary Linear Temporal Logic Requirements in Multi-Task Reinforcement Learning
One Subgoal at a Time: Zero-Shot Generalization to Arbitrary Linear Temporal Logic Requirements in Multi-Task Reinforcement Learning
Zijian Guo
İlker Işık
Hijaz Ahmad
Wenchao Li
OffRLAI4CE
319
2
0
03 Aug 2025
Semi-gradient DICE for Offline Constrained Reinforcement Learning
Woosung Kim
JunHo Seo
Jongmin Lee
Byung-Jun Lee
OffRL
104
0
0
10 Jun 2025
STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation
STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation
Hossein Goli
Michael Gimelfarb
Nathan Samuel de Lara
Haruki Nishimura
Masha Itkina
Florian Shkurti
OffRL
203
1
0
27 May 2025
Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL
Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL
Zhikun Tao
Gang Xiong
He Fang
Zhen Shen
Yunjun Han
Qing-Shan Jia
OffRL
350
0
0
13 May 2025
Learning Conservative Neural Control Barrier Functions from Offline Data
Learning Conservative Neural Control Barrier Functions from Offline Data
Ihab Tabbara
Hussein Sibai
OffRL
279
0
0
01 May 2025
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2025
Haoran Xu
Shuozhe Li
Harshit S. Sikchi
S. Niekum
Amy Zhang
OffRL
315
1
0
17 Apr 2025
SACA: A Scenario-Aware Collision Avoidance Framework for Autonomous Vehicles Integrating LLMs-Driven Reasoning
SACA: A Scenario-Aware Collision Avoidance Framework for Autonomous Vehicles Integrating LLMs-Driven Reasoning
Shiyue Zhao
Junzhi Zhang
Neda Masoud
Heye Huang
Xingpeng Xia
Chengkun He
LRM
352
2
0
31 Mar 2025
Don't Trade Off Safety: Diffusion Regularization for Constrained Offline RL
Don't Trade Off Safety: Diffusion Regularization for Constrained Offline RL
Junyu Guo
Zhi Zheng
Donghao Ying
Ming Jin
Shangding Gu
C. Spanos
Javad Lavaei
OffRL
502
0
0
18 Feb 2025
Data Center Cooling System Optimization Using Offline Reinforcement Learning
Data Center Cooling System Optimization Using Offline Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2025
Xianyuan Zhan
Xiangyu Zhu
Peng Cheng
Xiao Hu
Ziteng He
...
Chenhui Liu
Tianshun Hong
Huiwen Zheng
Yunxin Liu
Feng Zhao
AI4CE
363
2
0
17 Feb 2025
Skill Expansion and Composition in Parameter Space
Skill Expansion and Composition in Parameter SpaceInternational Conference on Learning Representations (ICLR), 2025
Tenglong Liu
Junjie Li
Yinan Zheng
Haoyi Niu
Yixing Lan
Xin Xu
Xianyuan Zhan
292
10
0
09 Feb 2025
From Uncertain to Safe: Conformal Fine-Tuning of Diffusion Models for Safe PDE Control
From Uncertain to Safe: Conformal Fine-Tuning of Diffusion Models for Safe PDE Control
Tailin Wu
Xiaowei Qian
Wenhao Deng
Rui Wang
Haodong Feng
...
Tao Zhang
Long Wei
Yue Wang
Zhi-Ming Ma
Tailin Wu
AI4CE
450
1
0
04 Feb 2025
Diffusion-Based Planning for Autonomous Driving with Flexible GuidanceInternational Conference on Learning Representations (ICLR), 2025
Yinan Zheng
Ruiming Liang
Kexin Zheng
Jinliang Zheng
Liyuan Mao
...
Weihao Gu
Rui Ai
Shengbo Eben Li
Xianyuan Zhan
Jingjing Liu
282
62
0
26 Jan 2025
Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline Policy
Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline Policy
Keru Chen
Honghao Wei
Zhigang Deng
Sen Lin
OffRLOnRL
313
1
0
31 Dec 2024
Off-dynamics Conditional Diffusion Planners
Off-dynamics Conditional Diffusion PlannersIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
Wen Zheng Terence Ng
Jianda Chen
Tianwei Zhang
DiffMOffRL
280
0
0
16 Oct 2024
ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with
  Stationary Distribution Shift Regularization
ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift RegularizationInternational Conference on Learning Representations (ICLR), 2024
The Viet Bui
Thanh Hong Nguyen
Tien Mai
OffRL
284
4
0
02 Oct 2024
Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning
Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal LearningIEEE International Conference on Robotics and Automation (ICRA), 2024
Jianxiong Li
Zhihao Wang
Jinliang Zheng
Xiaoai Zhou
Guanming Wang
...
Yu Liu
Jingjing Liu
Ya-Qin Zhang
Junzhi Yu
Xianyuan Zhan
199
4
0
02 Oct 2024
Handling Long-Term Safety and Uncertainty in Safe Reinforcement Learning
Handling Long-Term Safety and Uncertainty in Safe Reinforcement LearningConference on Robot Learning (CoRL), 2024
Jonas Günster
Puze Liu
Jan Peters
Davide Tateo
OffRL
217
3
0
18 Sep 2024
Bridging the gap between Learning-to-plan, Motion Primitives and Safe
  Reinforcement Learning
Bridging the gap between Learning-to-plan, Motion Primitives and Safe Reinforcement LearningConference on Robot Learning (CoRL), 2024
Piotr Kicki
Davide Tateo
Puze Liu
Jonas Guenster
Jan Peters
Krzysztof Walas
204
5
0
26 Aug 2024
OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement
  Learning
OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning
Yi-Fan Yao
Zhepeng Cen
Wenhao Ding
Hao-ming Lin
Shiqi Liu
Tingnan Zhang
Wenhao Yu
Ding Zhao
OffRLOnRL
208
8
0
19 Jul 2024
FREA: Feasibility-Guided Generation of Safety-Critical Scenarios with
  Reasonable Adversariality
FREA: Feasibility-Guided Generation of Safety-Critical Scenarios with Reasonable Adversariality
Keyu Chen
Yuheng Lei
Hao Cheng
Haoran Wu
Wenchao Sun
Sifa Zheng
AAML
254
6
0
05 Jun 2024
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in
  Tabular MDP
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
Subhojyoti Mukherjee
Josiah P. Hanna
Robert Nowak
OffRL
189
0
0
04 Jun 2024
Instruction-Guided Visual Masking
Instruction-Guided Visual Masking
Jinliang Zheng
Jianxiong Li
Si Cheng
Yinan Zheng
Jiaming Li
Jihao Liu
Yu Liu
Jingjing Liu
Xianyuan Zhan
236
16
0
30 May 2024
Safe Reinforcement Learning on the Constraint Manifold: Theory and
  Applications
Safe Reinforcement Learning on the Constraint Manifold: Theory and Applications
Puze Liu
Haitham Bou-Ammar
Jan Peters
Davide Tateo
171
14
0
13 Apr 2024
Policy Bifurcation in Safe Reinforcement Learning
Policy Bifurcation in Safe Reinforcement Learning
Wenjun Zou
Yao Lyu
Jie Li
Yujie Yang
Shengbo Eben Li
Jingliang Duan
Xianyuan Zhan
Jingjing Liu
Yaqin Zhang
Keqiang Li
303
2
0
19 Mar 2024
ODICE: Revealing the Mystery of Distribution Correction Estimation via
  Orthogonal-gradient Update
ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update
Liyuan Mao
Haoran Xu
Weinan Zhang
Xianyuan Zhan
304
20
0
01 Feb 2024
1