ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2209.07089
  4. Cited By
Constrained Update Projection Approach to Safe Policy Optimization
v1v2 (latest)

Constrained Update Projection Approach to Safe Policy Optimization

Neural Information Processing Systems (NeurIPS), 2022
15 September 2022
Long Yang
Jiaming Ji
Juntao Dai
Linrui Zhang
Binbin Zhou
Pengfei Li
Yaodong Yang
Gang Pan
ArXiv (abs)PDFHTML

Papers citing "Constrained Update Projection Approach to Safe Policy Optimization"

32 / 32 papers shown
Title
KFCPO: Kronecker-Factored Approximated Constrained Policy Optimization
KFCPO: Kronecker-Factored Approximated Constrained Policy Optimization
Joonyoung Lim
Younghwan Yoo
60
0
0
02 Nov 2025
Safe In-Context Reinforcement Learning
Safe In-Context Reinforcement Learning
Amir Moeini
Minjae Kwon
Alper Kamil Bozkurt
Yuichi Motai
Rohan Chandra
Lu Feng
Shangtong Zhang
OffRL
108
1
0
29 Sep 2025
SPiDR: A Simple Approach for Zero-Shot Safety in Sim-to-Real Transfer
SPiDR: A Simple Approach for Zero-Shot Safety in Sim-to-Real Transfer
Yarden As
Chengrui Qu
Benjamin Unger
Dongho Kang
Max van der Hart
Laixi Shi
Stelian Coros
Adam Wierman
Andreas Krause
OffRL
268
0
0
23 Sep 2025
Incentivizing Safer Actions in Policy Optimization for Constrained Reinforcement Learning
Incentivizing Safer Actions in Policy Optimization for Constrained Reinforcement LearningInternational Joint Conference on Artificial Intelligence (IJCAI), 2025
S. Hazra
P. Dasgupta
Soumyajit Dey
86
0
0
11 Sep 2025
Proactive Constrained Policy Optimization with Preemptive Penalty
Proactive Constrained Policy Optimization with Preemptive Penalty
Ning Yang
Pengyu Wang
Guoqing Liu
Haifeng Zhang
Pin Lyu
Jun Wang
120
0
0
03 Aug 2025
A universal policy wrapper with guarantees
A universal policy wrapper with guarantees
Anton Bolychev
Georgiy Malaniya
Grigory Yaremenko
Anastasia Krasnaya
Pavel Osinenko
OffRL
137
0
0
18 May 2025
Safe Reinforcement Learning using Finite-Horizon Gradient-based
  Estimation
Safe Reinforcement Learning using Finite-Horizon Gradient-based EstimationInternational Conference on Machine Learning (ICML), 2024
Juntao Dai
Yaodong Yang
Qian Zheng
Gang Pan
OffRL
236
3
0
15 Dec 2024
From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement Learning
From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024
Pusen Dong
Tianchen Zhu
Yue Qiu
Haoyi Zhou
Jianxin Li
330
1
0
12 Dec 2024
Embedding Safety into RL: A New Take on Trust Region Methods
Embedding Safety into RL: A New Take on Trust Region Methods
Nikola Milosevic
Johannes Müller
Nico Scherf
375
4
0
05 Nov 2024
Reinfier and Reintrainer: Verification and Interpretation-Driven Safe Deep Reinforcement Learning Frameworks
Reinfier and Reintrainer: Verification and Interpretation-Driven Safe Deep Reinforcement Learning Frameworks
Zixuan Yang
Jiaqi Zheng
Guihai Chen
OffRL
287
0
0
19 Oct 2024
Flipping-based Policy for Chance-Constrained Markov Decision Processes
Flipping-based Policy for Chance-Constrained Markov Decision ProcessesNeural Information Processing Systems (NeurIPS), 2024
Xun Shen
Shuo Jiang
Akifumi Wachi
Kaumune Hashimoto
Sebastien Gros
63
1
0
09 Oct 2024
Constrained Reinforcement Learning for Safe Heat Pump Control
Constrained Reinforcement Learning for Safe Heat Pump Control
Baohe Zhang
Lilli Frison
Thomas Brox
Joschka Bödecker
AI4CE
135
1
0
29 Sep 2024
Autoregressive Policy Optimization for Constrained Allocation Tasks
Autoregressive Policy Optimization for Constrained Allocation TasksNeural Information Processing Systems (NeurIPS), 2024
David Winkel
Niklas Strauß
Maximilian Bernhard
Zongyue Li
Thomas Seidl
Matthias Schubert
138
0
0
27 Sep 2024
Exterior Penalty Policy Optimization with Penalty Metric Network under
  Constraints
Exterior Penalty Policy Optimization with Penalty Metric Network under Constraints
Shiqing Gao
Jiaxin Ding
Luoyi Fu
Xinbing Wang
Cheng Zhou
108
2
0
22 Jul 2024
$\mathrm{E^{2}CFD}$: Towards Effective and Efficient Cost Function
  Design for Safe Reinforcement Learning via Large Language Model
E2CFD\mathrm{E^{2}CFD}E2CFD: Towards Effective and Efficient Cost Function Design for Safe Reinforcement Learning via Large Language Model
Zepeng Wang
Chao Ma
Linjiang Zhou
Libing Wu
Lei Yang
Xiaochuan Shi
Guojun Peng
OffRL
203
0
0
08 Jul 2024
Diffusion Models for Offline Multi-agent Reinforcement Learning with
  Safety Constraints
Diffusion Models for Offline Multi-agent Reinforcement Learning with Safety Constraints
Jianuo Huang
OffRL
179
0
0
30 Jun 2024
Safety through feedback in Constrained RL
Safety through feedback in Constrained RL
Shashank Reddy Chirra
Pradeep Varakantham
P. Paruchuri
OffRL
323
2
0
28 Jun 2024
Verification-Guided Shielding for Deep Reinforcement Learning
Verification-Guided Shielding for Deep Reinforcement Learning
Davide Corsi
Guy Amir
Andoni Rodríguez
César Sánchez
Guy Katz
Roy Fox
AAMLOffRL
241
10
0
10 Jun 2024
GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model
GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model
Zhehua Zhou
Xuan Xie
Yuheng Huang
Zhan Shu
Lei Ma
308
2
0
06 Jun 2024
Enhancing Efficiency of Safe Reinforcement Learning via Sample
  Manipulation
Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation
Shangding Gu
Laixi Shi
Yuhao Ding
Alois Knoll
C. Spanos
Adam Wierman
Ming Jin
OffRL
230
5
0
31 May 2024
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
Shangding Gu
Bilgehan Sel
Yuhao Ding
Lu Wang
Qingwei Lin
Ming Jin
Alois Knoll
223
19
0
02 May 2024
FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework
  for Network Resource Allocation
FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework for Network Resource Allocation
Tianfu Wang
Qilin Fan
Chao Wang
Long Yang
Leilei Ding
Nicholas Jing Yuan
Hui Xiong
205
7
0
19 Apr 2024
Off-Policy Primal-Dual Safe Reinforcement Learning
Off-Policy Primal-Dual Safe Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2024
Zifan Wu
Bo Tang
Qian Lin
Chao Yu
Shangqin Mao
Qianlong Xie
Xingxing Wang
Dong Wang
OffRL
250
7
0
26 Jan 2024
Imitate the Good and Avoid the Bad: An Incremental Approach to Safe
  Reinforcement Learning
Imitate the Good and Avoid the Bad: An Incremental Approach to Safe Reinforcement LearningAAAI Conference on Artificial Intelligence (AAAI), 2023
Huy Hoang
Tien Mai
Pradeep Varakantham
220
8
0
16 Dec 2023
Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark
Safety-Gymnasium: A Unified Safe Reinforcement Learning BenchmarkNeural Information Processing Systems (NeurIPS), 2023
Jiaming Ji
Borong Zhang
Jiayi Zhou
Xuehai Pan
Weidong Huang
Ruiyang Sun
Yiran Geng
Yifan Zhong
Juntao Dai
Yaodong Yang
OffRL
298
106
0
19 Oct 2023
Reduced Policy Optimization for Continuous Control with Hard Constraints
Reduced Policy Optimization for Continuous Control with Hard Constraints
Shutong Ding
Jingya Wang
Yali Du
Ye-ling Shi
154
7
0
14 Oct 2023
SafeDreamer: Safe Reinforcement Learning with World Models
SafeDreamer: Safe Reinforcement Learning with World ModelsInternational Conference on Learning Representations (ICLR), 2023
Weidong Huang
Jiaming Ji
Borong Zhang
Chunhe Xia
Yao-Chun Yang
OffRL
148
31
0
14 Jul 2023
A General Perspective on Objectives of Reinforcement Learning
A General Perspective on Objectives of Reinforcement Learning
Longyu Yang
OffRL
28
0
0
05 Jun 2023
OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning
  Research
OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research
Jiaming Ji
Jiayi Zhou
Borong Zhang
Juntao Dai
Xuehai Pan
Ruiyang Sun
Weidong Huang
Yiran Geng
Mickel Liu
Yaodong Yang
OffRL
262
68
0
16 May 2023
Risk Sensitive Dead-end Identification in Safety-Critical Offline
  Reinforcement Learning
Risk Sensitive Dead-end Identification in Safety-Critical Offline Reinforcement Learning
Taylor W. Killian
S. Parbhoo
Marzyeh Ghassemi
OffRL
170
8
0
13 Jan 2023
Evaluating Model-free Reinforcement Learning toward Safety-critical
  Tasks
Evaluating Model-free Reinforcement Learning toward Safety-critical TasksAAAI Conference on Artificial Intelligence (AAAI), 2022
Linrui Zhang
Qin Zhang
Li Shen
Bo Yuan
Xueqian Wang
Dacheng Tao
OffRL
202
36
0
12 Dec 2022
Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs
Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs
Dongsheng Ding
Jianchao Tan
Jiali Duan
Tamer Bacsar
Mihailo R. Jovanović
282
23
0
06 Jun 2022
1