ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.03152
  4. Cited By
Projection-Based Constrained Policy Optimization

Projection-Based Constrained Policy Optimization

7 October 2020
Tsung-Yen Yang
Justinian P. Rosca
Karthik Narasimhan
Peter J. Ramadge
ArXiv (abs)PDFHTML

Papers citing "Projection-Based Constrained Policy Optimization"

50 / 163 papers shown
Title
SPiDR: A Simple Approach for Zero-Shot Safety in Sim-to-Real Transfer
SPiDR: A Simple Approach for Zero-Shot Safety in Sim-to-Real Transfer
Yarden As
Chengrui Qu
Benjamin Unger
Dongho Kang
Max van der Hart
Laixi Shi
Stelian Coros
Adam Wierman
Andreas Krause
OffRL
16
0
0
23 Sep 2025
Incentivizing Safer Actions in Policy Optimization for Constrained Reinforcement Learning
Incentivizing Safer Actions in Policy Optimization for Constrained Reinforcement LearningInternational Joint Conference on Artificial Intelligence (IJCAI), 2021
S. Hazra
P. Dasgupta
Soumyajit Dey
40
0
0
11 Sep 2025
Multi-Agent Reinforcement Learning for Task Offloading in Wireless Edge Networks
Multi-Agent Reinforcement Learning for Task Offloading in Wireless Edge Networks
Andrea Fox
Francesco De Pellegrini
Eitan Altman
OffRL
28
0
0
01 Sep 2025
HAEPO: History-Aggregated Exploratory Policy Optimization
HAEPO: History-Aggregated Exploratory Policy Optimization
Gaurish Trivedi
Alakh Sharma
Kartikey Singh Bhandari
Dhruv Kumar
Pratik Narang
Jagat Sesh Challa
20
0
0
26 Aug 2025
Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality
Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality
Shaocong Ma
Ziyi Chen
Yi Zhou
Heng Huang
OffRL
32
0
0
24 Aug 2025
Trust Region Constrained Measure Transport in Path Space for Stochastic Optimal Control and Inference
Trust Region Constrained Measure Transport in Path Space for Stochastic Optimal Control and Inference
Denis Blessing
Julius Berner
Lorenz Richter
Carles Domingo-Enrich
Yuanqi Du
Arash Vahdat
Gerhard Neumann
40
2
0
17 Aug 2025
Proactive Constrained Policy Optimization with Preemptive Penalty
Proactive Constrained Policy Optimization with Preemptive Penalty
Ning Yang
Pengyu Wang
Guoqing Liu
Haifeng Zhang
Pin Lyu
Jun Wang
84
0
0
03 Aug 2025
One Subgoal at a Time: Zero-Shot Generalization to Arbitrary Linear Temporal Logic Requirements in Multi-Task Reinforcement Learning
One Subgoal at a Time: Zero-Shot Generalization to Arbitrary Linear Temporal Logic Requirements in Multi-Task Reinforcement Learning
Zijian Guo
İlker Işık
Hijaz Ahmad
Wenchao Li
OffRLAI4CE
131
0
0
03 Aug 2025
Mirror Descent Policy Optimisation for Robust Constrained Markov Decision Processes
Mirror Descent Policy Optimisation for Robust Constrained Markov Decision Processes
David Bossens
Atsushi Nitanda
66
0
0
29 Jun 2025
Situational-Constrained Sequential Resources Allocation via Reinforcement Learning
Situational-Constrained Sequential Resources Allocation via Reinforcement LearningInternational Joint Conference on Artificial Intelligence (IJCAI), 2021
Libo Zhang
Yang Chen
Toru Takisaka
Kaiqi Zhao
Weidong Li
Jiamou Liu
92
0
0
17 Jun 2025
Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees
Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees
Sourav Ganguly
Arnob Ghosh
Kishan Panaganti
Adam Wierman
86
0
0
25 May 2025
A Survey of Safe Reinforcement Learning and Constrained MDPs: A Technical Survey on Single-Agent and Multi-Agent Safety
A Survey of Safe Reinforcement Learning and Constrained MDPs: A Technical Survey on Single-Agent and Multi-Agent Safety
Ankita Kushwaha
Kiran Ravish
Preeti Lamba
Pawan Kumar
76
0
0
22 May 2025
Runtime Safety through Adaptive Shielding: From Hidden Parameter Inference to Provable Guarantees
Runtime Safety through Adaptive Shielding: From Hidden Parameter Inference to Provable Guarantees
Minjae Kwon
Tyler Ingebrand
Ufuk Topcu
Lu Feng
68
1
0
20 May 2025
Context-aware Constrained Reinforcement Learning Based Energy-Efficient Power Scheduling for Non-stationary XR Data Traffic
Kexuan Wang
An Liu
111
0
0
13 Mar 2025
Safe Explicable Policy Search
Safe Explicable Policy Search
Akkamahadevi Hanni
Jonathan Montaño
Yu Zhang
141
0
0
10 Mar 2025
Provably Efficient RL for Linear MDPs under Instantaneous Safety Constraints in Non-Convex Feature Spaces
Provably Efficient RL for Linear MDPs under Instantaneous Safety Constraints in Non-Convex Feature Spaces
Amirhossein Roknilamouki
A. Ghosh
Ming Shi
Fatemeh Nourzad
Eylem Ekici
Ness B. Shroff
121
0
0
25 Feb 2025
Don't Trade Off Safety: Diffusion Regularization for Constrained Offline RL
Don't Trade Off Safety: Diffusion Regularization for Constrained Offline RL
Junyu Guo
Zhi Zheng
Donghao Ying
Ming Jin
Shangding Gu
C. Spanos
Javad Lavaei
OffRL
293
0
0
18 Feb 2025
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
Zishun Yu
Tengyu Xu
Di Jin
Karthik Abinav Sankararaman
Yun He
...
Eryk Helenowski
Chen Zhu
Sinong Wang
Hao Ma
Han Fang
LRM
350
17
0
29 Jan 2025
Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline Policy
Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline Policy
Keru Chen
Honghao Wei
Zhigang Deng
Sen Lin
OffRLOnRL
225
0
0
31 Dec 2024
Safe Reinforcement Learning using Finite-Horizon Gradient-based
  Estimation
Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation
Juntao Dai
Yaodong Yang
Qian Zheng
Gang Pan
OffRL
174
3
0
15 Dec 2024
From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement Learning
From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement Learning
Pusen Dong
Tianchen Zhu
Yue Qiu
Haoyi Zhou
Jianxin Li
238
1
0
12 Dec 2024
Embedding Safety into RL: A New Take on Trust Region Methods
Embedding Safety into RL: A New Take on Trust Region Methods
Nikola Milosevic
Johannes Müller
Nico Scherf
203
4
0
05 Nov 2024
Adversarial Constrained Policy Optimization: Improving Constrained
  Reinforcement Learning by Adapting Budgets
Adversarial Constrained Policy Optimization: Improving Constrained Reinforcement Learning by Adapting Budgets
Jianmina Ma
Jingtian Ji
Yue Gao
67
0
0
28 Oct 2024
Enhancing Safety in Reinforcement Learning with Human Feedback via Rectified Policy Optimization
Enhancing Safety in Reinforcement Learning with Human Feedback via Rectified Policy Optimization
Xiyue Peng
Hengquan Guo
Jiawei Zhang
Dongqing Zou
Ziyu Shao
Honghao Wei
Xin Liu
216
4
0
25 Oct 2024
Reinfier and Reintrainer: Verification and Interpretation-Driven Safe Deep Reinforcement Learning Frameworks
Reinfier and Reintrainer: Verification and Interpretation-Driven Safe Deep Reinforcement Learning Frameworks
Zixuan Yang
Jiaqi Zheng
Guihai Chen
OffRL
156
0
0
19 Oct 2024
Flipping-based Policy for Chance-Constrained Markov Decision Processes
Flipping-based Policy for Chance-Constrained Markov Decision Processes
Xun Shen
Shuo Jiang
Akifumi Wachi
Kaumune Hashimoto
Sebastien Gros
43
1
0
09 Oct 2024
Absolute State-wise Constrained Policy Optimization: High-Probability
  State-wise Constraints Satisfaction
Absolute State-wise Constrained Policy Optimization: High-Probability State-wise Constraints Satisfaction
Weiye Zhao
Feihan Li
Yifan Sun
Yujie Wang
Rui Chen
Tianhao Wei
Changliu Liu
93
2
0
02 Oct 2024
Bridging the gap between Learning-to-plan, Motion Primitives and Safe
  Reinforcement Learning
Bridging the gap between Learning-to-plan, Motion Primitives and Safe Reinforcement Learning
Piotr Kicki
Davide Tateo
Puze Liu
Jonas Guenster
Jan Peters
Krzysztof Walas
140
4
0
26 Aug 2024
MORTAR: A Model-based Runtime Action Repair Framework for AI-enabled
  Cyber-Physical Systems
MORTAR: A Model-based Runtime Action Repair Framework for AI-enabled Cyber-Physical Systems
Renzhi Wang
Zhehua Zhou
Yuheng Huang
Xuan Xie
Xiaofei Xie
Lei Ma
92
1
0
07 Aug 2024
Exterior Penalty Policy Optimization with Penalty Metric Network under
  Constraints
Exterior Penalty Policy Optimization with Penalty Metric Network under Constraints
Shiqing Gao
Jiaxin Ding
Luoyi Fu
Xinbing Wang
Cheng Zhou
72
2
0
22 Jul 2024
Sparsity-based Safety Conservatism for Constrained Offline Reinforcement
  Learning
Sparsity-based Safety Conservatism for Constrained Offline Reinforcement Learning
Minjae Cho
Chuangchuang Sun
OffRL
169
0
0
17 Jul 2024
Hamilton-Jacobi Reachability in Reinforcement Learning: A Survey
Hamilton-Jacobi Reachability in Reinforcement Learning: A Survey
Milan Ganai
Sicun Gao
Sylvia Herbert
245
15
0
12 Jul 2024
$\mathrm{E^{2}CFD}$: Towards Effective and Efficient Cost Function
  Design for Safe Reinforcement Learning via Large Language Model
E2CFD\mathrm{E^{2}CFD}E2CFD: Towards Effective and Efficient Cost Function Design for Safe Reinforcement Learning via Large Language Model
Zepeng Wang
Chao Ma
Linjiang Zhou
Libing Wu
Lei Yang
Xiaochuan Shi
Guojun Peng
OffRL
139
0
0
08 Jul 2024
Constrained Meta Agnostic Reinforcement Learning
Constrained Meta Agnostic Reinforcement Learning
Karam Daaboul
Florian Kuhm
Tim Joseph
J. Marius Zoellner
136
0
0
20 Jun 2024
Optimal Transport-Assisted Risk-Sensitive Q-Learning
Optimal Transport-Assisted Risk-Sensitive Q-Learning
Zahra Shahrooei
Ali Baheri
127
3
0
17 Jun 2024
e-COP : Episodic Constrained Optimization of Policies
e-COP : Episodic Constrained Optimization of Policies
Akhil Agnihotri
Rahul Jain
Deepak Ramachandran
Sahil Singla
OffRL
97
1
0
13 Jun 2024
GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model
GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model
Zhehua Zhou
Xuan Xie
Yuheng Huang
Zhan Shu
Lei Ma
196
1
0
06 Jun 2024
Enhancing Efficiency of Safe Reinforcement Learning via Sample
  Manipulation
Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation
Shangding Gu
Laixi Shi
Yuhao Ding
Alois Knoll
C. Spanos
Adam Wierman
Ming Jin
OffRL
114
4
0
31 May 2024
Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
Dohyeong Kim
Taehyun Cho
Seung Han
Hojun Chung
Kyungjae Lee
Songhwai Oh
114
3
0
29 May 2024
Safe and Balanced: A Framework for Constrained Multi-Objective
  Reinforcement Learning
Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning
Shangding Gu
Bilgehan Sel
Yuhao Ding
Lu Wang
Qingwei Lin
Alois Knoll
Ming Jin
107
7
0
26 May 2024
Feasibility Consistent Representation Learning for Safe Reinforcement
  Learning
Feasibility Consistent Representation Learning for Safe Reinforcement Learning
Zhepeng Cen
Yi-Fan Yao
Zuxin Liu
Ding Zhao
OffRL
169
3
0
20 May 2024
SOMTP: Self-Supervised Learning-Based Optimizer for MPC-Based Safe
  Trajectory Planning Problems in Robotics
SOMTP: Self-Supervised Learning-Based Optimizer for MPC-Based Safe Trajectory Planning Problems in Robotics
Yifan Liu
You Wang
Guang Chen
114
2
0
15 May 2024
Constrained Reinforcement Learning Under Model Mismatch
Constrained Reinforcement Learning Under Model Mismatch
Zhongchang Sun
Sihong He
Fei Miao
Shaofeng Zou
146
9
0
02 May 2024
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
Shangding Gu
Bilgehan Sel
Yuhao Ding
Lu Wang
Qingwei Lin
Ming Jin
Alois Knoll
167
16
0
02 May 2024
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for
  Mobile Edge Computing, its Applications, and Future Research Trajectories
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories
Ning Yang
Shuo Chen
Haijun Zhang
Randall Berry
OffRL
136
19
0
22 Apr 2024
Intervention-Assisted Policy Gradient Methods for Online Stochastic
  Queuing Network Optimization: Technical Report
Intervention-Assisted Policy Gradient Methods for Online Stochastic Queuing Network Optimization: Technical Report
Jerrod Wigmore
B. Shrader
E. Modiano
OffRL
92
2
0
05 Apr 2024
Long and Short-Term Constraints Driven Safe Reinforcement Learning for
  Autonomous Driving
Long and Short-Term Constraints Driven Safe Reinforcement Learning for Autonomous Driving
Xuemin Hu
Pan Chen
Yijun Wen
Bo Tang
Long Chen
96
4
0
27 Mar 2024
Enhancing LLM Safety via Constrained Direct Preference Optimization
Enhancing LLM Safety via Constrained Direct Preference Optimization
Zixuan Liu
Xiaolin Sun
Zizhan Zheng
152
36
0
04 Mar 2024
Concurrent Learning of Policy and Unknown Safety Constraints in
  Reinforcement Learning
Concurrent Learning of Policy and Unknown Safety Constraints in Reinforcement Learning
Lunet Yifru
Ali Baheri
OffRL
128
1
0
24 Feb 2024
A Survey of Constraint Formulations in Safe Reinforcement Learning
A Survey of Constraint Formulations in Safe Reinforcement Learning
Akifumi Wachi
Xun Shen
Yanan Sui
158
23
0
03 Feb 2024
1234
Next