ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.05869
  4. Cited By
CRPO: A New Approach for Safe Reinforcement Learning with Convergence
  Guarantee

CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee

11 November 2020
Tengyu Xu
Yingbin Liang
Guanghui Lan
ArXivPDFHTML

Papers citing "CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee"

50 / 85 papers shown
Title
Graph Neural Network Aided Deep Reinforcement Learning for Resource Allocation in Dynamic Terahertz UAV Networks
Graph Neural Network Aided Deep Reinforcement Learning for Resource Allocation in Dynamic Terahertz UAV Networks
Zhifeng Hu
Chong Han
37
0
0
08 May 2025
Multi-Constraint Safe Reinforcement Learning via Closed-form Solution for Log-Sum-Exp Approximation of Control Barrier Functions
Multi-Constraint Safe Reinforcement Learning via Closed-form Solution for Log-Sum-Exp Approximation of Control Barrier Functions
Chenggang Wang
Xinyi Wang
Yutong Dong
Lei Song
Xinping Guan
24
0
0
01 May 2025
Solving Multi-Agent Safe Optimal Control with Distributed Epigraph Form MARL
Solving Multi-Agent Safe Optimal Control with Distributed Epigraph Form MARL
Songyuan Zhang
Oswin So
Mitchell Black
Zachary Serlin
Chuchu Fan
26
0
0
21 Apr 2025
Primal-Dual Sample Complexity Bounds for Constrained Markov Decision Processes with Multiple Constraints
Max Buckley
Konstantinos Papathanasiou
Andreas Spanopoulos
50
0
0
09 Mar 2025
Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
Shangding Gu
Laixi Shi
Muning Wen
Ming Jin
Eric Mazumdar
Yuejie Chi
Adam Wierman
C. Spanos
OOD
OffRL
36
1
0
27 Feb 2025
Safety Representations for Safer Policy Learning
Safety Representations for Safer Policy Learning
Kaustubh Mani
Vincent Mai
Charlie Gauthier
Annie Chen
Samer Nashed
Liam Paull
38
0
0
27 Feb 2025
Reward-Safety Balance in Offline Safe RL via Diffusion Regularization
Junyu Guo
Zhi Zheng
Donghao Ying
Ming Jin
Shangding Gu
C. Spanos
Javad Lavaei
OffRL
45
0
0
18 Feb 2025
Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning
Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning
Zijian Guo
Weichao Zhou
Wenchao Li
OffRL
94
2
0
28 Jan 2025
Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination Dynamics
Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination Dynamics
S. Hazra
P. Dasgupta
Soumyajit Dey
29
0
0
21 Jan 2025
Adversarial Constrained Policy Optimization: Improving Constrained
  Reinforcement Learning by Adapting Budgets
Adversarial Constrained Policy Optimization: Improving Constrained Reinforcement Learning by Adapting Budgets
Jianmina Ma
Jingtian Ji
Yue Gao
18
0
0
28 Oct 2024
Meta-Reinforcement Learning with Universal Policy Adaptation: Provable
  Near-Optimality under All-task Optimum Comparator
Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator
Siyuan Xu
Minghui Zhu
OffRL
28
1
0
13 Oct 2024
ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning
ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning
Yarden As
Bhavya Sukhija
Lenart Treven
Carmelo Sferrazza
Stelian Coros
Andreas Krause
20
1
0
12 Oct 2024
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
Ruohong Liu
Yuxin Pan
Linjie Xu
Lei Song
Jiang Bian
Pengcheng You
Yize Chen
35
0
0
03 Oct 2024
The Perfect Blend: Redefining RLHF with Mixture of Judges
The Perfect Blend: Redefining RLHF with Mixture of Judges
Tengyu Xu
Eryk Helenowski
Karthik Abinav Sankararaman
Di Jin
Kaiyan Peng
...
Gabriel Cohen
Yuandong Tian
Hao Ma
Sinong Wang
Han Fang
31
9
0
30 Sep 2024
An Offline Adaptation Framework for Constrained Multi-Objective
  Reinforcement Learning
An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning
Qian Lin
Zongkai Liu
Danying Mo
Chao Yu
OffRL
21
0
0
16 Sep 2024
Vision-driven UAV River Following: Benchmarking with Safe Reinforcement
  Learning
Vision-driven UAV River Following: Benchmarking with Safe Reinforcement Learning
Zihan Wang
N. Mahmoudian
18
2
0
13 Sep 2024
Tera-SpaceCom: GNN-based Deep Reinforcement Learning for Joint Resource
  Allocation and Task Offloading in TeraHertz Band Space Networks
Tera-SpaceCom: GNN-based Deep Reinforcement Learning for Joint Resource Allocation and Task Offloading in TeraHertz Band Space Networks
Zhifeng Hu
Chong Han
Wolfgang H. Gerstacker
I. F. Akyildiz
16
0
0
12 Sep 2024
Last-Iterate Convergence of General Parameterized Policies in
  Constrained MDPs
Last-Iterate Convergence of General Parameterized Policies in Constrained MDPs
Washim Uddin Mondal
Vaneet Aggarwal
36
1
0
21 Aug 2024
Last-Iterate Global Convergence of Policy Gradients for Constrained
  Reinforcement Learning
Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning
Alessandro Montenegro
Marco Mussi
Matteo Papini
Alberto Maria Metelli
BDL
33
1
0
15 Jul 2024
Hamilton-Jacobi Reachability in Reinforcement Learning: A Survey
Hamilton-Jacobi Reachability in Reinforcement Learning: A Survey
Milan Ganai
Sicun Gao
Sylvia L. Herbert
32
6
0
12 Jul 2024
Optimal Transport-Assisted Risk-Sensitive Q-Learning
Optimal Transport-Assisted Risk-Sensitive Q-Learning
Zahra Shahrooei
Ali Baheri
21
2
0
17 Jun 2024
GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model
GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model
Zhehua Zhou
Xuan Xie
Jiayang Song
Zhan Shu
Lei Ma
32
1
0
06 Jun 2024
Enhancing Efficiency of Safe Reinforcement Learning via Sample
  Manipulation
Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation
Shangding Gu
Laixi Shi
Yuhao Ding
Alois Knoll
C. Spanos
Adam Wierman
Ming Jin
OffRL
24
2
0
31 May 2024
Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
Dohyeong Kim
Taehyun Cho
Seung Han
Hojun Chung
Kyungjae Lee
Songhwai Oh
27
0
0
29 May 2024
A CMDP-within-online framework for Meta-Safe Reinforcement Learning
A CMDP-within-online framework for Meta-Safe Reinforcement Learning
Vanshaj Khattar
Yuhao Ding
Bilgehan Sel
Javad Lavaei
Ming Jin
OffRL
27
12
0
26 May 2024
Safe and Balanced: A Framework for Constrained Multi-Objective
  Reinforcement Learning
Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning
Shangding Gu
Bilgehan Sel
Yuhao Ding
Lu Wang
Qingwei Lin
Alois Knoll
Ming Jin
35
1
0
26 May 2024
Federated Reinforcement Learning with Constraint Heterogeneity
Federated Reinforcement Learning with Constraint Heterogeneity
Hao Jin
Liangyu Zhang
Zhihua Zhang
22
0
0
06 May 2024
Constrained Reinforcement Learning Under Model Mismatch
Constrained Reinforcement Learning Under Model Mismatch
Zhongchang Sun
Sihong He
Fei Miao
Shaofeng Zou
33
4
0
02 May 2024
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
Shangding Gu
Bilgehan Sel
Yuhao Ding
Lu Wang
Qingwei Lin
Ming Jin
Alois Knoll
39
9
0
02 May 2024
Myopically Verifiable Probabilistic Certificates for Safe Control and
  Learning
Myopically Verifiable Probabilistic Certificates for Safe Control and Learning
Zhuoyuan Wang
Haoming Jing
Christian Kurniawan
Albert Chern
Yorie Nakahira
27
1
0
23 Apr 2024
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for
  Mobile Edge Computing, its Applications, and Future Research Trajectories
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories
Ning Yang
Shuo Chen
Haijun Zhang
Randall Berry
OffRL
29
5
0
22 Apr 2024
Primal Methods for Variational Inequality Problems with Functional Constraints
Primal Methods for Variational Inequality Problems with Functional Constraints
Liang Zhang
Niao He
Michael Muehlebach
32
2
0
19 Mar 2024
Conflict-Averse Gradient Aggregation for Constrained Multi-Objective
  Reinforcement Learning
Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning
Dohyeong Kim
Mineui Hong
Jeongho Park
Songhwai Oh
19
0
0
01 Mar 2024
A Survey of Constraint Formulations in Safe Reinforcement Learning
A Survey of Constraint Formulations in Safe Reinforcement Learning
Akifumi Wachi
Xun Shen
Yanan Sui
26
10
0
03 Feb 2024
Off-Policy Primal-Dual Safe Reinforcement Learning
Off-Policy Primal-Dual Safe Reinforcement Learning
Zifan Wu
Bo Tang
Qian Lin
Chao Yu
Shangqin Mao
Qianlong Xie
Xingxing Wang
Dong Wang
OffRL
11
3
0
26 Jan 2024
Gradient Shaping for Multi-Constraint Safe Reinforcement Learning
Gradient Shaping for Multi-Constraint Safe Reinforcement Learning
Yi-Fan Yao
Zuxin Liu
Zhepeng Cen
Peide Huang
Tingnan Zhang
Wenhao Yu
Ding Zhao
OffRL
66
6
0
23 Dec 2023
Efficient Off-Policy Safe Reinforcement Learning Using Trust Region
  Conditional Value at Risk
Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value at Risk
Dohyeong Kim
Songhwai Oh
OffRL
12
17
0
01 Dec 2023
A safe exploration approach to constrained Markov decision processes
A safe exploration approach to constrained Markov decision processes
Tingting Ni
Maryam Kamgarpour
20
3
0
01 Dec 2023
State-Wise Safe Reinforcement Learning With Pixel Observations
State-Wise Safe Reinforcement Learning With Pixel Observations
S. Zhan
Yixuan Wang
Qingyuan Wu
Ruochen Jiao
Chao Huang
Qi Zhu
25
10
0
03 Nov 2023
Reinforcement Learning in a Safety-Embedded MDP with Trajectory
  Optimization
Reinforcement Learning in a Safety-Embedded MDP with Trajectory Optimization
Fan Yang
Wen-Min Zhou
Zuxin Liu
Ding Zhao
David Held
12
1
0
10 Oct 2023
Deep Reinforcement Learning Based Cross-Layer Design in Terahertz Mesh
  Backhaul Networks
Deep Reinforcement Learning Based Cross-Layer Design in Terahertz Mesh Backhaul Networks
Zhifeng Hu
Chong Han
Xudong Wang
6
3
0
08 Oct 2023
Evaluation of Constrained Reinforcement Learning Algorithms for Legged
  Locomotion
Evaluation of Constrained Reinforcement Learning Algorithms for Legged Locomotion
Joonho Lee
Lukas Schroth
Victor Klemm
Marko Bjelonic
Alexander Reske
Marco Hutter
11
14
0
27 Sep 2023
Iterative Reachability Estimation for Safe Reinforcement Learning
Iterative Reachability Estimation for Safe Reinforcement Learning
Milan Ganai
Zheng Gong
Chenning Yu
Sylvia L. Herbert
Sicun Gao
15
17
0
24 Sep 2023
Price of Safety in Linear Best Arm Identification
Price of Safety in Linear Best Arm Identification
Xuedong Shang
Igor Colin
M. Barlier
Hamza Cherkaoui
LLMSV
8
3
0
15 Sep 2023
Task-Oriented Cross-System Design for Timely and Accurate Modeling in
  the Metaverse
Task-Oriented Cross-System Design for Timely and Accurate Modeling in the Metaverse
Zhen Meng
Kan Chen
Yufeng Diao
Changyang She
G. Zhao
Muhammad Ali Imran
B. Vucetic
19
12
0
11 Sep 2023
Not Only Rewards But Also Constraints: Applications on Legged Robot
  Locomotion
Not Only Rewards But Also Constraints: Applications on Legged Robot Locomotion
Yunho Kim
H. Oh
J. Lee
Jinhyeok Choi
Gwanghyeon Ji
Moonkyu Jung
D. Youm
Jemin Hwangbo
19
42
0
24 Aug 2023
Provably Efficient Algorithm for Nonstationary Low-Rank MDPs
Provably Efficient Algorithm for Nonstationary Low-Rank MDPs
Yuan-Chia Cheng
J. Yang
Yitao Liang
OOD
33
1
0
10 Aug 2023
Probabilistic Constrained Reinforcement Learning with Formal
  Interpretability
Probabilistic Constrained Reinforcement Learning with Formal Interpretability
Yanran Wang
Qiuchen Qian
David E. Boyle
16
4
0
13 Jul 2023
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for
  Constrained MDPs
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Dongsheng Ding
Chen-Yu Wei
K. Zhang
Alejandro Ribeiro
36
19
0
20 Jun 2023
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
Qian Lin
Bo Tang
Zifan Wu
Chao Yu
Shangqin Mao
Qianlong Xie
Xingxing Wang
Dong Wang
OffRL
27
11
0
01 Jun 2023
12
Next