Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2011.05869
Cited By
CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee
11 November 2020
Tengyu Xu
Yingbin Liang
Guanghui Lan
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee"
50 / 85 papers shown
Title
Graph Neural Network Aided Deep Reinforcement Learning for Resource Allocation in Dynamic Terahertz UAV Networks
Zhifeng Hu
Chong Han
37
0
0
08 May 2025
Multi-Constraint Safe Reinforcement Learning via Closed-form Solution for Log-Sum-Exp Approximation of Control Barrier Functions
Chenggang Wang
Xinyi Wang
Yutong Dong
Lei Song
Xinping Guan
24
0
0
01 May 2025
Solving Multi-Agent Safe Optimal Control with Distributed Epigraph Form MARL
Songyuan Zhang
Oswin So
Mitchell Black
Zachary Serlin
Chuchu Fan
26
0
0
21 Apr 2025
Primal-Dual Sample Complexity Bounds for Constrained Markov Decision Processes with Multiple Constraints
Max Buckley
Konstantinos Papathanasiou
Andreas Spanopoulos
50
0
0
09 Mar 2025
Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
Shangding Gu
Laixi Shi
Muning Wen
Ming Jin
Eric Mazumdar
Yuejie Chi
Adam Wierman
C. Spanos
OOD
OffRL
36
1
0
27 Feb 2025
Safety Representations for Safer Policy Learning
Kaustubh Mani
Vincent Mai
Charlie Gauthier
Annie Chen
Samer Nashed
Liam Paull
38
0
0
27 Feb 2025
Reward-Safety Balance in Offline Safe RL via Diffusion Regularization
Junyu Guo
Zhi Zheng
Donghao Ying
Ming Jin
Shangding Gu
C. Spanos
Javad Lavaei
OffRL
45
0
0
18 Feb 2025
Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning
Zijian Guo
Weichao Zhou
Wenchao Li
OffRL
94
2
0
28 Jan 2025
Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination Dynamics
S. Hazra
P. Dasgupta
Soumyajit Dey
29
0
0
21 Jan 2025
Adversarial Constrained Policy Optimization: Improving Constrained Reinforcement Learning by Adapting Budgets
Jianmina Ma
Jingtian Ji
Yue Gao
18
0
0
28 Oct 2024
Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator
Siyuan Xu
Minghui Zhu
OffRL
28
1
0
13 Oct 2024
ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning
Yarden As
Bhavya Sukhija
Lenart Treven
Carmelo Sferrazza
Stelian Coros
Andreas Krause
16
1
0
12 Oct 2024
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
Ruohong Liu
Yuxin Pan
Linjie Xu
Lei Song
Jiang Bian
Pengcheng You
Yize Chen
35
0
0
03 Oct 2024
The Perfect Blend: Redefining RLHF with Mixture of Judges
Tengyu Xu
Eryk Helenowski
Karthik Abinav Sankararaman
Di Jin
Kaiyan Peng
...
Gabriel Cohen
Yuandong Tian
Hao Ma
Sinong Wang
Han Fang
31
9
0
30 Sep 2024
An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning
Qian Lin
Zongkai Liu
Danying Mo
Chao Yu
OffRL
21
0
0
16 Sep 2024
Vision-driven UAV River Following: Benchmarking with Safe Reinforcement Learning
Zihan Wang
N. Mahmoudian
18
2
0
13 Sep 2024
Tera-SpaceCom: GNN-based Deep Reinforcement Learning for Joint Resource Allocation and Task Offloading in TeraHertz Band Space Networks
Zhifeng Hu
Chong Han
Wolfgang H. Gerstacker
I. F. Akyildiz
16
0
0
12 Sep 2024
Last-Iterate Convergence of General Parameterized Policies in Constrained MDPs
Washim Uddin Mondal
Vaneet Aggarwal
33
1
0
21 Aug 2024
Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning
Alessandro Montenegro
Marco Mussi
Matteo Papini
Alberto Maria Metelli
BDL
33
1
0
15 Jul 2024
Hamilton-Jacobi Reachability in Reinforcement Learning: A Survey
Milan Ganai
Sicun Gao
Sylvia L. Herbert
32
6
0
12 Jul 2024
Optimal Transport-Assisted Risk-Sensitive Q-Learning
Zahra Shahrooei
Ali Baheri
21
2
0
17 Jun 2024
GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model
Zhehua Zhou
Xuan Xie
Jiayang Song
Zhan Shu
Lei Ma
32
1
0
06 Jun 2024
Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation
Shangding Gu
Laixi Shi
Yuhao Ding
Alois Knoll
C. Spanos
Adam Wierman
Ming Jin
OffRL
22
2
0
31 May 2024
Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
Dohyeong Kim
Taehyun Cho
Seung Han
Hojun Chung
Kyungjae Lee
Songhwai Oh
27
0
0
29 May 2024
A CMDP-within-online framework for Meta-Safe Reinforcement Learning
Vanshaj Khattar
Yuhao Ding
Bilgehan Sel
Javad Lavaei
Ming Jin
OffRL
27
12
0
26 May 2024
Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning
Shangding Gu
Bilgehan Sel
Yuhao Ding
Lu Wang
Qingwei Lin
Alois Knoll
Ming Jin
35
1
0
26 May 2024
Federated Reinforcement Learning with Constraint Heterogeneity
Hao Jin
Liangyu Zhang
Zhihua Zhang
22
0
0
06 May 2024
Constrained Reinforcement Learning Under Model Mismatch
Zhongchang Sun
Sihong He
Fei Miao
Shaofeng Zou
33
4
0
02 May 2024
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
Shangding Gu
Bilgehan Sel
Yuhao Ding
Lu Wang
Qingwei Lin
Ming Jin
Alois Knoll
39
9
0
02 May 2024
Myopically Verifiable Probabilistic Certificates for Safe Control and Learning
Zhuoyuan Wang
Haoming Jing
Christian Kurniawan
Albert Chern
Yorie Nakahira
27
1
0
23 Apr 2024
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories
Ning Yang
Shuo Chen
Haijun Zhang
Randall Berry
OffRL
29
5
0
22 Apr 2024
Primal Methods for Variational Inequality Problems with Functional Constraints
Liang Zhang
Niao He
Michael Muehlebach
32
2
0
19 Mar 2024
Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning
Dohyeong Kim
Mineui Hong
Jeongho Park
Songhwai Oh
19
0
0
01 Mar 2024
A Survey of Constraint Formulations in Safe Reinforcement Learning
Akifumi Wachi
Xun Shen
Yanan Sui
26
10
0
03 Feb 2024
Off-Policy Primal-Dual Safe Reinforcement Learning
Zifan Wu
Bo Tang
Qian Lin
Chao Yu
Shangqin Mao
Qianlong Xie
Xingxing Wang
Dong Wang
OffRL
11
3
0
26 Jan 2024
Gradient Shaping for Multi-Constraint Safe Reinforcement Learning
Yi-Fan Yao
Zuxin Liu
Zhepeng Cen
Peide Huang
Tingnan Zhang
Wenhao Yu
Ding Zhao
OffRL
66
6
0
23 Dec 2023
Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value at Risk
Dohyeong Kim
Songhwai Oh
OffRL
12
17
0
01 Dec 2023
A safe exploration approach to constrained Markov decision processes
Tingting Ni
Maryam Kamgarpour
20
3
0
01 Dec 2023
State-Wise Safe Reinforcement Learning With Pixel Observations
S. Zhan
Yixuan Wang
Qingyuan Wu
Ruochen Jiao
Chao Huang
Qi Zhu
19
10
0
03 Nov 2023
Reinforcement Learning in a Safety-Embedded MDP with Trajectory Optimization
Fan Yang
Wen-Min Zhou
Zuxin Liu
Ding Zhao
David Held
10
1
0
10 Oct 2023
Deep Reinforcement Learning Based Cross-Layer Design in Terahertz Mesh Backhaul Networks
Zhifeng Hu
Chong Han
Xudong Wang
6
3
0
08 Oct 2023
Evaluation of Constrained Reinforcement Learning Algorithms for Legged Locomotion
Joonho Lee
Lukas Schroth
Victor Klemm
Marko Bjelonic
Alexander Reske
Marco Hutter
11
14
0
27 Sep 2023
Iterative Reachability Estimation for Safe Reinforcement Learning
Milan Ganai
Zheng Gong
Chenning Yu
Sylvia L. Herbert
Sicun Gao
15
17
0
24 Sep 2023
Price of Safety in Linear Best Arm Identification
Xuedong Shang
Igor Colin
M. Barlier
Hamza Cherkaoui
LLMSV
8
3
0
15 Sep 2023
Task-Oriented Cross-System Design for Timely and Accurate Modeling in the Metaverse
Zhen Meng
Kan Chen
Yufeng Diao
Changyang She
G. Zhao
Muhammad Ali Imran
B. Vucetic
17
12
0
11 Sep 2023
Not Only Rewards But Also Constraints: Applications on Legged Robot Locomotion
Yunho Kim
H. Oh
J. Lee
Jinhyeok Choi
Gwanghyeon Ji
Moonkyu Jung
D. Youm
Jemin Hwangbo
19
42
0
24 Aug 2023
Provably Efficient Algorithm for Nonstationary Low-Rank MDPs
Yuan-Chia Cheng
J. Yang
Yitao Liang
OOD
22
1
0
10 Aug 2023
Probabilistic Constrained Reinforcement Learning with Formal Interpretability
Yanran Wang
Qiuchen Qian
David E. Boyle
13
4
0
13 Jul 2023
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Dongsheng Ding
Chen-Yu Wei
K. Zhang
Alejandro Ribeiro
30
19
0
20 Jun 2023
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
Qian Lin
Bo Tang
Zifan Wu
Chao Yu
Shangqin Mao
Qianlong Xie
Xingxing Wang
Dong Wang
OffRL
25
11
0
01 Jun 2023
1
2
Next