ResearchTrend.AI
Safe Policies for Reinforcement Learning via Primal-Dual Methods

20 November 2019
Santiago Paternain
Miguel Calvo-Fullana
Luiz F. O. Chamon
Alejandro Ribeiro

Papers citing "Safe Policies for Reinforcement Learning via Primal-Dual Methods"

21 / 21 papers shown
Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL
Zhikun Tao
Gang Xiong
He Fang
Zhen Shen
Yunjun Han
Qing-Shan Jia
OffRL
34
0
0
13 May 2025
When to Localize? A Risk-Constrained Reinforcement Learning Approach
Chak Lam Shek
Kasra Torshizi
Troi Williams
Pratap Tokekar
41
2
0
05 Nov 2024
Second-Order Min-Max Optimization with Lazy Hessians
Lesi Chen
Chengchang Liu
Jingzhao Zhang
46
1
0
12 Oct 2024
Edge Intelligence in Satellite-Terrestrial Networks with Hybrid Quantum Computing
Siyue Huang
Lifeng Wang
Xin Wang
Bo Tan
Wei Ni
Kai-Kit Wong
35
1
0
30 Sep 2024
Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form
Toshinori Kitamura
Tadashi Kozuno
Wataru Kumagai
Kenta Hoshino
Y. Hosoe
Kazumi Kasaura
Masashi Hamaya
Paavo Parmas
Yutaka Matsuo
74
1
0
29 Aug 2024
$\mathrm{E^{2}CFD}$: Towards Effective and Efficient Cost Function Design for Safe Reinforcement Learning via Large Language Model
Zepeng Wang
Chao Ma
Linjiang Zhou
Libing Wu
Lei Yang
Xiaochuan Shi
Guojun Peng
OffRL
45
0
0
08 Jul 2024
GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model
Zhehua Zhou
Xuan Xie
Jiayang Song
Zhan Shu
Lei Ma
49
1
0
06 Jun 2024
One-Shot Safety Alignment for Large Language Models via Optimal Dualization
Xinmeng Huang
Shuo Li
Yan Sun
Osbert Bastani
Hamed Hassani
Dongsheng Ding
49
4
0
29 May 2024
Constrained Proximal Policy Optimization
Chengbin Xuan
Feng Zhang
Faliang Yin
H. Lam
26
0
0
23 May 2023
Policy Gradients for Probabilistic Constrained Reinforcement Learning
Weiqin Chen
D. Subramanian
Santiago Paternain
29
6
0
02 Oct 2022
Mean-Field Approximation of Cooperative Constrained Multi-Agent Reinforcement Learning (CMARL)
Washim Uddin Mondal
Vaneet Aggarwal
S. Ukkusuri
37
4
0
15 Sep 2022
Generalized Policy Improvement Algorithms with Theoretically Supported Sample Reuse
James Queeney
I. Paschalidis
Christos G. Cassandras
OffRL
32
2
0
28 Jun 2022
Provably Efficient Model-Free Constrained RL with Linear Function Approximation
A. Ghosh
Xingyu Zhou
Ness B. Shroff
75
23
0
23 Jun 2022
A Review of Safe Reinforcement Learning: Methods, Theory and Applications
Shangding Gu
Longyu Yang
Yali Du
Guang Chen
Florian Walter
Jun Wang
Alois C. Knoll
OffRL
AI4TS
117
241
0
20 May 2022
Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles
Xinglong Zhang
Yaoqian Peng
Biao Luo
Wei Pan
Xin Xu
Haibin Xie
27
11
0
18 Dec 2021
Safe Policy Optimization with Local Generalized Linear Function Approximations
Akifumi Wachi
Yunyue Wei
Yanan Sui
OffRL
35
10
0
09 Nov 2021
Learning Policies with Zero or Bounded Constraint Violation for Constrained MDPs
Tao-Wen Liu
Ruida Zhou
D. Kalathil
P. R. Kumar
Chao Tian
42
78
0
04 Jun 2021
CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee
Tengyu Xu
Yingbin Liang
Guanghui Lan
52
122
0
11 Nov 2020
Provably Efficient Model-Free Algorithm for MDPs with Peak Constraints
Qinbo Bai
Vaneet Aggarwal
Ather Gattami
22
7
0
11 Mar 2020
Provably Efficient Safe Exploration via Primal-Dual Policy Optimization
Dongsheng Ding
Xiaohan Wei
Zhuoran Yang
Zhaoran Wang
M. Jovanović
25
159
0
01 Mar 2020
Safe Exploration in Markov Decision Processes
T. Moldovan
Pieter Abbeel
78
308
0
22 May 2012