Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach

13 September 2021
Qinbo Bai
Amrit Singh Bedi
Mridul Agarwal
Alec Koppel
Vaneet Aggarwal

Papers citing "Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach"

41 / 41 papers shown
Primal-Only Actor Critic Algorithm for Robust Constrained Average Cost MDPs
Anirudh Satheesh
Sooraj Sathish
Swetha Ganesh
Keenan Powell
Vaneet Aggarwal
117
0
0
07 Nov 2025
AL-CoLe: Augmented Lagrangian for Constrained Learning
Ignacio Boero
Ignacio Hounie
Alejandro Ribeiro
154
0
0
23 Oct 2025
Near-Optimal Sample Complexity Bounds for Constrained Average-Reward MDPs
Yukuan Wei
Xudong Li
Lin F. Yang
194
0
0
20 Sep 2025
Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality
Shaocong Ma
Ziyi Chen
Yi Zhou
Heng Huang
OffRL
316
7
0
24 Aug 2025
Constrained Sliced Wasserstein Embedding
Navid Naderializadeh
Darian Salehi
Hengrong Du
Soheil Kolouri
288
6
0
02 Jun 2025
An Optimistic Algorithm for online CMDPS with Anytime Adversarial Constraints
Jiahui Zhu
Kihyun Yu
Dabeen Lee
Xin Liu
Honghao Wei
264
1
0
28 May 2025
Primal-Dual Sample Complexity Bounds for Constrained Markov Decision Processes with Multiple Constraints
Max Buckley
Konstantinos Papathanasiou
Andreas Spanopoulos
369
0
0
09 Mar 2025
Provably Efficient RL for Linear MDPs under Instantaneous Safety Constraints in Non-Convex Feature Spaces
Amirhossein Roknilamouki
A. Ghosh
Ming Shi
Fatemeh Nourzad
Eylem Ekici
Ness B. Shroff
383
5
0
25 Feb 2025
Last-Iterate Convergence of General Parameterized Policies in Constrained MDPs
Washim Uddin Mondal
Vaneet Aggarwal
373
1
0
21 Aug 2024
Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning
Alessandro Montenegro
Marco Mussi
Matteo Papini
Alberto Maria Metelli
BDL
220
3
0
15 Jul 2024
Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
Dohyeong Kim
Taehyun Cho
Seung Han
Hojun Chung
Kyungjae Lee
Songhwai Oh
349
4
0
29 May 2024
A CMDP-within-online framework for Meta-Safe Reinforcement Learning
Vanshaj Khattar
Yuhao Ding
Bilgehan Sel
Javad Lavaei
Ming Jin
OffRL
309
24
0
26 May 2024
Natural Policy Gradient and Actor Critic Methods for Constrained Multi-Task Reinforcement Learning
Sihan Zeng
Thinh T. Doan
Justin Romberg
238
0
0
03 May 2024
Global Convergence Guarantees for Federated Policy Gradient Methods with Adversaries
Swetha Ganesh
Jiayu Chen
Gugan Thoppe
Vaneet Aggarwal
FedML
382
5
0
15 Mar 2024
Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems
Wesley A Suttle
Vipul K Sharma
K. Kosaraju
S. Sivaranjani
Ji Liu
Vijay Gupta
Brian M Sadler
231
3
0
06 Mar 2024
Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning
Dohyeong Kim
Mineui Hong
Jeongho Park
Songhwai Oh
334
3
0
01 Mar 2024
Truly No-Regret Learning in Constrained MDPs
Adrian Müller
Pragnya Alatur
Volkan Cevher
Giorgia Ramponi
Niao He
446
17
0
24 Feb 2024
A Survey of Constraint Formulations in Safe Reinforcement Learning
Akifumi Wachi
Xun Shen
Yanan Sui
362
46
0
03 Feb 2024
Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration
Honghao Wei
Xin Liu
Lei Ying
223
7
0
22 Dec 2023
A safe exploration approach to constrained Markov decision processes
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Tingting Ni
Maryam Kamgarpour
403
5
0
01 Dec 2023
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Neural Information Processing Systems (NeurIPS), 2023
Dongsheng Ding
Chen-Yu Wei
Jianchao Tan
Alejandro Ribeiro
417
31
0
20 Jun 2023
A Primal-Dual-Critic Algorithm for Offline Constrained Reinforcement Learning
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Kihyuk Hong
Yuhang Li
Ambuj Tewari
OffRL
393
11
0
13 Jun 2023
Cancellation-Free Regret Bounds for Lagrangian Approaches in Constrained Markov Decision Processes
A. Müller
Pragnya Alatur
Giorgia Ramponi
Niao He
353
8
0
12 Jun 2023
Provably Efficient Generalized Lagrangian Policy Optimization for Safe Multi-Agent Reinforcement Learning
Conference on Learning for Dynamics & Control (L4DC), 2023
Dongsheng Ding
Xiaohan Wei
Zhuoran Yang
Zhaoran Wang
Mihailo R. Jovanović
OffRL
397
15
0
31 May 2023
Scalable Primal-Dual Actor-Critic Method for Safe Multi-Agent RL with General Utilities
Neural Information Processing Systems (NeurIPS), 2023
Donghao Ying
Yunkai Zhang
Yuhao Ding
Alec Koppel
Javad Lavaei
425
22
0
27 May 2023
Long-Term Fairness with Unknown Dynamics
Neural Information Processing Systems (NeurIPS), 2023
Tongxin Yin
Reilly P. Raab
M. Liu
Yang Liu
FaML
318
29
0
19 Apr 2023
A Near-Optimal Algorithm for Safe Reinforcement Learning Under Instantaneous Hard Constraints
International Conference on Machine Learning (ICML), 2023
Ming Shi
Yitao Liang
Ness B. Shroff
228
18
0
08 Feb 2023
Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation
K. C. Kalagarla
Rahul Jain
Pierluigi Nuzzo
243
6
0
27 Jan 2023
Trust Region-Based Safe Distributional Reinforcement Learning for Multiple Constraints
Neural Information Processing Systems (NeurIPS), 2023
Dohyeong Kim
Kyungjae Lee
Songhwai Oh
291
21
0
26 Jan 2023
Constrained Reinforcement Learning via Dissipative Saddle Flow Dynamics
Asilomar Conference on Signals, Systems and Computers (ACSSC), 2022
Tianqi Zheng
Pengcheng You
Enrique Mallada
219
4
0
03 Dec 2022
Learning Globally Smooth Functions on Manifolds
International Conference on Machine Learning (ICML), 2022
J. Cerviño
Luiz F. O. Chamon
B. Haeffele
René Vidal
Alejandro Ribeiro
589
6
0
01 Oct 2022
Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments
International Conference on Machine Learning (ICML), 2022
Yixuan Wang
S. Zhan
Ruochen Jiao
Zhilu Wang
Wanxin Jin
Zhuoran Yang
Zhaoran Wang
Chao Huang
Qi Zhu
420
78
0
29 Sep 2022
A Near-Optimal Primal-Dual Method for Off-Policy Learning in CMDP
Neural Information Processing Systems (NeurIPS), 2022
Fan Chen
Junyu Zhang
Zaiwen Wen
OffRL
264
13
0
13 Jul 2022
Provably Efficient Model-Free Constrained RL with Linear Function Approximation
Neural Information Processing Systems (NeurIPS), 2022
A. Ghosh
Xingyu Zhou
Ness B. Shroff
448
36
0
23 Jun 2022
Near-Optimal Sample Complexity Bounds for Constrained MDPs
Neural Information Processing Systems (NeurIPS), 2022
Sharan Vaswani
Lin F. Yang
Csaba Szepesvári
319
45
0
13 Jun 2022
Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Ruida Zhou
Tao-Wen Liu
D. Kalathil
P. R. Kumar
Chao Tian
262
20
0
10 Jun 2022
Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs
Dongsheng Ding
Jianchao Tan
Jiali Duan
Tamer Başar
Mihailo R. Jovanović
406
24
0
06 Jun 2022
A Review of Safe Reinforcement Learning: Methods, Theory and Applications
Shangding Gu
Longyu Yang
Yali Du
Guang Chen
Florian Walter
Jun Wang
Alois C. Knoll
OffRL
AI4TS
677
318
0
20 May 2022
Challenging Common Assumptions in Convex Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Mirco Mutti
Ric De Santi
Piersilvio De Bartolomeis
Marcello Restelli
OffRL
451
27
0
03 Feb 2022
Concave Utility Reinforcement Learning with Zero-Constraint Violations
Mridul Agarwal
Qinbo Bai
Vaneet Aggarwal
471
17
0
12 Sep 2021
Scheduling and Power Control for Wireless Multicast Systems via Deep Reinforcement Learning
R. Raghu
M. Panju
Vaneet Aggarwal
V. Sharma
189
6
0
27 Sep 2020