ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2206.05850
  4. Cited By
Achieving Zero Constraint Violation for Constrained Reinforcement
  Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm
v1v2 (latest)

Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm

AAAI Conference on Artificial Intelligence (AAAI), 2022
12 June 2022
Qinbo Bai
Amrit Singh Bedi
Vaneet Aggarwal
ArXiv (abs)PDFHTMLGithub

Papers citing "Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm"

13 / 13 papers shown
Primal-Only Actor Critic Algorithm for Robust Constrained Average Cost MDPs
Primal-Only Actor Critic Algorithm for Robust Constrained Average Cost MDPs
Anirudh Satheesh
Sooraj Sathish
Swetha Ganesh
Keenan Powell
Vaneet Aggarwal
125
0
0
07 Nov 2025
Adaptive Shielding for Safe Reinforcement Learning under Hidden-Parameter Dynamics Shifts
Adaptive Shielding for Safe Reinforcement Learning under Hidden-Parameter Dynamics Shifts
Minjae Kwon
Tyler Ingebrand
Ufuk Topcu
Lu Feng
300
1
0
20 May 2025
Polynomial-Time Approximability of Constrained Reinforcement Learning
Polynomial-Time Approximability of Constrained Reinforcement Learning
Jeremy McMahan
927
1
0
11 Feb 2025
Last-Iterate Convergence of General Parameterized Policies in
  Constrained MDPs
Last-Iterate Convergence of General Parameterized Policies in Constrained MDPs
Washim Uddin Mondal
Vaneet Aggarwal
374
1
0
21 Aug 2024
Last-Iterate Global Convergence of Policy Gradients for Constrained
  Reinforcement Learning
Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning
Alessandro Montenegro
Marco Mussi
Matteo Papini
Alberto Maria Metelli
BDL
221
3
0
15 Jul 2024
Deterministic Policies for Constrained Reinforcement Learning in
  Polynomial-Time
Deterministic Policies for Constrained Reinforcement Learning in Polynomial-TimeNeural Information Processing Systems (NeurIPS), 2024
Jeremy McMahan
283
3
0
23 May 2024
A safe exploration approach to constrained Markov decision processes
A safe exploration approach to constrained Markov decision processesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Tingting Ni
Maryam Kamgarpour
404
5
0
01 Dec 2023
Improved Sample Complexity Analysis of Natural Policy Gradient Algorithm
  with General Parameterization for Infinite Horizon Discounted Reward Markov
  Decision Processes
Improved Sample Complexity Analysis of Natural Policy Gradient Algorithm with General Parameterization for Infinite Horizon Discounted Reward Markov Decision ProcessesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Washim Uddin Mondal
Vaneet Aggarwal
334
23
0
18 Oct 2023
Enhancing Infrared Small Target Detection Robustness with Bi-Level
  Adversarial Framework
Enhancing Infrared Small Target Detection Robustness with Bi-Level Adversarial Framework
Zhu Liu
Zihang Chen
Jinyuan Liu
Long Ma
Xin-Yue Fan
Risheng Liu
AAML
462
1
0
03 Sep 2023
Mean-Field Approximation of Cooperative Constrained Multi-Agent
  Reinforcement Learning (CMARL)
Mean-Field Approximation of Cooperative Constrained Multi-Agent Reinforcement Learning (CMARL)
Washim Uddin Mondal
Vaneet Aggarwal
S. Ukkusuri
227
8
0
15 Sep 2022
Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs
Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs
Dongsheng Ding
Jianchao Tan
Jiali Duan
Tamer Bacsar
Mihailo R. Jovanović
406
24
0
06 Jun 2022
Achieving Zero Constraint Violation for Constrained Reinforcement
  Learning via Primal-Dual Approach
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach
Qinbo Bai
Amrit Singh Bedi
Mridul Agarwal
Alec Koppel
Vaneet Aggarwal
517
68
0
13 Sep 2021
Concave Utility Reinforcement Learning with Zero-Constraint Violations
Concave Utility Reinforcement Learning with Zero-Constraint Violations
Mridul Agarwal
Qinbo Bai
Vaneet Aggarwal
471
17
0
12 Sep 2021
1
Page 1 of 1