ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2109.05439
  4. Cited By
Concave Utility Reinforcement Learning with Zero-Constraint Violations

Concave Utility Reinforcement Learning with Zero-Constraint Violations

12 September 2021
Mridul Agarwal
Qinbo Bai
Vaneet Aggarwal
ArXivPDFHTML

Papers citing "Concave Utility Reinforcement Learning with Zero-Constraint Violations"

9 / 9 papers shown
Title
Inverse Concave-Utility Reinforcement Learning is Inverse Game Theory
Inverse Concave-Utility Reinforcement Learning is Inverse Game Theory
M. Çelikok
F. Oliehoek
Jan Willem van de Meent
35
0
0
29 May 2024
Quantum Speedups in Regret Analysis of Infinite Horizon Average-Reward
  Markov Decision Processes
Quantum Speedups in Regret Analysis of Infinite Horizon Average-Reward Markov Decision Processes
Bhargav Ganguly
Yang Xu
Vaneet Aggarwal
16
0
0
18 Oct 2023
Regret Analysis of Policy Gradient Algorithm for Infinite Horizon
  Average Reward Markov Decision Processes
Regret Analysis of Policy Gradient Algorithm for Infinite Horizon Average Reward Markov Decision Processes
Qinbo Bai
Washim Uddin Mondal
Vaneet Aggarwal
24
9
0
05 Sep 2023
Reinforcement Learning with Delayed, Composite, and Partially Anonymous
  Reward
Reinforcement Learning with Delayed, Composite, and Partially Anonymous Reward
Washim Uddin Mondal
Vaneet Aggarwal
32
2
0
04 May 2023
A Near-Optimal Primal-Dual Method for Off-Policy Learning in CMDP
A Near-Optimal Primal-Dual Method for Off-Policy Learning in CMDP
Fan Chen
Junyu Zhang
Zaiwen Wen
OffRL
31
8
0
13 Jul 2022
Learning Infinite-Horizon Average-Reward Markov Decision Processes with
  Constraints
Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints
Liyu Chen
R. Jain
Haipeng Luo
38
25
0
31 Jan 2022
Achieving Zero Constraint Violation for Constrained Reinforcement
  Learning via Primal-Dual Approach
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach
Qinbo Bai
Amrit Singh Bedi
Mridul Agarwal
Alec Koppel
Vaneet Aggarwal
99
56
0
13 Sep 2021
On the Convergence and Sample Efficiency of Variance-Reduced Policy
  Gradient Method
On the Convergence and Sample Efficiency of Variance-Reduced Policy Gradient Method
Junyu Zhang
Chengzhuo Ni
Zheng Yu
Csaba Szepesvári
Mengdi Wang
44
67
0
17 Feb 2021
Scheduling and Power Control for Wireless Multicast Systems via Deep
  Reinforcement Learning
Scheduling and Power Control for Wireless Multicast Systems via Deep Reinforcement Learning
R. Raghu
M. Panju
Vaneet Aggarwal
V. Sharma
23
4
0
27 Sep 2020
1