2109.05439
Concave Utility Reinforcement Learning with Zero-Constraint Violations
12 September 2021
Mridul Agarwal
Qinbo Bai
Vaneet Aggarwal
Papers citing "Concave Utility Reinforcement Learning with Zero-Constraint Violations"
9 / 9 papers shown
Title
Inverse Concave-Utility Reinforcement Learning is Inverse Game Theory
M. Çelikok
F. Oliehoek
Jan Willem van de Meent
35
0
0
29 May 2024
Quantum Speedups in Regret Analysis of Infinite Horizon Average-Reward Markov Decision Processes
Bhargav Ganguly
Yang Xu
Vaneet Aggarwal
16
0
0
18 Oct 2023
Regret Analysis of Policy Gradient Algorithm for Infinite Horizon Average Reward Markov Decision Processes
Qinbo Bai
Washim Uddin Mondal
Vaneet Aggarwal
24
9
0
05 Sep 2023
Reinforcement Learning with Delayed, Composite, and Partially Anonymous Reward
Washim Uddin Mondal
Vaneet Aggarwal
32
2
0
04 May 2023
A Near-Optimal Primal-Dual Method for Off-Policy Learning in CMDP
Fan Chen
Junyu Zhang
Zaiwen Wen
OffRL
31
8
0
13 Jul 2022
Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints
Liyu Chen
R. Jain
Haipeng Luo
38
25
0
31 Jan 2022
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach
Qinbo Bai
Amrit Singh Bedi
Mridul Agarwal
Alec Koppel
Vaneet Aggarwal
99
56
0
13 Sep 2021
On the Convergence and Sample Efficiency of Variance-Reduced Policy Gradient Method
Junyu Zhang
Chengzhuo Ni
Zheng Yu
Csaba Szepesvári
Mengdi Wang
44
67
0
17 Feb 2021
Scheduling and Power Control for Wireless Multicast Systems via Deep Reinforcement Learning
R. Raghu
M. Panju
Vaneet Aggarwal
V. Sharma
23
4
0
27 Sep 2020