Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2206.05357
Cited By
Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning
10 June 2022
Ruida Zhou
Tao-Wen Liu
D. Kalathil
P. R. Kumar
Chao Tian
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning"
10 / 10 papers shown
Title
Session-Level Dynamic Ad Load Optimization using Offline Robust Reinforcement Learning
Tao Liu
Qi Xu
Wei Shi
Zhigang Hua
Shuang Yang
OffRL
38
0
0
09 Jan 2025
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
Ruohong Liu
Yuxin Pan
Linjie Xu
Lei Song
Jiang Bian
Pengcheng You
Yize Chen
35
0
0
03 Oct 2024
Linear Convergence of Independent Natural Policy Gradient in Games with Entropy Regularization
Youbang Sun
Tao-Wen Liu
P. R. Kumar
Shahin Shahrampour
37
0
0
04 May 2024
Provably Fast Convergence of Independent Natural Policy Gradient for Markov Potential Games
Youbang Sun
Tao-Wen Liu
Ruida Zhou
P. R. Kumar
Shahin Shahrampour
28
11
0
15 Oct 2023
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate
Yuancheng Xu
Chenghao Deng
Yanchao Sun
Ruijie Zheng
Xiyao Wang
Jieyu Zhao
Furong Huang
27
4
0
07 Sep 2023
Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation
Ruida Zhou
Tao-Wen Liu
Min Cheng
D. Kalathil
P. R. Kumar
Chao Tian
37
19
0
17 Jul 2023
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Dongsheng Ding
Chen-Yu Wei
K. Zhang
Alejandro Ribeiro
38
19
0
20 Jun 2023
DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning
Archana Bura
Aria HasanzadeZonuzy
D. Kalathil
S. Shakkottai
J. Chamberland
17
28
0
01 Dec 2021
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach
Qinbo Bai
Amrit Singh Bedi
Mridul Agarwal
Alec Koppel
Vaneet Aggarwal
99
56
0
13 Sep 2021
Policy Mirror Descent for Reinforcement Learning: Linear Convergence, New Sampling Complexity, and Generalized Problem Classes
Guanghui Lan
89
136
0
30 Jan 2021
1