ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2206.05357
  4. Cited By
Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective
  Reinforcement Learning

Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning

10 June 2022
Ruida Zhou
Tao-Wen Liu
D. Kalathil
P. R. Kumar
Chao Tian
ArXivPDFHTML

Papers citing "Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning"

10 / 10 papers shown
Title
Session-Level Dynamic Ad Load Optimization using Offline Robust Reinforcement Learning
Session-Level Dynamic Ad Load Optimization using Offline Robust Reinforcement Learning
Tao Liu
Qi Xu
Wei Shi
Zhigang Hua
Shuang Yang
OffRL
38
0
0
09 Jan 2025
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
Ruohong Liu
Yuxin Pan
Linjie Xu
Lei Song
Jiang Bian
Pengcheng You
Yize Chen
35
0
0
03 Oct 2024
Linear Convergence of Independent Natural Policy Gradient in Games with
  Entropy Regularization
Linear Convergence of Independent Natural Policy Gradient in Games with Entropy Regularization
Youbang Sun
Tao-Wen Liu
P. R. Kumar
Shahin Shahrampour
37
0
0
04 May 2024
Provably Fast Convergence of Independent Natural Policy Gradient for
  Markov Potential Games
Provably Fast Convergence of Independent Natural Policy Gradient for Markov Potential Games
Youbang Sun
Tao-Wen Liu
Ruida Zhou
P. R. Kumar
Shahin Shahrampour
28
11
0
15 Oct 2023
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation
  Strategies towards Equal Long-term Benefit Rate
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate
Yuancheng Xu
Chenghao Deng
Yanchao Sun
Ruijie Zheng
Xiyao Wang
Jieyu Zhao
Furong Huang
27
4
0
07 Sep 2023
Natural Actor-Critic for Robust Reinforcement Learning with Function
  Approximation
Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation
Ruida Zhou
Tao-Wen Liu
Min Cheng
D. Kalathil
P. R. Kumar
Chao Tian
37
19
0
17 Jul 2023
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for
  Constrained MDPs
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Dongsheng Ding
Chen-Yu Wei
K. Zhang
Alejandro Ribeiro
38
19
0
20 Jun 2023
DOPE: Doubly Optimistic and Pessimistic Exploration for Safe
  Reinforcement Learning
DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning
Archana Bura
Aria HasanzadeZonuzy
D. Kalathil
S. Shakkottai
J. Chamberland
17
28
0
01 Dec 2021
Achieving Zero Constraint Violation for Constrained Reinforcement
  Learning via Primal-Dual Approach
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach
Qinbo Bai
Amrit Singh Bedi
Mridul Agarwal
Alec Koppel
Vaneet Aggarwal
99
56
0
13 Sep 2021
Policy Mirror Descent for Reinforcement Learning: Linear Convergence,
  New Sampling Complexity, and Generalized Problem Classes
Policy Mirror Descent for Reinforcement Learning: Linear Convergence, New Sampling Complexity, and Generalized Problem Classes
Guanghui Lan
89
136
0
30 Jan 2021
1