First-order Policy Optimization for Robust Markov Decision Process

21 September 2022

Papers citing "First-order Policy Optimization for Robust Markov Decision Process"

5 / 5 papers shown

Title
Policy Mirror Descent Inherently Explores Action Space Yan Li Guanghui Lan OffRL 35 6 0 08 Mar 2023
Policy Gradient Method For Robust Reinforcement Learning Yue Wang Shaofeng Zou 29 58 0 15 May 2022
Twice regularized MDPs and the equivalence between robustness and regularization E. Derman M. Geist Shie Mannor 19 42 0 12 Oct 2021
Online Robust Reinforcement Learning with Model Uncertainty Yue Wang Shaofeng Zou OOD OffRL 16 76 0 29 Sep 2021
Policy Mirror Descent for Reinforcement Learning: Linear Convergence, New Sampling Complexity, and Generalized Problem Classes Guanghui Lan 40 120 0 30 Jan 2021