arXiv:2209.10579
Cited By
First-order Policy Optimization for Robust Markov Decision Process
21 September 2022
Yan Li, Guanghui Lan, Tuo Zhao
Papers citing "First-order Policy Optimization for Robust Markov Decision Process"
5 / 5 papers shown
Policy Mirror Descent Inherently Explores Action Space
Yan Li, Guanghui Lan
OffRL
35 6 0
08 Mar 2023

Policy Gradient Method For Robust Reinforcement Learning
Yue Wang, Shaofeng Zou
29 58 0
15 May 2022

Twice regularized MDPs and the equivalence between robustness and regularization
E. Derman, M. Geist, Shie Mannor
19 42 0
12 Oct 2021

Online Robust Reinforcement Learning with Model Uncertainty
Yue Wang, Shaofeng Zou
OOD, OffRL
16 76 0
29 Sep 2021

Policy Mirror Descent for Reinforcement Learning: Linear Convergence, New Sampling Complexity, and Generalized Problem Classes
Guanghui Lan
40 120 0
30 Jan 2021