Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2306.14133
Cited By
Provably Convergent Policy Optimization via Metric-aware Trust Region Methods
25 June 2023
Jun Song
Niao He
Lijun Ding
Chaoyue Zhao
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Provably Convergent Policy Optimization via Metric-aware Trust Region Methods"
2 / 2 papers shown
Title
Policy Mirror Descent for Reinforcement Learning: Linear Convergence, New Sampling Complexity, and Generalized Problem Classes
Guanghui Lan
87
135
0
30 Jan 2021
On Linear Convergence of Policy Gradient Methods for Finite MDPs
Jalaj Bhandari
Daniel Russo
48
59
0
21 Jul 2020
1