Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2310.02951
Cited By
A Fisher-Rao gradient flow for entropy-regularised Markov decision processes in Polish spaces
4 October 2023
B. Kerimkulov
J. Leahy
David Siska
Lukasz Szpruch
Yufei Zhang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Fisher-Rao gradient flow for entropy-regularised Markov decision processes in Polish spaces"
4 / 4 papers shown
Title
Optimal Rates of Convergence for Entropy Regularization in Discounted Markov Decision Processes
Johannes Muller
Semih Cayci
41
0
0
06 Jun 2024
Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies
Bin Hu
Kaipeng Zhang
Na Li
M. Mesbahi
Maryam Fazel
Tamer Bacsar
87
27
0
10 Oct 2022
On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces
Amrit Singh Bedi
Souradip Chakraborty
Anjaly Parayil
Brian M Sadler
Pratap Tokekar
Alec Koppel
43
17
0
28 Jan 2022
Policy Mirror Descent for Reinforcement Learning: Linear Convergence, New Sampling Complexity, and Generalized Problem Classes
Guanghui Lan
89
136
0
30 Jan 2021
1