v1v2 (latest)

Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime

International Conference on Machine Learning (ICML), 2022

18 January 2022

Papers citing "Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime"

15 / 15 papers shown

Continuous Q-Score Matching: Diffusion Guided Reinforcement Learning for Continuous-Time Control

Chengxiu Hua

Jiawen Gu

Yushun Tang

299

20 Oct 2025

Phase Diagram of Dropout for Two-Layer Neural Networks in the Mean-Field Regime

Lénaic Chizat

Pierre Marion

Yerkin Yesbay

146

08 Oct 2025

RPRO: Ranked Preference Reinforcement Optimization for Enhancing Medical QA and Diagnostic Reasoning

304

31 Aug 2025

Efficient Computation of Blackwell Optimal Policies using Rational Functions

Dibyangshu Mukherjee

Shivaram Kalyanakrishnan

OffRL

111

25 Aug 2025

Mean-Field Generalisation Bounds for Learning Controls in Stochastic Environments

203

21 Aug 2025

Non-convex entropic mean-field optimization via Best Response flow

Razvan-Andrei Lascu

Mateusz B. Majka

351

28 May 2025

Meta-reinforcement learning with minimum attention

Pilhwa Lee

Shashank Gupta

OffRL

372

22 May 2025

Linear convergence of proximal descent schemes on the Wasserstein space

442

22 Nov 2024

A Fisher-Rao gradient flow for entropy-regularised Markov decision processes in Polish spaces

457

04 Oct 2023

Policy Optimization for Continuous Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023

460

30 May 2023

Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and Global Optimality

François Ged

M. H. Veiga

363

22 Mar 2023

Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-CriticInternational Conference on Machine Learning (ICML), 2023

328

28 Jan 2023

Geometry and convergence of natural policy gradient methodsInformation Geometry (IG), 2022

Johannes Muller

Guido Montúfar

379

03 Nov 2022

Convergence of policy gradient methods for finite-horizon exploratory linear-quadratic control problemsSIAM Journal of Control and Optimization (SICON), 2022

Michael Giegrich

Christoph Reisinger

Yufei Zhang

360

01 Nov 2022

Linear convergence of a policy gradient method for some finite horizon continuous time control problemsSIAM Journal of Control and Optimization (SICON), 2022

C. Reisinger

Wolfgang Stockinger

Yufei Zhang

476

22 Mar 2022