Mirror Learning: A Unifying Framework of Policy Optimisation

7 January 2022

Papers citing "Mirror Learning: A Unifying Framework of Policy Optimisation"


Title | Authors | Tag and counts | Date
Mirror Descent Actor Critic via Bounded Advantage Learning | Ryo Iwaki | 93 0 0 | 06 Feb 2025
Synthesis of Model Predictive Control and Reinforcement Learning: Survey and Classification | Rudolf Reiter, Jasper Hoffmann, D. Reinhardt, Florian Messerer, Katrin Baumgärtner, Shamburaj Sawant, Joschka Boedecker, Moritz Diehl, S. Gros | 79 5 0 | 04 Feb 2025
Accelerating Proximal Policy Optimization Learning Using Task Prediction for Solving Environments with Delayed Rewards | A. Ahmad, Mehdi Kermanshah, Kevin J. Leahy, Zachary Serlin, H. Siu, Makai Mann, C. Vasile, Roberto Tron, C. Belta | OffRL 66 0 0 | 26 Nov 2024
Beyond the Boundaries of Proximal Policy Optimization | Charlie B. Tan, Edan Toledo, Benjamin Ellis, Jakob Foerster, Ferenc Huszár | 21 0 0 | 01 Nov 2024
Dual Approximation Policy Optimization | Zhihan Xiong, Maryam Fazel, Lin Xiao | 28 1 0 | 02 Oct 2024
AC4MPC: Actor-Critic Reinforcement Learning for Nonlinear Model Predictive Control | Rudolf Reiter, Andrea Ghezzi, Katrin Baumgärtner, Jasper Hoffmann, Robert D. McAllister, Moritz Diehl | 34 6 0 | 06 Jun 2024
Discovering Temporally-Aware Reinforcement Learning Algorithms | Matthew Jackson, Chris Xiaoxuan Lu, Louis Kirsch, R. T. Lange, Shimon Whiteson, Jakob N. Foerster | 19 18 0 | 08 Feb 2024
Learning mirror maps in policy mirror descent | Carlo Alfano, Sebastian Towers, Silvia Sapora, Chris Xiaoxuan Lu, Patrick Rebeschini | 30 0 0 | 07 Feb 2024
The Definitive Guide to Policy Gradients in Deep Reinforcement Learning: Theory, Algorithms and Implementations | Matthias Lehmann | 38 0 0 | 24 Jan 2024
Challenges for Reinforcement Learning in Quantum Circuit Design | Philipp Altmann, Jonas Stein, Michael Kolle, Adelina Barligea, Thomas Gabor, Thomy Phan, Sebastian Feld, Claudia Linnhoff-Popien | 22 4 0 | 18 Dec 2023
ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages | Andrew Jesson, Chris Xiaoxuan Lu, Gunshi Gupta, Angelos Filos, Jakob N. Foerster, Y. Gal | OffRL 25 5 0 | 02 Jun 2023
Heterogeneous-Agent Reinforcement Learning | Yifan Zhong, J. Kuba, Xidong Feng, Siyi Hu, Jiaming Ji, Yaodong Yang | 18 36 0 | 19 Apr 2023
A Novel Framework for Policy Mirror Descent with General Parameterization and Linear Convergence | Carlo Alfano, Rui Yuan, Patrick Rebeschini | 57 15 0 | 30 Jan 2023
Proximal Learning With Opponent-Learning Awareness | S. Zhao, Chris Xiaoxuan Lu, Roger C. Grosse, Jakob N. Foerster | 29 21 0 | 18 Oct 2022
Discovered Policy Optimisation | Chris Xiaoxuan Lu, J. Kuba, Alistair Letcher, Luke Metz, Christian Schroeder de Witt, Jakob N. Foerster | OffRL 39 74 0 | 11 Oct 2022
Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL | J. Kuba, Xidong Feng, Shiyao Ding, Hao Dong, Jun Wang, Yaodong Yang | 18 16 0 | 02 Aug 2022
Policy Mirror Descent for Reinforcement Learning: Linear Convergence, New Sampling Complexity, and Generalized Problem Classes | Guanghui Lan | 89 136 0 | 30 Jan 2021