ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2201.02373
  4. Cited By
Mirror Learning: A Unifying Framework of Policy Optimisation

Mirror Learning: A Unifying Framework of Policy Optimisation

7 January 2022
J. Kuba
Christian Schroeder de Witt
Jakob N. Foerster
ArXivPDFHTML

Papers citing "Mirror Learning: A Unifying Framework of Policy Optimisation"

17 / 17 papers shown
Title
Mirror Descent Actor Critic via Bounded Advantage Learning
Mirror Descent Actor Critic via Bounded Advantage Learning
Ryo Iwaki
93
0
0
06 Feb 2025
Synthesis of Model Predictive Control and Reinforcement Learning: Survey and Classification
Synthesis of Model Predictive Control and Reinforcement Learning: Survey and Classification
Rudolf Reiter
Jasper Hoffmann
D. Reinhardt
Florian Messerer
Katrin Baumgärtner
Shamburaj Sawant
Joschka Boedecker
Moritz Diehl
S. Gros
79
5
0
04 Feb 2025
Accelerating Proximal Policy Optimization Learning Using Task Prediction
  for Solving Environments with Delayed Rewards
Accelerating Proximal Policy Optimization Learning Using Task Prediction for Solving Environments with Delayed Rewards
A. Ahmad
Mehdi Kermanshah
Kevin J. Leahy
Zachary Serlin
H. Siu
Makai Mann
C. Vasile
Roberto Tron
C. Belta
OffRL
66
0
0
26 Nov 2024
Beyond the Boundaries of Proximal Policy Optimization
Beyond the Boundaries of Proximal Policy Optimization
Charlie B. Tan
Edan Toledo
Benjamin Ellis
Jakob Foerster
Ferenc Huszár
21
0
0
01 Nov 2024
Dual Approximation Policy Optimization
Dual Approximation Policy Optimization
Zhihan Xiong
Maryam Fazel
Lin Xiao
28
1
0
02 Oct 2024
AC4MPC: Actor-Critic Reinforcement Learning for Nonlinear Model
  Predictive Control
AC4MPC: Actor-Critic Reinforcement Learning for Nonlinear Model Predictive Control
Rudolf Reiter
Andrea Ghezzi
Katrin Baumgärtner
Jasper Hoffmann
Robert D. McAllister
Moritz Diehl
34
6
0
06 Jun 2024
Discovering Temporally-Aware Reinforcement Learning Algorithms
Discovering Temporally-Aware Reinforcement Learning Algorithms
Matthew Jackson
Chris Xiaoxuan Lu
Louis Kirsch
R. T. Lange
Shimon Whiteson
Jakob N. Foerster
19
18
0
08 Feb 2024
Learning mirror maps in policy mirror descent
Learning mirror maps in policy mirror descent
Carlo Alfano
Sebastian Towers
Silvia Sapora
Chris Xiaoxuan Lu
Patrick Rebeschini
30
0
0
07 Feb 2024
The Definitive Guide to Policy Gradients in Deep Reinforcement Learning:
  Theory, Algorithms and Implementations
The Definitive Guide to Policy Gradients in Deep Reinforcement Learning: Theory, Algorithms and Implementations
Matthias Lehmann
38
0
0
24 Jan 2024
Challenges for Reinforcement Learning in Quantum Circuit Design
Challenges for Reinforcement Learning in Quantum Circuit Design
Philipp Altmann
Jonas Stein
Michael Kolle
Adelina Barligea
Thomas Gabor
Thomy Phan
Sebastian Feld
Claudia Linnhoff-Popien
22
4
0
18 Dec 2023
ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive
  Advantages
ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages
Andrew Jesson
Chris Xiaoxuan Lu
Gunshi Gupta
Angelos Filos
Jakob N. Foerster
Y. Gal
OffRL
25
5
0
02 Jun 2023
Heterogeneous-Agent Reinforcement Learning
Heterogeneous-Agent Reinforcement Learning
Yifan Zhong
J. Kuba
Xidong Feng
Siyi Hu
Jiaming Ji
Yaodong Yang
18
36
0
19 Apr 2023
A Novel Framework for Policy Mirror Descent with General
  Parameterization and Linear Convergence
A Novel Framework for Policy Mirror Descent with General Parameterization and Linear Convergence
Carlo Alfano
Rui Yuan
Patrick Rebeschini
57
15
0
30 Jan 2023
Proximal Learning With Opponent-Learning Awareness
Proximal Learning With Opponent-Learning Awareness
S. Zhao
Chris Xiaoxuan Lu
Roger C. Grosse
Jakob N. Foerster
29
21
0
18 Oct 2022
Discovered Policy Optimisation
Discovered Policy Optimisation
Chris Xiaoxuan Lu
J. Kuba
Alistair Letcher
Luke Metz
Christian Schroeder de Witt
Jakob N. Foerster
OffRL
39
74
0
11 Oct 2022
Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to
  Cooperative MARL
Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
J. Kuba
Xidong Feng
Shiyao Ding
Hao Dong
Jun Wang
Yaodong Yang
18
16
0
02 Aug 2022
Policy Mirror Descent for Reinforcement Learning: Linear Convergence,
  New Sampling Complexity, and Generalized Problem Classes
Policy Mirror Descent for Reinforcement Learning: Linear Convergence, New Sampling Complexity, and Generalized Problem Classes
Guanghui Lan
89
136
0
30 Jan 2021
1