Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER

2 December 2020
Markus Holzleitner
Lukas Gruber
Jose A. Arjona-Medina
Johannes Brandstetter
Sepp Hochreiter
arXiv: 2012.01399

Papers citing "Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER"

12 / 12 papers shown
Empirical Study on Robustness and Resilience in Cooperative Multi-Agent Reinforcement Learning
Simin Li
Zihao Mao
Hanxiao Li
Zonglei Jing
Zhuohang Bian
...
Yuqing Ma
Bo An
Yaodong Yang
Weifeng Lv
Xianglong Liu
216
0
0
13 Oct 2025
Adversarial Reinforcement Learning Framework for ESP Cheater Simulation
Inkyu Park
J. Lee
Taehwan Kwon
Juheon Choi
Seungku Kim
Junsu Kim
Kimin Lee
AAML
308
0
0
29 Sep 2025
Autonomous Curriculum Design via Relative Entropy Based Task Modifications
Muhammed Yusuf Satici
Jianxun Wang
David L. Roberts
184
1
0
28 Feb 2025
Value Improved Actor Critic Algorithms
Yaniv Oren
Moritz A. Zanger
Pascal R. van der Vaart
Mustafa Mert Celikok
M. Spaan
Wendelin Bohmer
OffRL
500
1
0
03 Jun 2024
Semantic HELM: A Human-Readable Memory for Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
Fabian Paischer
Thomas Adler
M. Hofmarcher
Sepp Hochreiter
407
18
0
15 Jun 2023
Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence
Neural Networks (Neural Netw.), 2023
Simin Li
Jun Guo
Jingqiao Xiu
Pu Feng
Xin Yu
Aishan Liu
Wenjun Wu
Xianglong Liu
AAML
470
30
0
07 Feb 2023
Bridging Physics-Informed Neural Networks with Reinforcement Learning: Hamilton-Jacobi-Bellman Proximal Policy Optimization (HJBPPO)
Amartya Mukherjee
Jun Liu
227
19
0
01 Feb 2023
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
C. Steinparz
Thomas Schmied
Fabian Paischer
Marius-Constantin Dinu
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
CLL
354
17
0
12 Jul 2022
Learning Long-Term Reward Redistribution via Randomized Return Decomposition
International Conference on Learning Representations (ICLR), 2021
Zhizhou Ren
Ruihan Guo
Yuanshuo Zhou
Jian-wei Peng
411
44
0
26 Nov 2021
Proximal Policy Optimization for Tracking Control Exploiting Future Reference Information
Jana Mayer
Johannes Westermann
Juan Pedro Gutiérrez H. Muriedas
Uwe Mettin
A. Lampe
OffRL
199
2
0
20 Jul 2021
Online Algorithms and Policies Using Adaptive and Machine Learning Approaches
IEEE Transactions on Automatic Control (IEEE TAC), 2021
Anuradha M. Annaswamy
A. Guha
Yingnan Cui
Sunbochen Tang
Peter A. Fisher
Joseph E. Gaudio
401
40
0
13 May 2021
Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution
Vihang Patil
M. Hofmarcher
Marius-Constantin Dinu
Matthias Dorfer
P. Blies
Johannes Brandstetter
Jose A. Arjona-Medina
Sepp Hochreiter
431
46
0
29 Sep 2020