Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms

15 February 2022

Papers citing "Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms"

3 / 3 papers shown

Title
Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees Hsin-En Su Yen-Ju Chen Ping-Chun Hsieh Xi Liu OffRL 18 0 0 10 Dec 2022
The Primacy Bias in Deep Reinforcement Learning Evgenii Nikishin Max Schwarzer P. DÓro Pierre-Luc Bacon Aaron C. Courville OnRL 88 178 0 16 May 2022
On the Sample Complexity of Actor-Critic Method for Reinforcement Learning with Function Approximation Harshat Kumar Alec Koppel Alejandro Ribeiro 99 79 0 18 Oct 2019