Off-Policy Actor-Critic with Shared Experience Replay

Off-Policy Actor-Critic with Shared Experience Replay

25 September 2019

Papers citing "Off-Policy Actor-Critic with Shared Experience Replay"

17 / 17 papers shown

Title
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey Zhihong Liu Xin Xu Peng Qiao Dongsheng Li OffRL 22 2 0 08 Nov 2024
The Curse of Diversity in Ensemble-Based Exploration Zhixuan Lin P. DÓro Evgenii Nikishin Aaron C. Courville 42 1 0 07 May 2024
On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective Ying Wen Bo Liu M. Zhou Shufang Hou Zhe Cao Chenyang Le Jingxiao Chen Zheng Tian Weinan Zhang Jun Wang AI4CE 23 10 0 24 Dec 2022
Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning Wei Fu Chao Yu Zelai Xu Jiaqi Yang Yi Wu 34 32 0 15 Jun 2022
Social Network Structure Shapes Innovation: Experience-sharing in RL with SAPIENS Eleni Nisioti Matéo Mahaut Pierre-Yves Oudeyer Ida Momennejad Clément Moulin-Frier 21 9 0 10 Jun 2022
Non-Markovian policies occupancy measures Romain Laroche Rémi Tachet des Combes Jacob Buckman OffRL 37 1 0 27 May 2022
Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms Romain Laroche Rémi Tachet des Combes 46 2 0 15 Feb 2022
Model-Value Inconsistency as a Signal for Epistemic Uncertainty Angelos Filos Eszter Vértes Zita Marinho Gregory Farquhar Diana Borsa A. Friesen Feryal M. P. Behbahani Tom Schaul André Barreto Simon Osindero 44 7 0 08 Dec 2021
Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch Shangtong Zhang Rémi Tachet des Combes Romain Laroche 30 10 0 04 Nov 2021
Explaining Off-Policy Actor-Critic From A Bias-Variance Perspective Ting-Han Fan Peter J. Ramadge CML FAtt OffRL 21 2 0 06 Oct 2021
Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates Romain Laroche Rémi Tachet des Combes 46 8 0 29 Sep 2021
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience C. Banerjee Zhiyong Chen N. Noman 11 30 0 24 Sep 2021
Bootstrapped Meta-Learning Sebastian Flennerhag Yannick Schroecker Tom Zahavy Hado van Hasselt David Silver Satinder Singh 38 58 0 09 Sep 2021
An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning Changnan Xiao Haosen Shi Jiajun Fan Shihong Deng 18 5 0 01 Jun 2021
Non-local Policy Optimization via Diversity-regularized Collaborative Exploration Zhenghao Peng Hao Sun Bolei Zhou 15 18 0 14 Jun 2020
V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control H. F. Song A. Abdolmaleki Jost Tobias Springenberg Aidan Clark Hubert Soyer ... Dhruva Tirumala N. Heess Dan Belov Martin Riedmiller M. Botvinick 29 121 0 26 Sep 2019
Off-Policy Actor-Critic T. Degris Martha White R. Sutton OffRL CML 163 220 0 22 May 2012