Distributed Prioritized Experience Replay

2 March 2018

Dan Horgan

David Budden

David Silver

Papers citing "Distributed Prioritized Experience Replay"

23 / 373 papers shown

Title
One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL T. Paine Sergio Gomez Colmenarejo Ziyun Wang Scott E. Reed Y. Aytar ... Matthew W. Hoffman Gabriel Barth-Maron Serkan Cabi David Budden Nando de Freitas OffRL 14 26 0 11 Oct 2018
Improvements on Hindsight Learning A. Deshpande Srikanth Sarma Ashutosh Jha Balaraman Ravindran OffRL 11 3 0 16 Sep 2018
An Intriguing Failing of Convolutional Neural Networks and the CoordConv Solution Rosanne Liu Joel Lehman Piero Molino F. Such Eric Frank Alexander Sergeev J. Yosinski 13 883 0 09 Jul 2018
Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion Jacob Buckman Danijar Hafner George Tucker E. Brevdo Honglak Lee 11 328 0 04 Jul 2018
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization Xiangxiang Chu 12 9 0 02 Jul 2018
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation Dmitry Kalashnikov A. Irpan P. Pastor Julian Ibarz Alexander Herzog ... Deirdre Quillen E. Holly Mrinal Kalakrishnan Vincent Vanhoucke Sergey Levine 4 1,446 0 27 Jun 2018
RUDDER: Return Decomposition for Delayed Rewards Jose A. Arjona-Medina Michael Gillhofer Michael Widrich Thomas Unterthiner Johannes Brandstetter Sepp Hochreiter 22 212 0 20 Jun 2018
Randomized Prior Functions for Deep Reinforcement Learning Ian Osband John Aslanides Albin Cassirer UQCV BDL 21 373 0 08 Jun 2018
Deep Curiosity Search: Intra-Life Exploration Can Improve Performance on Challenging Deep Reinforcement Learning Problems C. Stanton Jeff Clune LRM 25 41 0 01 Jun 2018
Observe and Look Further: Achieving Consistent Performance on Atari Tobias Pohlen Bilal Piot Todd Hester M. G. Azar Dan Horgan ... John Quan Mel Vecerík Matteo Hessel Rémi Munos Olivier Pietquin 12 120 0 29 May 2018
Playing hard exploration games by watching YouTube Y. Aytar Tobias Pfaff David Budden T. Paine Ziyun Wang Nando de Freitas 24 269 0 29 May 2018
Constrained Policy Improvement for Safe and Efficient Reinforcement Learning Elad Sarafian Aviv Tamar Sarit Kraus OffRL 24 11 0 20 May 2018
Distributed Distributional Deterministic Policy Gradients Gabriel Barth-Maron Matthew W. Hoffman David Budden Will Dabney Dan Horgan TB Dhruva Alistair Muldal N. Heess Timothy Lillicrap OffRL 34 475 0 23 Apr 2018
Learning Awareness Models Brandon Amos Laurent Dinh Serkan Cabi Thomas Rothörl Sergio Gomez Colmenarejo Alistair Muldal Tom Erez Yuval Tassa Nando de Freitas Misha Denil 11 44 0 17 Apr 2018
Accelerated Methods for Deep Reinforcement Learning Adam Stooke Pieter Abbeel OffRL OnRL 14 133 0 07 Mar 2018
Learning by Playing - Solving Sparse Reward Tasks from Scratch Martin Riedmiller Roland Hafner Thomas Lampe Michael Neunert Jonas Degrave T. Wiele Volodymyr Mnih N. Heess Jost Tobias Springenberg 19 445 0 28 Feb 2018
Addressing Function Approximation Error in Actor-Critic Methods Scott Fujimoto H. V. Hoof D. Meger OffRL 13 5,045 0 26 Feb 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures L. Espeholt Hubert Soyer Rémi Munos Karen Simonyan Volodymyr Mnih ... Vlad Firoiu Tim Harley Iain Dunning Shane Legg Koray Kavukcuoglu 13 1,572 0 05 Feb 2018
RLlib: Abstractions for Distributed Reinforcement Learning Eric Liang Richard Liaw Philipp Moritz Robert Nishihara Roy Fox Ken Goldberg Joseph E. Gonzalez Michael I. Jordan Ion Stoica OffRL AI4CE 15 173 0 26 Dec 2017
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning F. Such Vashisht Madhavan Edoardo Conti Joel Lehman Kenneth O. Stanley Jeff Clune 24 685 0 18 Dec 2017
Ray: A Distributed Framework for Emerging AI Applications Philipp Moritz Robert Nishihara Stephanie Wang Alexey Tumanov Richard Liaw ... Melih Elibol Zongheng Yang William Paul Michael I. Jordan Ion Stoica GNN 13 1,222 0 16 Dec 2017
Deep Learning for Video Game Playing Niels Justesen Philip Bontrager Julian Togelius S. Risi VLM 24 206 0 25 Aug 2017
Emergence of Locomotion Behaviours in Rich Environments N. Heess TB Dhruva S. Sriram Jay Lemmon J. Merel ... Tom Erez Ziyun Wang S. M. Ali Eslami Martin Riedmiller David Silver 134 928 0 07 Jul 2017