Distributed Prioritized Experience Replay

2 March 2018
Dan Horgan
John Quan
David Budden
Gabriel Barth-Maron
Matteo Hessel
H. V. Hasselt
David Silver
Papers citing "Distributed Prioritized Experience Replay"

23 / 373 papers shown
One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL
T. Paine
Sergio Gomez Colmenarejo
Ziyun Wang
Scott E. Reed
Y. Aytar
...
Matthew W. Hoffman
Gabriel Barth-Maron
Serkan Cabi
David Budden
Nando de Freitas
OffRL
14
26
0
11 Oct 2018
Improvements on Hindsight Learning
A. Deshpande
Srikanth Sarma
Ashutosh Jha
Balaraman Ravindran
OffRL
11
3
0
16 Sep 2018
An Intriguing Failing of Convolutional Neural Networks and the CoordConv Solution
Rosanne Liu
Joel Lehman
Piero Molino
F. Such
Eric Frank
Alexander Sergeev
J. Yosinski
13
883
0
09 Jul 2018
Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion
Jacob Buckman
Danijar Hafner
George Tucker
E. Brevdo
Honglak Lee
11
328
0
04 Jul 2018
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization
Xiangxiang Chu
12
9
0
02 Jul 2018
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation
Dmitry Kalashnikov
A. Irpan
P. Pastor
Julian Ibarz
Alexander Herzog
...
Deirdre Quillen
E. Holly
Mrinal Kalakrishnan
Vincent Vanhoucke
Sergey Levine
4
1,446
0
27 Jun 2018
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
22
212
0
20 Jun 2018
Randomized Prior Functions for Deep Reinforcement Learning
Ian Osband
John Aslanides
Albin Cassirer
UQCV
BDL
21
373
0
08 Jun 2018
Deep Curiosity Search: Intra-Life Exploration Can Improve Performance on Challenging Deep Reinforcement Learning Problems
C. Stanton
Jeff Clune
LRM
25
41
0
01 Jun 2018
Observe and Look Further: Achieving Consistent Performance on Atari
Tobias Pohlen
Bilal Piot
Todd Hester
M. G. Azar
Dan Horgan
...
John Quan
Mel Vecerík
Matteo Hessel
Rémi Munos
Olivier Pietquin
12
120
0
29 May 2018
Playing hard exploration games by watching YouTube
Y. Aytar
Tobias Pfaff
David Budden
T. Paine
Ziyun Wang
Nando de Freitas
24
269
0
29 May 2018
Constrained Policy Improvement for Safe and Efficient Reinforcement Learning
Elad Sarafian
Aviv Tamar
Sarit Kraus
OffRL
24
11
0
20 May 2018
Distributed Distributional Deterministic Policy Gradients
Gabriel Barth-Maron
Matthew W. Hoffman
David Budden
Will Dabney
Dan Horgan
TB Dhruva
Alistair Muldal
N. Heess
Timothy Lillicrap
OffRL
34
475
0
23 Apr 2018
Learning Awareness Models
Brandon Amos
Laurent Dinh
Serkan Cabi
Thomas Rothörl
Sergio Gomez Colmenarejo
Alistair Muldal
Tom Erez
Yuval Tassa
Nando de Freitas
Misha Denil
11
44
0
17 Apr 2018
Accelerated Methods for Deep Reinforcement Learning
Adam Stooke
Pieter Abbeel
OffRL
OnRL
14
133
0
07 Mar 2018
Learning by Playing - Solving Sparse Reward Tasks from Scratch
Martin Riedmiller
Roland Hafner
Thomas Lampe
Michael Neunert
Jonas Degrave
T. Wiele
Volodymyr Mnih
N. Heess
Jost Tobias Springenberg
19
445
0
28 Feb 2018
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
D. Meger
OffRL
13
5,045
0
26 Feb 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
13
1,572
0
05 Feb 2018
RLlib: Abstractions for Distributed Reinforcement Learning
Eric Liang
Richard Liaw
Philipp Moritz
Robert Nishihara
Roy Fox
Ken Goldberg
Joseph E. Gonzalez
Michael I. Jordan
Ion Stoica
OffRL
AI4CE
15
173
0
26 Dec 2017
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning
F. Such
Vashisht Madhavan
Edoardo Conti
Joel Lehman
Kenneth O. Stanley
Jeff Clune
24
685
0
18 Dec 2017
Ray: A Distributed Framework for Emerging AI Applications
Philipp Moritz
Robert Nishihara
Stephanie Wang
Alexey Tumanov
Richard Liaw
...
Melih Elibol
Zongheng Yang
William Paul
Michael I. Jordan
Ion Stoica
GNN
13
1,222
0
16 Dec 2017
Deep Learning for Video Game Playing
Niels Justesen
Philip Bontrager
Julian Togelius
S. Risi
VLM
24
206
0
25 Aug 2017
Emergence of Locomotion Behaviours in Rich Environments
N. Heess
TB Dhruva
S. Sriram
Jay Lemmon
J. Merel
...
Tom Erez
Ziyun Wang
S. M. Ali Eslami
Martin Riedmiller
David Silver
131
928
0
07 Jul 2017