1803.00933
Distributed Prioritized Experience Replay
2 March 2018
Dan Horgan
John Quan
David Budden
Gabriel Barth-Maron
Matteo Hessel
H. V. Hasselt
David Silver
Papers citing "Distributed Prioritized Experience Replay"
50 / 373 papers shown
Title
How does AI play football? An analysis of RL and real-world football strategies
Atom Scott
Keisuke Fujii
Masaki Onishi
50
13
0
24 Nov 2021
Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration
Lu Zheng
Jiarui Chen
Jianhao Wang
Jiamin He
Yujing Hu
Yingfeng Chen
Changjie Fan
Yang Gao
Chongjie Zhang
16
82
0
22 Nov 2021
Fast and Data-Efficient Training of Rainbow: an Experimental Study on Atari
Dominik Schmidt
Thomas Schmied
OffRL
20
12
0
19 Nov 2021
CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms
Shengyi Huang
Rousslan Fernand Julien Dossa
Chang Ye
Jeff Braga
OffRL
11
0
0
16 Nov 2021
RLOps: Development Life-cycle of Reinforcement Learning Aided Open RAN
Peizheng Li
Jonathan D. Thomas
Xiaoyang Wang
Ahmed Khalil
A. Ahmad
...
S. Kapoor
Arjun Parekh
A. Doufexi
Arman Shojaeifard
Robert Piechocki
AI4TS
14
37
0
12 Nov 2021
Improving Experience Replay through Modeling of Similar Transitions' Sets
Daniel Eugênio Neves
João Pedro Oliveira Batisteli
Eduardo Felipe Lopes
Lucila Ishitani
Zenilton K. G. Patrocínio
OffRL
11
1
0
12 Nov 2021
On the Use and Misuse of Absorbing States in Multi-agent Reinforcement Learning
Andrew Cohen
Ervin Teng
Vincent-Pierre Berges
Ruo-Ping Dong
Hunter Henry
Marwan Mattar
Alexander Zook
Sujoy Ganguly
18
33
0
10 Nov 2021
Human-Level Control without Server-Grade Hardware
Brett Daley
Chris Amato
BDL
OffRL
8
0
0
01 Nov 2021
Cooperative Deep Q-learning Framework for Environments Providing Image Feedback
Krishnan Raghavan
Vignesh Narayanan
S. Jagannathan
VLM
OffRL
26
1
0
28 Oct 2021
Accelerating Distributed Deep Reinforcement Learning by In-Network Experience Sampling
Masaki Furukawa
Hiroki Matsutani
OffRL
16
1
0
26 Oct 2021
Distributed Multi-Agent Deep Reinforcement Learning Framework for Whole-building HVAC Control
Vinay Hanumaiah
Sahika Genc
AI4CE
8
6
0
26 Oct 2021
A Versatile and Efficient Reinforcement Learning Framework for Autonomous Driving
Guan-Bo Wang
Haoyi Niu
Desheng Zhu
Jianming Hu
Xianyuan Zhan
Guyue Zhou
OffRL
16
2
0
22 Oct 2021
Containerized Distributed Value-Based Multi-Agent Reinforcement Learning
Siyang Wu
Tonghan Wang
Chenghao Li
Yang Hu
Chongjie Zhang
OffRL
21
1
0
15 Oct 2021
Explaining Off-Policy Actor-Critic From A Bias-Variance Perspective
Ting-Han Fan
Peter J. Ramadge
CML
FAtt
OffRL
21
2
0
06 Oct 2021
Large Batch Experience Replay
Thibault Lahire
M. Geist
Emmanuel Rachelson
OffRL
48
13
0
04 Oct 2021
Parallel Actors and Learners: A Framework for Generating Scalable RL Implementations
Chi Zhang
S. Kuppannagari
Viktor Prasanna
OffRL
11
7
0
03 Oct 2021
Sim and Real: Better Together
Shirli Di-Castro Shashua
Dotan DiCastro
Shie Mannor
58
11
0
01 Oct 2021
Scalable Online Planning via Reinforcement Learning Fine-Tuning
Arnaud Fickinger
Hengyuan Hu
Brandon Amos
Stuart J. Russell
Noam Brown
49
21
0
30 Sep 2021
Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates
Romain Laroche
Rémi Tachet des Combes
40
8
0
29 Sep 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey
Amjad Yousef Majid
Serge Saaybi
Tomas van Rietbergen
Vincent François-Lavet
R. V. Prasad
Chris Verhoeven
OffRL
60
54
0
28 Sep 2021
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience
C. Banerjee
Zhiyong Chen
N. Noman
11
30
0
24 Sep 2021
CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research
Chris Cummins
Bram Wasti
Jiadong Guo
Brandon Cui
Jason Ansel
...
Jia-Wei Liu
O. Teytaud
Benoit Steiner
Yuandong Tian
Hugh Leather
31
68
0
17 Sep 2021
Evolutionary Self-Replication as a Mechanism for Producing Artificial Intelligence
Samuel Schmidgall
Joe Hays
35
1
0
16 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
36
92
0
14 Sep 2021
Learning Selective Communication for Multi-Agent Path Finding
Ziyuan Ma
Yudong Luo
Jia Pan
AI4CE
39
48
0
12 Sep 2021
Event-Based Communication in Distributed Q-Learning
Daniel Jarne Ornia
M. Mazo
23
1
0
03 Sep 2021
Federated Reinforcement Learning: Techniques, Applications, and Open Challenges
Jiaju Qi
Qihao Zhou
Lei Lei
Kan Zheng
FedML
31
145
0
26 Aug 2021
When should agents explore?
Miruna Pislar
David Szepesvari
Georg Ostrovski
Diana Borsa
Tom Schaul
40
22
0
26 Aug 2021
Neural-to-Tree Policy Distillation with Policy Improvement Criterion
Zhaorong Li
Yang Yu
Yingfeng Chen
Ke Chen
Zhipeng Hu
Changjie Fan
15
5
0
16 Aug 2021
QoS-Aware Scheduling in New Radio Using Deep Reinforcement Learning
Jakob Stigenberg
Vidit Saxena
Soma Tayamon
E. Ghadimi
14
2
0
14 Jul 2021
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
25
6
0
07 Jul 2021
Understanding Adversarial Attacks on Observations in Deep Reinforcement Learning
You Qiaoben
Chengyang Ying
Xinning Zhou
Hang Su
Jun Zhu
Bo Zhang
AAML
25
14
0
30 Jun 2021
Action Set Based Policy Optimization for Safe Power Grid Management
Bo Zhou
Hongsheng Zeng
Yuecheng Liu
Kejiao Li
Fan Wang
Hao Tian
16
21
0
29 Jun 2021
Distributed Heuristic Multi-Agent Path Finding with Communication
Ziyuan Ma
Yudong Luo
Hang Ma
15
69
0
21 Jun 2021
Efficient Continuous Control with Double Actors and Regularized Critics
Jiafei Lyu
Xiaoteng Ma
Jiangpeng Yan
Xiu Li
OffRL
11
48
0
06 Jun 2021
A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning
Mingde Zhao
Zhen Liu
Sitao Luan
Shuyuan Zhang
Doina Precup
Yoshua Bengio
31
37
0
03 Jun 2021
Multi-Agent Deep Reinforcement Learning using Attentive Graph Neural Architectures for Real-Time Strategy Games
Won Joon Yun
Sungwon Yi
Joongheon Kim
20
10
0
21 May 2021
Reinforcement Learning for Ridesharing: An Extended Survey
Zhiwei Qin
Hongtu Zhu
Jieping Ye
39
84
0
03 May 2021
Network Defense is Not a Game
Andres Molina-Markham
Ransom K. Winder
Ahmad Ridley
AAML
17
11
0
20 Apr 2021
Podracer architectures for scalable Reinforcement Learning
Matteo Hessel
M. Kroiss
Aidan Clark
Iurii Kemaev
John Quan
Thomas Keck
Fabio Viola
H. V. Hasselt
11
37
0
13 Apr 2021
Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World
Florian Laurent
Manuel Schneider
Christian Scheller
J. Watson
Jiaoyang Li
...
Nilabha Bhattacharya
Shivam Agarwal
A. Egli
Erik Nygren
Sharada Mohanty
36
28
0
30 Mar 2021
Robust Multi-Modal Policies for Industrial Assembly via Reinforcement Learning and Demonstrations: A Large-Scale Study
Jianlan Luo
Oleg O. Sushkov
Rugile Pevceviciute
Wenzhao Lian
Chang Su
Mel Vecerík
Ning Ye
S. Schaal
Jonathan Scholz
OffRL
21
60
0
21 Mar 2021
Large Batch Simulation for Deep Reinforcement Learning
Brennan Shacklett
Erik Wijmans
Aleksei Petrenko
Manolis Savva
Dhruv Batra
V. Koltun
Kayvon Fatahalian
3DV
OffRL
AI4CE
27
26
0
12 Mar 2021
Analyzing the Hidden Activations of Deep Policy Networks: Why Representation Matters
Trevor A. McInroe
Michael Spurrier
J. Sieber
Stephen Conneely
6
0
0
11 Mar 2021
Provably Efficient Cooperative Multi-Agent Reinforcement Learning with Function Approximation
Abhimanyu Dubey
Alex Pentland
19
23
0
08 Mar 2021
Off-Belief Learning
Hengyuan Hu
Adam Lerer
Brandon Cui
David J. Wu
Luis Pineda
Noam Brown
Jakob N. Foerster
OffRL
13
69
0
06 Mar 2021
Low-Precision Reinforcement Learning: Running Soft Actor-Critic in Half Precision
Johan Bjorck
Xiangyu Chen
Christopher De Sa
Carla P. Gomes
Kilian Q. Weinberger
23
6
0
26 Feb 2021
Synthetic Returns for Long-Term Credit Assignment
David Raposo
Samuel Ritter
Adam Santoro
Greg Wayne
T. Weber
M. Botvinick
H. V. Hasselt
Francis Song
AI4TS
13
34
0
24 Feb 2021
Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning
Victor Campos
Pablo Sprechmann
S. Hansen
André Barreto
Steven Kapturowski
Alex Vitvitskyi
Adria Puigdomenech Badia
Charles Blundell
OffRL
OnRL
33
25
0
24 Feb 2021
Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive Environments
Dmitry Ivanov
Vladimir Egorov
A. Shpilman
14
5
0
24 Feb 2021