Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1912.06680
Cited By
Dota 2 with Large Scale Deep Reinforcement Learning
13 December 2019
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
Vicki Cheung
Przemyslaw Debiak
Christy Dennison
David Farhi
Quirin Fischer
Shariq Hashme
Christopher Hesse
Rafal Jozefowicz
Scott Gray
Catherine Olsson
J. Pachocki
Michael Petrov
Henrique Pondé de Oliveira Pinto
Jonathan Raiman
Tim Salimans
Jeremy Schlatter
Jonas Schneider
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Dota 2 with Large Scale Deep Reinforcement Learning"
41 / 991 papers shown
Title
Jukebox: A Generative Model for Music
Prafulla Dhariwal
Heewoo Jun
Christine Payne
Jong Wook Kim
Alec Radford
Ilya Sutskever
VLM
23
722
0
30 Apr 2020
Out-of-the-box channel pruned networks
Ragav Venkatesan
Gurumurthy Swaminathan
Xiong Zhou
Anna Luo
15
0
0
30 Apr 2020
Pseudo Rehearsal using non photo-realistic images
Bhasker Sri Harsha Suri
Kalidas Yeturu
17
1
0
28 Apr 2020
First return, then explore
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
36
348
0
27 Apr 2020
Reinforcement Learning Generalization with Surprise Minimization
Jerry Zikun Chen
OOD
11
19
0
26 Apr 2020
Policy Gradient from Demonstration and Curiosity
Jie Chen
Wenjun Xu
6
11
0
22 Apr 2020
Real World Games Look Like Spinning Tops
Wojciech M. Czarnecki
Gauthier Gidel
Brendan D. Tracey
K. Tuyls
Shayegan Omidshafiei
David Balduzzi
Max Jaderberg
7
96
0
20 Apr 2020
Weakly-Supervised Reinforcement Learning for Controllable Behavior
Lisa Lee
Benjamin Eysenbach
Ruslan Salakhutdinov
S. Gu
Chelsea Finn
SSL
4
26
0
06 Apr 2020
Benchmarking End-to-End Behavioural Cloning on Video Games
Anssi Kanervisto
J. Pussinen
Ville Hautamaki
OffRL
14
24
0
02 Apr 2020
Action Space Shaping in Deep Reinforcement Learning
Anssi Kanervisto
Christian Scheller
Ville Hautamaki
11
80
0
02 Apr 2020
Obstacle Tower Without Human Demonstrations: How Far a Deep Feed-Forward Network Goes with Reinforcement Learning
Marco Pleines
J. Jitsev
Mike Preuss
Frank Zimmer
11
2
0
01 Apr 2020
Mimicking Evolution with Reinforcement Learning
João Abrantes
A. Abrantes
F. Oliehoek
14
2
0
31 Mar 2020
Suphx: Mastering Mahjong with Deep Reinforcement Learning
Junjie Li
Sotetsu Koyamada
Qiwei Ye
Guoqing Liu
Chao Wang
Ruihan Yang
Li Zhao
Tao Qin
Tie-Yan Liu
H. Hon
11
123
0
30 Mar 2020
Counterfactual Policy Evaluation for Decision-Making in Autonomous Driving
Patrick Hart
Alois Knoll
CML
OffRL
9
3
0
20 Mar 2020
Robust Deep Reinforcement Learning against Adversarial Perturbations on State Observations
Huan Zhang
Hongge Chen
Chaowei Xiao
Bo-wen Li
Mingyan D. Liu
Duane S. Boning
Cho-Jui Hsieh
AAML
25
260
0
19 Mar 2020
End-to-End Deep Diagnosis of X-ray Images
Kudaibergen Urinbayev
Yerassyl Orazbek
Yernur Nurambek
A. Mirzakhmetov
H. A. Varol
MedIm
18
12
0
19 Mar 2020
Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions
Rui Wang
Joel Lehman
Aditya Rawal
Jiale Zhi
Yulun Li
Jeff Clune
Kenneth O. Stanley
6
123
0
19 Mar 2020
A Survey of End-to-End Driving: Architectures and Training Methods
Ardi Tampuu
Maksym Semikin
Naveed Muhammad
D. Fishman
Tambet Matiisen
3DV
18
228
0
13 Mar 2020
Taylor Expansion Policy Optimization
Yunhao Tang
Michal Valko
Rémi Munos
OffRL
4
14
0
13 Mar 2020
FormulaZero: Distributionally Robust Online Adaptation via Offline Population Synthesis
Aman Sinha
Matthew O'Kelly
Hongrui Zheng
Rahul Mangharam
John C. Duchi
Russ Tedrake
OffRL
66
26
0
09 Mar 2020
Deep Adversarial Reinforcement Learning for Object Disentangling
Melvin Laux
O. Arenz
Jan Peters
J. Pajarinen
DRL
9
3
0
08 Mar 2020
Finding online neural update rules by learning to remember
Karol Gregor
CLL
34
6
0
06 Mar 2020
Real-World Human-Robot Collaborative Reinforcement Learning
A. Shafti
Jonas Tjomsland
William Dudley
Aldo A. Faisal
23
17
0
02 Mar 2020
Efficient exploration of zero-sum stochastic games
Carlos Martin
T. Sandholm
12
5
0
24 Feb 2020
From Chess and Atari to StarCraft and Beyond: How Game AI is Driving the World of AI
S. Risi
Mike Preuss
27
56
0
24 Feb 2020
Training Question Answering Models From Synthetic Data
Raul Puri
Ryan Spring
M. Patwary
M. Shoeybi
Bryan Catanzaro
ELM
24
159
0
22 Feb 2020
Deep RL Agent for a Real-Time Action Strategy Game
Michal Warchalski
Dimitrije Radojević
M. Milosevic
9
0
0
15 Feb 2020
Resource Management in Wireless Networks via Multi-Agent Deep Reinforcement Learning
Navid Naderializadeh
J. Sydir
M. Simsek
Hosein Nikopour
10
121
0
14 Feb 2020
Resolving Spurious Correlations in Causal Models of Environments via Interventions
S. Volodin
Nevan Wichers
Jeremy Nixon
CML
9
16
0
12 Feb 2020
Provably Efficient Online Hyperparameter Optimization with Population-Based Bandits
Jack Parker-Holder
Vu Nguyen
Stephen J. Roberts
OffRL
67
83
0
06 Feb 2020
Neural MMO v1.3: A Massively Multiagent Game Environment for Training and Evaluating Neural Networks
Joseph Suárez
Yilun Du
Igor Mordatch
Phillip Isola
AI4CE
11
6
0
31 Jan 2020
Challenges and Countermeasures for Adversarial Attacks on Deep Reinforcement Learning
Inaam Ilahi
Muhammad Usama
Junaid Qadir
M. Janjua
Ala I. Al-Fuqaha
D. Hoang
Dusit Niyato
AAML
57
132
0
27 Jan 2020
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
23
188
0
23 Dec 2019
Analysing Deep Reinforcement Learning Agents Trained with Domain Randomisation
Tianhong Dai
Kai Arulkumaran
Tamara Gerbert
Samyakh Tukra
Feryal M. P. Behbahani
Anil Anthony Bharath
9
27
0
18 Dec 2019
Neural Network Surgery with Sets
Jonathan Raiman
Susan Zhang
Christy Dennison
11
4
0
13 Dec 2019
SquirRL: Automating Attack Analysis on Blockchain Incentive Mechanisms with Deep Reinforcement Learning
Charlie Hou
Mingxun Zhou
Yan Ji
Philip Daian
Florian Tramèr
Giulia Fanti
Ari Juels
AAML
10
30
0
04 Dec 2019
Accelerating Reinforcement Learning with Suboptimal Guidance
Eivind Bøhn
Signe Moe
T. Johansen
OnRL
14
0
0
21 Nov 2019
QuaRL: Quantization for Fast and Environmentally Sustainable Reinforcement Learning
Srivatsan Krishnan
Maximilian Lam
Sharad Chitlangia
Zishen Wan
Gabriel Barth-Maron
Aleksandra Faust
Vijay Janapa Reddi
MQ
13
22
0
02 Oct 2019
MoGlow: Probabilistic and controllable motion synthesis using normalising flows
G. Henter
Simon Alexanderson
Jonas Beskow
34
97
0
16 May 2019
Learn to Interpret Atari Agents
Zhao Yang
S. Bai
Li Zhang
Philip H. S. Torr
14
28
0
29 Dec 2018
Quantifying the probable approximation error of probabilistic inference programs
Marco F. Cusumano-Towner
Vikash K. Mansinghka
25
7
0
31 May 2016
Previous
1
2
3
...
18
19
20