Dota 2 with Large Scale Deep Reinforcement Learning

13 December 2019
OpenAI:
Christopher Berner
Greg Brockman
Brooke Chan
Vicki Cheung
Przemyslaw Debiak
Christy Dennison
David Farhi
Quirin Fischer
Shariq Hashme
Christopher Hesse
Rafal Jozefowicz
Scott Gray
Catherine Olsson
J. Pachocki
Michael Petrov
Henrique Pondé de Oliveira Pinto
Jonathan Raiman
Tim Salimans
Jeremy Schlatter
Jonas Schneider
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
    GNN
    VLM
    CLL
    AI4CE
    LRM

Papers citing "Dota 2 with Large Scale Deep Reinforcement Learning"

50 / 991 papers shown
Forward and inverse reinforcement learning sharing network weights and hyperparameters
E. Uchibe
Kenji Doya
11
18
0
17 Aug 2020
SuperSuit: Simple Microwrappers for Reinforcement Learning Environments
J. K. Terry
Benjamin Black
Ananth Hari
9
22
0
17 Aug 2020
Reinforcement Learning with Quantum Variational Circuits
Owen Lockwood
Mei Si
6
132
0
15 Aug 2020
Joint Policy Search for Multi-agent Collaboration with Imperfect Information
Yuandong Tian
Qucheng Gong
Tina Jiang
29
19
0
14 Aug 2020
OR-Gym: A Reinforcement Learning Library for Operations Research Problems
Christian D. Hubbs
Hector D. Perez
Owais Sarwar
N. Sahinidis
I. Grossmann
J. Wassick
OffRL
AI4CE
19
74
0
14 Aug 2020
Learning to Reason in Round-based Games: Multi-task Sequence Generation for Purchasing Decision Making in First-person Shooters
Yilei Zeng
Deren Lei
Beichen Li
Gangrong Jiang
Emilio Ferrara
M. Zyda
LRM
17
3
0
12 Aug 2020
Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey
Aske Plaat
W. Kosters
Mike Preuss
BDL
OffRL
13
17
0
11 Aug 2020
Queueing Network Controls via Deep Reinforcement Learning
J. Dai
Mark O. Gluzman
OffRL
19
50
0
31 Jul 2020
LEMMA: A Multi-view Dataset for Learning Multi-agent Multi-task Activities
Baoxiong Jia
Yixin Chen
Siyuan Huang
Yixin Zhu
Song-Chun Zhu
8
51
0
31 Jul 2020
Improving Multi-Agent Cooperation using Theory of Mind
Terence X. Lim
Sidney Tio
Desmond C. Ong
LLMAG
14
12
0
30 Jul 2020
Interpretable Contextual Team-aware Item Recommendation: Application in Multiplayer Online Battle Arena Games
Andrés Villa
Vladimir Araujo
Francisca Cattan
Denis Parra
8
18
0
30 Jul 2020
Combining Deep Reinforcement Learning and Search for Imperfect-Information Games
Noam Brown
A. Bakhtin
Adam Lerer
Qucheng Gong
13
133
0
27 Jul 2020
Off-Policy Multi-Agent Decomposed Policy Gradients
Yihan Wang
Beining Han
Tonghan Wang
Heng Dong
Chongjie Zhang
17
174
0
24 Jul 2020
WordCraft: An Environment for Benchmarking Commonsense Agents
Minqi Jiang
Jelena Luketina
Nantas Nardelli
Pasquale Minervini
Philip H. S. Torr
Shimon Whiteson
Tim Rocktäschel
LLMAG
OffRL
6
22
0
17 Jul 2020
Co-generation of game levels and game-playing agents
Aaron Dharna
Julian Togelius
Lisa Soros
11
17
0
16 Jul 2020
Distributed Reinforcement Learning of Targeted Grasping with Active Vision for Mobile Manipulators
Yasuhiro Fujita
Kota Uenishi
Avinash Ummadisingu
P. Nagarajan
Shimpei Masuda
M. Castro
19
18
0
16 Jul 2020
Single-partition adaptive Q-learning
J. Araújo
Mário A. T. Figueiredo
M. Botto
OffRL
18
2
0
14 Jul 2020
Data-Efficient Reinforcement Learning with Self-Predictive Representations
Max Schwarzer
Ankesh Anand
Rishab Goel
R. Devon Hjelm
Aaron Courville
Philip Bachman
19
308
0
12 Jul 2020
Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State Entropy Estimate
Mirco Mutti
Lorenzo Pratissoli
Marcello Restelli
6
19
0
09 Jul 2020
Deep Reinforcement Learning and its Neuroscientific Implications
M. Botvinick
Jane X. Wang
Will Dabney
Kevin J. Miller
Z. Kurth-Nelson
OffRL
AI4CE
18
168
0
07 Jul 2020
Strong Generalization and Efficiency in Neural Programs
Yujia Li
Felix Gimeno
Pushmeet Kohli
Oriol Vinyals
17
17
0
07 Jul 2020
The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning
H. V. Seijen
Hadi Nekoei
Evan Racah
A. Chandar
OffRL
8
13
0
07 Jul 2020
Meta-Learning through Hebbian Plasticity in Random Networks
Elias Najarro
S. Risi
20
76
0
06 Jul 2020
Towards Game-Playing AI Benchmarks via Performance Reporting Standards
Vanessa Volz
B. Naujoks
9
4
0
06 Jul 2020
An Autonomous Free Airspace En-route Controller using Deep Reinforcement Learning Techniques
Joris Mollinga
H. V. Hoof
17
15
0
03 Jul 2020
Continual Learning: Tackling Catastrophic Forgetting in Deep Neural Networks with Replay Processes
Timothée Lesort
CLL
8
21
0
01 Jul 2020
Object-Centric Learning with Slot Attention
Francesco Locatello
Dirk Weissenborn
Thomas Unterthiner
Aravindh Mahendran
G. Heigold
Jakob Uszkoreit
Alexey Dosovitskiy
Thomas Kipf
OCL
35
817
0
26 Jun 2020
SOAC: The Soft Option Actor-Critic Architecture
Chenghao Li
Xiaoteng Ma
Chongjie Zhang
Jun Yang
L. Xia
Qianchuan Zhao
12
6
0
25 Jun 2020
A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
Shengyi Huang
Santiago Ontañón
14
307
0
25 Jun 2020
Reinforcement Learning and its Connections with Neuroscience and Psychology
Ajay Subramanian
Sharad Chitlangia
V. Baths
OffRL
8
28
0
25 Jun 2020
RL Unplugged: A Suite of Benchmarks for Offline Reinforcement Learning
Çağlar Gülçehre
Ziyun Wang
Alexander Novikov
T. Paine
Sergio Gomez Colmenarejo
...
Matthew W. Hoffman
Ofir Nachum
George Tucker
N. Heess
Nando de Freitas
OffRL
19
71
0
24 Jun 2020
The NetHack Learning Environment
Heinrich Küttler
Nantas Nardelli
Alexander H. Miller
Roberta Raileanu
Marco Selvatici
Edward Grefenstette
Tim Rocktäschel
12
177
0
24 Jun 2020
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning
Aleksei Petrenko
Zhehui Huang
T. Kumar
Gaurav Sukhatme
V. Koltun
8
99
0
21 Jun 2020
Automatic Curriculum Learning through Value Disagreement
Yunzhi Zhang
Pieter Abbeel
Lerrel Pinto
17
103
0
17 Jun 2020
Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
Stephen Marcus McAleer
John Lanier
Roy Fox
Pierre Baldi
16
76
0
15 Jun 2020
Learning to Incentivize Other Learning Agents
Jiachen Yang
Ang Li
Mehrdad Farajtabar
P. Sunehag
Edward Hughes
H. Zha
10
67
0
10 Jun 2020
Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Thomas W. Anthony
Tom Eccles
Andrea Tacchetti
János Kramár
I. Gemp
...
Richard Everett
Roman Werpachowski
Satinder Singh
T. Graepel
Yoram Bachrach
11
42
0
08 Jun 2020
A Comparison of Self-Play Algorithms Under a Generalized Framework
Daniel Hernández
Kevin Denamganai
Sam Devlin
Spyridon Samothrakis
James Alfred Walker
11
12
0
08 Jun 2020
A Decentralized Policy Gradient Approach to Multi-task Reinforcement Learning
Sihan Zeng
Aqeel Anwar
Thinh T. Doan
A. Raychowdhury
J. Romberg
11
38
0
08 Jun 2020
Reinforcement Learning for Multi-Product Multi-Node Inventory Management in Supply Chains
Nazneen N. Sultana
Hardik Meisheri
Vinita Baniwal
Somjit Nath
Balaraman Ravindran
H. Khadilkar
9
22
0
07 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
44
225
0
01 Jun 2020
Time-Variant Variational Transfer for Value Functions
Giuseppe Canonaco
Andrea Soprani
M. Roveri
Marcello Restelli
OOD
8
0
0
26 May 2020
Policy Entropy for Out-of-Distribution Classification
Andreas Sedlmeier
Robert Muller
Steffen Illium
Claudia Linnhoff-Popien
OODD
OffRL
13
14
0
25 May 2020
Novel Policy Seeking with Constrained Optimization
Hao Sun
Zhenghao Peng
Bo Dai
Jian Guo
Dahua Lin
Bolei Zhou
9
13
0
21 May 2020
A Recipe for Creating Multimodal Aligned Datasets for Sequential Tasks
Angela S. Lin
Sudha Rao
Asli Celikyilmaz
E. Nouri
Chris Brockett
Debadeepta Dey
Bill Dolan
12
24
0
19 May 2020
Deep Learning: Our Miraculous Year 1990-1991
J. Schmidhuber
3DGS
MedIm
6
6
0
12 May 2020
Measuring the Algorithmic Efficiency of Neural Networks
Danny Hernandez
Tom B. Brown
233
94
0
08 May 2020
Navigating the Landscape of Multiplayer Games
Shayegan Omidshafiei
K. Tuyls
Wojciech M. Czarnecki
Francisco C. Santos
Mark Rowland
...
Paul Muller
Julien Perolat
Bart De Vylder
A. Gruslys
Rémi Munos
11
2
0
04 May 2020
Off-the-shelf deep learning is not enough: parsimony, Bayes and causality
Rama K Vasudevan
M. Ziatdinov
L. Vlček
Sergei V. Kalinin
BDL
CML
AI4CE
6
0
0
04 May 2020
Differentially Private Federated Learning with Laplacian Smoothing
Zhicong Liang
Bao Wang
Quanquan Gu
Stanley Osher
Yuan Yao
FedML
12
7
0
01 May 2020