
Human-level performance in first-person multiplayer games with population-based deep reinforcement learning

3 July 2018
Max Jaderberg
Wojciech M. Czarnecki
Iain Dunning
Luke Marris
Guy Lever
Antonio García Castañeda
Charlie Beattie
Neil C. Rabinowitz
Ari S. Morcos
Avraham Ruderman
Nicolas Sonnerat
Tim Green
Louise Deason
Joel Z. Leibo
David Silver
Demis Hassabis
Koray Kavukcuoglu
T. Graepel
    OffRL
arXiv: 1807.01281

Papers citing "Human-level performance in first-person multiplayer games with population-based deep reinforcement learning"

50 / 115 papers shown
Learning Robust Real-Time Cultural Transmission without Human Data
Cultural General Intelligence Team
Avishkar Bhoopchand
Bethanie Brownfield
Adrian Collister
Agustin Dal Lago
...
Alex Platonov
Evan Senter
Sukhdeep Singh
Alexander Zacherl
Lei M. Zhang
VLM
40
11
0
01 Mar 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
34
9
0
23 Feb 2022
Bayesian sense of time in biological and artificial brains
Z. Fountas
Alexey Zakharov
29
0
0
14 Jan 2022
Direct Mutation and Crossover in Genetic Algorithms Applied to Reinforcement Learning Tasks
Tarek Faycal
Claudio Zito
9
2
0
13 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
30
100
0
11 Jan 2022
The Partially Observable Asynchronous Multi-Agent Cooperation Challenge
Meng Yao
Qiyue Yin
Jun Yang
Tongtong Yu
S. Shen
Junge Zhang
Bin Liang
Kaiqi Huang
19
5
0
07 Dec 2021
Learning to Cooperate with Unseen Agent via Meta-Reinforcement Learning
Rujikorn Charakorn
P. Manoonpong
Nat Dilokthanakul
25
5
0
05 Nov 2021
Collaborating with Humans without Human Data
D. Strouse
Kevin R. McKee
M. Botvinick
Edward Hughes
Richard Everett
122
160
0
15 Oct 2021
Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity
M. Garnelo
Wojciech M. Czarnecki
Siqi Liu
Dhruva Tirumala
Junhyuk Oh
Gauthier Gidel
H. V. Hasselt
David Balduzzi
24
25
0
08 Oct 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey
Amjad Yousef Majid
Serge Saaybi
Tomas van Rietbergen
Vincent François-Lavet
R. V. Prasad
Chris Verhoeven
OffRL
54
54
0
28 Sep 2021
ROMAX: Certifiably Robust Deep Multiagent Reinforcement Learning via Convex Relaxation
Chuangchuang Sun
Dong-Ki Kim
Jonathan P. How
AAML
31
18
0
14 Sep 2021
Open-Ended Learning Leads to Generally Capable Agents
Open-Ended Learning Team
Adam Stooke
Anuj Mahajan
Catarina Barros
Charlie Deck
...
Nicolas Porcel
Roberta Raileanu
Steph Hughes-Fitt
Valentin Dalibard
Wojciech M. Czarnecki
26
181
0
27 Jul 2021
Recent Advances in Leveraging Human Guidance for Sequential Decision-Making Tasks
Ruohan Zhang
F. Torabi
Garrett A. Warnell
Peter Stone
73
28
0
13 Jul 2021
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
25
6
0
07 Jul 2021
Continuous Control with Deep Reinforcement Learning for Autonomous Vessels
Nader Zare
Bruno Brandoli
Mahtab Sarvmaili
Amílcar Soares
Stan Matwin
11
8
0
27 Jun 2021
Brax -- A Differentiable Physics Engine for Large Scale Rigid Body Simulation
C. Freeman
Erik Frey
Anton Raichuk
Sertan Girgin
Igor Mordatch
Olivier Bachem
11
348
0
24 Jun 2021
Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers
Luke Marris
Paul Muller
Marc Lanctot
K. Tuyls
T. Graepel
35
36
0
17 Jun 2021
Counter-Strike Deathmatch with Large-Scale Behavioural Cloning
Tim Pearce
Jun Zhu
25
43
0
09 Apr 2021
Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World
Florian Laurent
Manuel Schneider
Christian Scheller
J. Watson
Jiaoyang Li
...
Nilabha Bhattacharya
Shivam Agarwal
A. Egli
Erik Nygren
Sharada Mohanty
31
28
0
30 Mar 2021
Modelling Behavioural Diversity for Learning in Open-Ended Games
Nicolas Perez Nieves
Yaodong Yang
Oliver Slumbers
D. Mguni
Ying Wen
Jun Wang
17
67
0
14 Mar 2021
Esports Agents with a Theory of Mind: Towards Better Engagement, Education, and Engineering
Murtuza N. Shergadwala
M. S. El-Nasr
9
7
0
08 Mar 2021
Credit Assignment with Meta-Policy Gradient for Multi-Agent Reinforcement Learning
Jianzhun Shao
Hongchang Zhang
Yuhang Jiang
Shuncheng He
Xiangyang Ji
24
5
0
24 Feb 2021
Training Learned Optimizers with Randomly Initialized Learned Optimizers
Luke Metz
C. Freeman
Niru Maheswaranathan
Jascha Narain Sohl-Dickstein
41
12
0
14 Jan 2021
Adaptive Synthetic Characters for Military Training
Volkan Ustun
Rajay Kumar
Adam Reilly
Seyed Sajjadi
Andrew Miller
AI4CE
13
9
0
06 Jan 2021
Which Heroes to Pick? Learning to Draft in MOBA Games with Neural Networks and Tree Search
Sheng Chen
Menghui Zhu
Deheng Ye
Weinan Zhang
Qiang Fu
Wei Yang
19
29
0
18 Dec 2020
Open Problems in Cooperative AI
Allan Dafoe
Edward Hughes
Yoram Bachrach
Tantum Collins
Kevin R. McKee
Joel Z. Leibo
Kate Larson
T. Graepel
24
199
0
15 Dec 2020
Robust Multi-Agent Reinforcement Learning with Social Empowerment for Coordination and Communication
T. V. D. Heiden
Christoph Salge
E. Gavves
H. V. Hoof
11
9
0
15 Dec 2020
Applied Machine Learning for Games: A Graduate School Course
Yilei Zeng
Aayush Shah
Jameson Thai
M. Zyda
AI4CE
9
3
0
30 Nov 2020
TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning
Peng Sun
Jiechao Xiong
Lei Han
Xinghai Sun
Shuxing Li
Jiawei Xu
Meng Fang
Zhengyou Zhang
OffRL
LRM
25
19
0
25 Nov 2020
Supervised Learning Achieves Human-Level Performance in MOBA Games: A Case Study of Honor of Kings
Deheng Ye
Guibin Chen
P. Zhao
Fuhao Qiu
Bo Yuan
...
Liang Wang
Tengfei Shi
Qiang Fu
Wei Yang
Lanxiao Huang
26
48
0
25 Nov 2020
Deep Neural Networks using a Single Neuron: Folded-in-Time Architecture using Feedback-Modulated Delay Loops
Florian Stelzer
André Röhm
Raul Vicente
Ingo Fischer
University of Tartu
AI4CE
9
46
0
19 Nov 2020
Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences
Bowen Baker
LRM
13
33
0
10 Nov 2020
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping
Yujing Hu
Weixun Wang
Hangtian Jia
Yixiang Wang
Yingfeng Chen
Jianye Hao
Feng Wu
Changjie Fan
OffRL
4
173
0
05 Nov 2020
Meta-trained agents implement Bayes-optimal agents
Vladimir Mikulik
Grégoire Delétang
Tom McGrath
Tim Genewein
Miljan Martic
Shane Legg
Pedro A. Ortega
OOD
FedML
27
40
0
21 Oct 2020
RODE: Learning Roles to Decompose Multi-Agent Tasks
Tonghan Wang
Tarun Gupta
Anuj Mahajan
Bei Peng
Shimon Whiteson
Chongjie Zhang
OffRL
16
202
0
04 Oct 2020
Learning to Play against Any Mixture of Opponents
Max O. Smith
Thomas W. Anthony
Yongzhao Wang
Michael P. Wellman
OffRL
19
9
0
29 Sep 2020
Off-Policy Multi-Agent Decomposed Policy Gradients
Yihan Wang
Beining Han
Tonghan Wang
Heng Dong
Chongjie Zhang
25
174
0
24 Jul 2020
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Adam Stooke
Joshua Achiam
Pieter Abbeel
14
286
0
08 Jul 2020
Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Thomas W. Anthony
Tom Eccles
Andrea Tacchetti
János Kramár
I. Gemp
...
Richard Everett
Roman Werpachowski
Satinder Singh
T. Graepel
Yoram Bachrach
11
42
0
08 Jun 2020
The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies
Stephan Zheng
Alexander R. Trott
Sunil Srinivasa
Nikhil Naik
Melvin Gruesbeck
David C. Parkes
R. Socher
23
131
0
28 Apr 2020
Meta-Learning in Neural Networks: A Survey
Timothy M. Hospedales
Antreas Antoniou
P. Micaelli
Amos Storkey
OOD
38
1,927
0
11 Apr 2020
How Do You Act? An Empirical Study to Understand Behavior of Deep Reinforcement Learning Agents
Richard Meyes
Moritz Schneider
Tobias Meisen
18
2
0
07 Apr 2020
Fiber: A Platform for Efficient Development and Distributed Training for Reinforcement Learning and Population-Based Methods
Jiale Zhi
Rui Wang
Jeff Clune
Kenneth O. Stanley
OffRL
12
12
0
25 Mar 2020
Decentralized MCTS via Learned Teammate Models
A. Czechowski
F. Oliehoek
165
19
0
19 Mar 2020
FormulaZero: Distributionally Robust Online Adaptation via Offline Population Synthesis
Aman Sinha
Matthew O'Kelly
Hongrui Zheng
Rahul Mangharam
John C. Duchi
Russ Tedrake
OffRL
66
26
0
09 Mar 2020
Computer-inspired Quantum Experiments
Mario Krenn
Manuel Erhard
A. Zeilinger
11
73
0
23 Feb 2020
Social diversity and social preferences in mixed-motive reinforcement learning
Kevin R. McKee
I. Gemp
Brian McWilliams
Edgar A. Duénez-Guzmán
Edward Hughes
Joel Z. Leibo
12
80
0
06 Feb 2020
Variational Recurrent Models for Solving Partially Observable Control Tasks
Dongqi Han
Kenji Doya
Jun Tani
DRL
OffRL
10
59
0
23 Dec 2019
Predictive Coding for Boosting Deep Reinforcement Learning with Sparse Rewards
Xingyu Lu
Stas Tiomkin
Pieter Abbeel
OffRL
25
3
0
21 Dec 2019
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
26
1,789
0
13 Dec 2019