1902.00506
The Hanabi Challenge: A New Frontier for AI Research
Artificial Intelligence (AI), 2019
1 February 2019
Nolan Bard
Jakob N. Foerster
A. Chandar
Neil Burch
Marc Lanctot
H. F. Song
Emilio Parisotto
Vincent Dumoulin
Subhodeep Moitra
Edward Hughes
Iain Dunning
Shibl Mourad
Hugo Larochelle
Marc G. Bellemare
Michael Bowling
LLMAG
Papers citing
"The Hanabi Challenge: A New Frontier for AI Research"
50 / 194 papers shown
Title
SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Benjamin Ellis
Jonathan Cook
S. Moalla
Mikayel Samvelyan
Mingfei Sun
Anuj Mahajan
Jakob N. Foerster
Shimon Whiteson
375
131
0
14 Dec 2022
Credit-cognisant reinforcement learning for multi-agent cooperation
F. Bredell
H. A. Engelbrecht
J. C. Schoeman
75
0
0
18 Nov 2022
Knowing the Past to Predict the Future: Reinforcement Virtual Learning
Peng Zhang
Yawen Huang
Bingzhang Hu
Shizheng Wang
Haoran Duan
Noura Al Moubayed
Yefeng Zheng
Yang Long
OffRL
136
1
0
02 Nov 2022
Coordination with Humans via Strategy Matching
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022
Michelle Zhao
Reid G. Simmons
H. Admoni
188
13
0
27 Oct 2022
Equivariant Networks for Zero-Shot Coordination
Neural Information Processing Systems (NeurIPS), 2022
Darius Muglich
Christian Schroeder de Witt
Elise van der Pol
Shimon Whiteson
Jakob N. Foerster
269
19
0
21 Oct 2022
Human-AI Coordination via Human-Regularized Search and Learning
Hengyuan Hu
David J. Wu
Adam Lerer
Jakob N. Foerster
Noam Brown
194
11
0
11 Oct 2022
MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library
Siyi Hu
Yifan Zhong
Minquan Gao
Weixun Wang
Hao Dong
Xiaodan Liang
Zhihui Li
Xiaojun Chang
Yaodong Yang
123
27
0
11 Oct 2022
Combining Theory of Mind and Abduction for Cooperation under Imperfect Information
European Workshop on Multi-Agent Systems (EUMAS), 2022
Nieves Montes
Nardine Osman
Carles Sierra
115
5
0
30 Sep 2022
Supervised and Reinforcement Learning from Observations in Reconnaissance Blind Chess
T. Bertram
Johannes Fürnkranz
Martin Müller
SSL
OnRL
243
8
0
03 Aug 2022
Mimetic Models: Ethical Implications of AI that Acts Like You
AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2022
Reid McIlroy-Young
Jon M. Kleinberg
S. Sen
Solon Barocas
Ashton Anderson
171
22
0
19 Jul 2022
K-level Reasoning for Zero-Shot Coordination in Hanabi
Neural Information Processing Systems (NeurIPS), 2022
Brandon Cui
Hengyuan Hu
Luis Pineda
Jakob N. Foerster
OffRL
LRM
142
40
0
14 Jul 2022
Self-Explaining Deviations for Coordination
Neural Information Processing Systems (NeurIPS), 2022
Hengyuan Hu
Samuel Sokota
David J. Wu
A. Bakhtin
Andrei Lupu
Brandon Cui
Jakob N. Foerster
166
2
0
13 Jul 2022
Generalized Beliefs for Cooperative AI
International Conference on Machine Learning (ICML), 2022
Darius Muglich
L. Zintgraf
Christian Schroeder de Witt
Shimon Whiteson
Jakob N. Foerster
181
9
0
26 Jun 2022
Nocturne: a scalable driving benchmark for bringing multi-agent learning one step closer to the real world
Neural Information Processing Systems (NeurIPS), 2022
Eugene Vinitsky
Nathan Lichtlé
Xiaomeng Yang
Brandon Amos
Jakob N. Foerster
OffRL
431
65
0
20 Jun 2022
Policy Optimization for Markov Games: Unified Framework and Faster Convergence
Neural Information Processing Systems (NeurIPS), 2022
Runyu Zhang
Qinghua Liu
Haiquan Wang
Caiming Xiong
Na Li
Yu Bai
354
30
0
06 Jun 2022
Policy Diagnosis via Measuring Role Diversity in Cooperative Multi-agent RL
International Conference on Machine Learning (ICML), 2022
Siyi Hu
Chuanlong Xie
Xiaodan Liang
Xiaojun Chang
144
26
0
01 Jun 2022
Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization
International Conference on Learning Representations (ICLR), 2022
Zihan Zhou
Wei Fu
Bingliang Zhang
Yi Wu
188
34
0
04 Apr 2022
Mind the gap: Challenges of deep learning approaches to Theory of Mind
Artificial Intelligence Review (Artif Intell Rev), 2022
Jaan Aru
Aqeel Labash
Oriol Corcoll
Raul Vicente
220
32
0
30 Mar 2022
Is Vanilla Policy Gradient Overlooked? Analyzing Deep Reinforcement Learning for Hanabi
Bram Grooten
Jelle Wemmenhove
Maurice Poot
J. Portegies
96
4
0
22 Mar 2022
Model-based Multi-agent Reinforcement Learning: Recent Progress and Prospects
Xihuai Wang
Zhicheng Zhang
Weinan Zhang
238
34
0
20 Mar 2022
On-the-fly Strategy Adaptation for ad-hoc Agent Coordination
Adaptive Agents and Multi-Agent Systems (AAMAS), 2022
Jaleh Zand
Jack Parker-Holder
Stephen J. Roberts
143
14
0
08 Mar 2022
Learning Robust Real-Time Cultural Transmission without Human Data
Cultural General Intelligence Team
Avishkar Bhoopchand
Bethanie Brownfield
Adrian Collister
Agustin Dal Lago
...
Alex Platonov
Evan Senter
Sukhdeep Singh
Alexander Zacherl
Lei M. Zhang
VLM
234
12
0
01 Mar 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
227
23
0
23 Feb 2022
Compute Trends Across Three Eras of Machine Learning
IEEE International Joint Conference on Neural Networks (IJCNN), 2022
J. Sevilla
Lennart Heim
A. Ho
T. Besiroglu
Marius Hobbhahn
Pablo Villalobos
477
352
0
11 Feb 2022
Learning Intuitive Policies Using Action Features
International Conference on Machine Learning (ICML), 2022
Mingwei Ma
Jizhou Liu
Samuel Sokota
Max Kleiman-Weiner
Jakob N. Foerster
247
4
0
29 Jan 2022
Any-Play: An Intrinsic Augmentation for Zero-Shot Coordination
Adaptive Agents and Multi-Agent Systems (AAMAS), 2022
Keane Lucas
R. Allen
183
33
0
28 Jan 2022
Conditional Imitation Learning for Multi-Agent Games
IEEE/ACM International Conference on Human-Robot Interaction (HRI), 2022
Andy Shih
Stefano Ermon
Dorsa Sadigh
210
14
0
05 Jan 2022
Towards Controllable Agent in MOBA Games with Generative Modeling
Shubao Zhang
125
0
0
15 Dec 2021
Modeling Strong and Human-Like Gameplay with KL-Regularized Search
Athul Paul Jacob
David J. Wu
Gabriele Farina
Adam Lerer
Hengyuan Hu
A. Bakhtin
Jacob Andreas
Noam Brown
221
60
0
14 Dec 2021
Student of Games: A unified learning algorithm for both perfect and imperfect information games
Science Advances (Sci Adv), 2021
Martin Schmid
Matej Moravčík
Neil Burch
Rudolf Kadlec
Josh Davidson
...
Marc Lanctot
G. Z. Holland
Elnaz Davoodi
Alden Christianson
Michael Bowling
250
28
0
06 Dec 2021
Reinforcement Learning on Human Decision Models for Uniquely Collaborative AI Teammates
Nicholas Kantack
114
2
0
18 Nov 2021
Variational Automatic Curriculum Learning for Sparse-Reward Cooperative Multi-Agent Problems
Jiayu Chen
Yuanxin Zhang
Yuanfan Xu
Huimin Ma
Huazhong Yang
Jiaming Song
Yu Wang
Yi Wu
VLM
DRL
184
39
0
08 Nov 2021
Learning to Cooperate with Unseen Agent via Meta-Reinforcement Learning
Adaptive Agents and Multi-Agent Systems (AAMAS), 2021
Rujikorn Charakorn
P. Manoonpong
Nat Dilokthanakul
159
6
0
05 Nov 2021
Instructive artificial intelligence (AI) for human training, assistance, and explainability
Nicholas Kantack
Nina Cohen
Nathan D. Bos
Corey Lowman
James Everett
Timothy Endres
104
4
0
02 Nov 2021
ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind
Yuan-Fang Wang
Fangwei Zhong
Jing Xu
Yizhou Wang
LLMAG
219
89
0
15 Oct 2021
Collaborating with Humans without Human Data
D. Strouse
Kevin R. McKee
M. Botvinick
Edward Hughes
Richard Everett
346
196
0
15 Oct 2021
Scalable Online Planning via Reinforcement Learning Fine-Tuning
Arnaud Fickinger
Hengyuan Hu
Brandon Amos
Stuart J. Russell
Noam Brown
225
23
0
30 Sep 2021
Two Approaches to Building Collaborative, Task-Oriented Dialog Agents through Self-Play
Arkady Arkhangorodsky
Scot Fang
Victoria F. Knight
Ajay Nagesh
Maria Ryskina
Kevin Knight
LLMAG
108
0
0
20 Sep 2021
Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi
Neural Information Processing Systems (NeurIPS), 2021
H. Siu
Jaime D. Peña
Edenna Chen
Yutai Zhou
Victor J. Lopez
Kyle Palko
K. Chang
R. Allen
183
61
0
15 Jul 2021
Centralized Model and Exploration Policy for Multi-Agent RL
Qizhen Zhang
Chris Xiaoxuan Lu
Animesh Garg
Jakob N. Foerster
152
19
0
14 Jul 2021
Cogment: Open Source Framework For Distributed Multi-actor Training, Deployment & Operations
AI Redefined
S. Gottipati
Sagar Kurandwad
Clodéric Mars
Gregory Szriftgiser
François Chabot
151
9
0
21 Jun 2021
Multi-Agent Curricula and Emergent Implicit Signaling
Adaptive Agents and Multi-Agent Systems (AAMAS), 2021
Niko A. Grupen
Daniel D. Lee
B. Selman
226
9
0
21 Jun 2021
Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Hengyuan Hu
Adam Lerer
Noam Brown
Jakob N. Foerster
243
21
0
16 Jun 2021
On the Critical Role of Conventions in Adaptive Human-AI Collaboration
International Conference on Learning Representations (ICLR), 2021
Andy Shih
Arjun Sawhney
J. Kondic
Stefano Ermon
Dorsa Sadigh
162
43
0
07 Apr 2021
Esports Agents with a Theory of Mind: Towards Better Engagement, Education, and Engineering
Murtuza N. Shergadwala
M. S. El-Nasr
98
7
0
08 Mar 2021
Off-Belief Learning
International Conference on Machine Learning (ICML), 2021
Hengyuan Hu
Adam Lerer
Brandon Cui
David J. Wu
Luis Pineda
Noam Brown
Jakob N. Foerster
OffRL
353
82
0
06 Mar 2021
Continuous Coordination As a Realistic Scenario for Lifelong Learning
International Conference on Machine Learning (ICML), 2021
Hadi Nekoei
Akilesh Badrinaaraayanan
Aaron Courville
Sarath Chandar
CLL
OffRL
151
50
0
04 Mar 2021
The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games
Neural Information Processing Systems (NeurIPS), 2021
Chao Yu
Akash Velu
Eugene Vinitsky
Jiaxuan Gao
Yu Wang
Alexandre M. Bayen
Yi Wu
OffRL
400
1,765
0
02 Mar 2021
Diverse Auto-Curriculum is Critical for Successful Real-World Multiagent Learning Systems
Adaptive Agents and Multi-Agent Systems (AAMAS), 2021
Yaodong Yang
Jun Luo
Ying Wen
Oliver Slumbers
D. Graves
H. Ammar
Jun Wang
Matthew E. Taylor
136
39
0
15 Feb 2021
Neural Recursive Belief States in Multi-Agent Reinforcement Learning
Pol Moreno
Edward Hughes
Kevin R. McKee
Bernardo Avila-Pires
T. Weber
122
28
0
03 Feb 2021