Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1902.00506
Cited By
v1
v2 (latest)
The Hanabi Challenge: A New Frontier for AI Research
Artificial Intelligence (AI), 2019
1 February 2019
Nolan Bard
Jakob N. Foerster
A. Chandar
Neil Burch
Marc Lanctot
H. F. Song
Emilio Parisotto
Vincent Dumoulin
Subhodeep Moitra
Edward Hughes
Iain Dunning
Shibl Mourad
Hugo Larochelle
Marc G. Bellemare
Michael Bowling
LLMAG
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The Hanabi Challenge: A New Frontier for AI Research"
50 / 194 papers shown
Title
Entropy is all you need for Inter-Seed Cross-Play in Hanabi
Johannes Forkel
Jakob Foerster
8
0
0
27 Nov 2025
Robust and Diverse Multi-Agent Learning via Rational Policy Gradient
Niklas Lauffer
Ameesh Shah
Micah Carroll
Sanjit A. Seshia
Stuart J. Russell
Michael Dennis
AAML
56
1
0
12 Nov 2025
Multi-Agent Craftax: Benchmarking Open-Ended Multi-Agent Reinforcement Learning at the Hyperscale
Bassel Al Omari
Michael T. Matthews
Alexander Rutherford
Jakob N. Foerster
70
1
0
07 Nov 2025
Monopoly Deal: A Benchmark Environment for Bounded One-Sided Response Games
Will Wolf
84
0
0
29 Oct 2025
HRM-Agent: Training a recurrent reasoning model in dynamic environments using reinforcement learning
Long H Dang
David Rawlinson
LRM
105
0
0
26 Oct 2025
Knowledge and Common Knowledge of Strategies
Borja Sierra Miranda
Thomas Studer
52
0
0
22 Oct 2025
A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning
Anjie Liu
Jianhong Wang
Samuel Kaski
Jun Wang
M. Yang
208
0
0
20 Oct 2025
Evaluating Language Models' Evaluations of Games
Katherine M. Collins
Cedegao E. Zhang
Graham Todd
Lance Ying
Mauricio Barba da Costa
...
Adrian Weller
Ionatan Kuperwajs
L. Wong
Joshua B. Tenenbaum
Thomas L. Griffiths
ReLM
ELM
LRM
116
1
0
13 Oct 2025
AI Agents for the Dhumbal Card Game: A Comparative Study
Sahaj Raj Malla
48
0
0
10 Oct 2025
The Heterogeneous Multi-Agent Challenge
Charles Dansereau
Junior-Samuel Lopez-Yepez
Karthik Soma
Antoine Fagette
76
1
0
23 Sep 2025
K
K
K
-Level Policy Gradients for Multi-Agent Reinforcement Learning
Aryaman Reddi
Gabriele Tiboni
Jan Peters
Carlo DÉramo
92
0
0
15 Sep 2025
The Yokai Learning Environment: Tracking Beliefs Over Space and Time
Constantin Ruhdorfer
Matteo Bortoletto
Andreas Bulling
144
1
0
17 Aug 2025
Evolutionary Optimization of Deep Learning Agents for Sparrow Mahjong
Jim O'Connor
Derin Gezgin
Gary B Parker
41
0
0
11 Aug 2025
Assistax: A Hardware-Accelerated Reinforcement Learning Benchmark for Assistive Robotics
Leonard Hinckeldey
Elliot Fosong
Elle Miller
Rimvydas Rubavicius
Trevor A. McInroe
Patricia Wollstadt
Christiane B. Wiebel-Herboth
Subramanian Ramamoorthy
Stefano V. Albrecht
125
0
0
29 Jul 2025
Remembering the Markov Property in Cooperative MARL
Kale-ab Abebe Tessera
Leonard Hinckeldey
Riccardo Zamboni
David Abel
Amos Storkey
147
0
0
24 Jul 2025
Moving Out: Physically-grounded Human-AI Collaboration
Xuhui Kang
Sung-Wook Lee
Haolin Liu
Yuyan Wang
Yen-Ling Kuo
286
0
0
24 Jul 2025
Biological Pathway Guided Gene Selection Through Collaborative Reinforcement Learning
Ehtesamul Azim
Dongjie Wang
Tae Hyun Hwang
Yanjie Fu
Wei-na Zhang
93
2
0
30 May 2025
Enabling Rapid Shared Human-AI Mental Model Alignment via the After-Action Review
Edward Gu
H. Siu
Melanie Platt
Isabelle Hurley
Jaime D. Peña
Rohan R. Paleja
147
2
0
25 Mar 2025
OvercookedV2: Rethinking Overcooked for Zero-Shot Coordination
International Conference on Learning Representations (ICLR), 2025
Tobias Gessler
Tin Dizdarevic
Ani Calinescu
Benjamin Ellis
Andrei Lupu
Jakob Foerster
301
4
0
22 Mar 2025
Multi-Agent Actor-Critic with Harmonic Annealing Pruning for Dynamic Spectrum Access Systems
George Stamatelis
Angelos-Nikolaos Kanatas
G. C. Alexandropoulos
151
1
0
19 Mar 2025
Don't lie to your friends: Learning what you know from collaborative self-play
Jacob Eisenstein
Reza Aghajani
Adam Fisch
Dheeru Dua
Fantine Huot
Mirella Lapata
Vicky Zayats
Jonathan Berant
356
5
0
18 Mar 2025
A Generalist Hanabi Agent
International Conference on Learning Representations (ICLR), 2025
Arjun Vaithilingam Sudhakar
Hadi Nekoei
Mathieu Reymond
Miao Liu
Janarthanan Rajendran
Sarath Chandar
889
1
0
17 Mar 2025
VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play
Zelai Xu
Chao Yu
Chao Yu
Huining Yuan
Xiangmin Yi
...
Wenhao Tang
Yu Wang
Wenbo Ding
Xiusi Chen
Yu Wang
954
1
0
04 Feb 2025
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
International Conference on Learning Representations (ICLR), 2024
Davide Paglieri
Bartłomiej Cupiał
Samuel Coward
Ulyana Piterbarg
Maciej Wolczyk
...
Lerrel Pinto
Rob Fergus
Jakob Foerster
Jack Parker-Holder
Tim Rocktaschel
LLMAG
LRM
518
65
0
20 Nov 2024
Noisy Zero-Shot Coordination: Breaking The Common Knowledge Assumption In Zero-Shot Coordination Games
Usman Anwar
Ashish Pandian
Jia Wan
David M. Krueger
Jakob N. Foerster
259
0
0
07 Nov 2024
Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge
Neural Information Processing Systems (NeurIPS), 2024
Weihua Du
Qiushi Lyu
Jiaming Shan
Zhenting Qi
Hongxin Zhang
...
Andi Peng
Tianmin Shu
Kwonjoon Lee
Behzad Dariush
Chuang Gan
362
9
0
04 Nov 2024
Learning to Coordinate without Communication under Incomplete Information
Shenghui Chen
Shufang Zhu
Giuseppe De Giacomo
Ufuk Topcu
297
0
0
19 Sep 2024
HOLA-Drone: Hypergraphic Open-ended Learning for Zero-Shot Multi-Drone Cooperative Pursuit
Yang Li
Dengyu Zhang
Junfan Chen
Ying Wen
Qingrui Zhang
Shaoshuai Mou
Wei Pan
258
1
0
13 Sep 2024
Cooperative Decision-Making for CAVs at Unsignalized Intersections: A MARL Approach with Attention and Hierarchical Game Priors
Jiaqi Liu
Peng Hang
Xiaoxiang Na
Chao Huang
Jian Sun
297
23
0
09 Sep 2024
Learning in Games with Progressive Hiding
Adaptive Agents and Multi-Agent Systems (AAMAS), 2024
Benjamin Heymann
Marc Lanctot
231
0
0
05 Sep 2024
ELO-Rated Sequence Rewards: Advancing Reinforcement Learning Models
Qi Ju
Falin Hei
Zhemei Fang
Yunfeng Luo
396
1
0
05 Sep 2024
In-Context Exploiter for Extensive-Form Games
Shuxin Li
Chang Yang
Youzhi Zhang
Pengdeng Li
Xinrun Wang
Xiao Huang
Hau Chan
Bo An
209
0
0
10 Aug 2024
KnowPC: Knowledge-Driven Programmatic Reinforcement Learning for Zero-shot Coordination
Yin Gu
Qi Liu
Zhi Li
Kai Zhang
155
2
0
08 Aug 2024
LiteEFG: An Efficient Python Library for Solving Extensive-form Games
Mingyang Liu
Gabriele Farina
Asuman Ozdaglar
147
2
0
29 Jul 2024
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
516
52
0
05 Jul 2024
Efficacy of Language Model Self-Play in Non-Zero-Sum Games
Austen Liao
Nicholas Tomlin
Dan Klein
229
9
0
27 Jun 2024
The Overcooked Generalisation Challenge: Evaluating Cooperation with Novel Partners in Unknown Environments Using Unsupervised Environment Design
Constantin Ruhdorfer
Matteo Bortoletto
Anna Penzkofer
Andreas Bulling
358
8
0
25 Jun 2024
A Simple, Solid, and Reproducible Baseline for Bridge Bidding AI
Haruka Kita
Sotetsu Koyamada
Yotaro Yamaguchi
Shin Ishii
186
1
0
14 Jun 2024
Mini Honor of Kings: A Lightweight Environment for Multi-Agent Reinforcement Learning
Lin Liu
Jian Zhao
Cheng Hu
Zhengtao Cao
Youpeng Zhao
...
Wenjun Wang
Zhaofeng He
Houqiang Li
Xia Lin
Lanxiao Huang
OffRL
SyDa
192
0
0
06 Jun 2024
FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning
Wenzhe Li
Zihan Ding
Seth Karten
Chi Jin
312
9
0
04 Jun 2024
PyTAG: Tabletop Games for Multi-Agent Reinforcement Learning
Martin Balla
G. E. Long
J. Goodman
Raluca D. Gaina
Diego Perez-Liebana
OffRL
GP
126
3
0
28 May 2024
Human-Agent Cooperation in Games under Incomplete Information through Natural Language Communication
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Shenghui Chen
Daniel Fried
Ufuk Topcu
LLMAG
239
3
0
23 May 2024
Efficient Multi-agent Reinforcement Learning by Planning
Qihan Liu
Jianing Ye
Xiaoteng Ma
Jun Yang
Bin Liang
Chongjie Zhang
191
15
0
20 May 2024
Configurable Mirror Descent: Towards a Unification of Decision Making
Pengdeng Li
Shuxin Li
Chang Yang
Xinrun Wang
Shuyue Hu
Yi-Ju Chang
Hau Chan
Bo An
246
1
0
20 May 2024
A Design Trajectory Map of Human-AI Collaborative Reinforcement Learning Systems: Survey and Taxonomy
Zhaoxing Li
156
2
0
16 May 2024
Harmonizing Human Insights and AI Precision: Hand in Hand for Advancing Knowledge Graph Task
IEEE International Conference on Systems, Man and Cybernetics (SMC), 2024
Shurong Wang
Yufei Zhang
Xuliang Huang
Hongwei Wang
125
0
0
15 May 2024
Designing Skill-Compatible AI: Methodologies and Frameworks in Chess
International Conference on Learning Representations (ICLR), 2024
Karim Hamade
Reid McIlroy-Young
Siddhartha Sen
Jon M. Kleinberg
Ashton Anderson
176
10
0
08 May 2024
Imitation Learning: A Survey of Learning Methods, Environments and Metrics
Nathan Gavenski
Odinaldo Rodrigues
Michael Luck
255
131
0
30 Apr 2024
COMBO: Compositional World Models for Embodied Multi-Agent Cooperation
Hongxin Zhang
Zeyuan Wang
Qiushi Lyu
Zheyuan Zhang
Sunli Chen
Tianmin Shu
Yilun Du
Kwonjoon Lee
Yilun Du
Chuang Gan
373
33
0
16 Apr 2024
Laser Learning Environment: A new environment for coordination-critical multi-agent tasks
Yannick Molinghen
Raphael Avalos
Mark Van Achter
A. Nowé
Tom Lenaerts
172
1
0
04 Apr 2024
1
2
3
4
Next