Title
Entropy is all you need for Inter-Seed Cross-Play in Hanabi Johannes Forkel Jakob Foerster 8 0 0 27 Nov 2025
Robust and Diverse Multi-Agent Learning via Rational Policy Gradient Niklas Lauffer Ameesh Shah Micah Carroll Sanjit A. Seshia Stuart J. Russell Michael Dennis AAML 56 1 0 12 Nov 2025
Multi-Agent Craftax: Benchmarking Open-Ended Multi-Agent Reinforcement Learning at the Hyperscale Bassel Al Omari Michael T. Matthews Alexander Rutherford Jakob N. Foerster 70 1 0 07 Nov 2025
Monopoly Deal: A Benchmark Environment for Bounded One-Sided Response Games Will Wolf 84 0 0 29 Oct 2025
HRM-Agent: Training a recurrent reasoning model in dynamic environments using reinforcement learning Long H Dang David Rawlinson LRM 105 0 0 26 Oct 2025
Knowledge and Common Knowledge of Strategies Borja Sierra Miranda Thomas Studer 52 0 0 22 Oct 2025
A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning Anjie Liu Jianhong Wang Samuel Kaski Jun Wang M. Yang 208 0 0 20 Oct 2025
Evaluating Language Models' Evaluations of Games Katherine M. Collins Cedegao E. Zhang Graham Todd Lance Ying Mauricio Barba da Costa ... Adrian Weller Ionatan Kuperwajs L. Wong Joshua B. Tenenbaum Thomas L. Griffiths ReLM ELM LRM 116 1 0 13 Oct 2025
AI Agents for the Dhumbal Card Game: A Comparative Study Sahaj Raj Malla 48 0 0 10 Oct 2025
The Heterogeneous Multi-Agent Challenge Charles Dansereau Junior-Samuel Lopez-Yepez Karthik Soma Antoine Fagette 76 1 0 23 Sep 2025
$K$ -Level Policy Gradients for Multi-Agent Reinforcement Learning Aryaman Reddi Gabriele Tiboni Jan Peters Carlo DÉramo 92 0 0 15 Sep 2025
The Yokai Learning Environment: Tracking Beliefs Over Space and Time Constantin Ruhdorfer Matteo Bortoletto Andreas Bulling 144 1 0 17 Aug 2025
Evolutionary Optimization of Deep Learning Agents for Sparrow Mahjong Jim O'Connor Derin Gezgin Gary B Parker 41 0 0 11 Aug 2025
Assistax: A Hardware-Accelerated Reinforcement Learning Benchmark for Assistive Robotics Leonard Hinckeldey Elliot Fosong Elle Miller Rimvydas Rubavicius Trevor A. McInroe Patricia Wollstadt Christiane B. Wiebel-Herboth Subramanian Ramamoorthy Stefano V. Albrecht 125 0 0 29 Jul 2025
Remembering the Markov Property in Cooperative MARL Kale-ab Abebe Tessera Leonard Hinckeldey Riccardo Zamboni David Abel Amos Storkey 147 0 0 24 Jul 2025
Moving Out: Physically-grounded Human-AI Collaboration Xuhui Kang Sung-Wook Lee Haolin Liu Yuyan Wang Yen-Ling Kuo 286 0 0 24 Jul 2025
Biological Pathway Guided Gene Selection Through Collaborative Reinforcement Learning Ehtesamul Azim Dongjie Wang Tae Hyun Hwang Yanjie Fu Wei-na Zhang 93 2 0 30 May 2025
Enabling Rapid Shared Human-AI Mental Model Alignment via the After-Action Review Edward Gu H. Siu Melanie Platt Isabelle Hurley Jaime D. Peña Rohan R. Paleja 147 2 0 25 Mar 2025
OvercookedV2: Rethinking Overcooked for Zero-Shot CoordinationInternational Conference on Learning Representations (ICLR), 2025 Tobias Gessler Tin Dizdarevic Ani Calinescu Benjamin Ellis Andrei Lupu Jakob Foerster 301 4 0 22 Mar 2025
Multi-Agent Actor-Critic with Harmonic Annealing Pruning for Dynamic Spectrum Access Systems George Stamatelis Angelos-Nikolaos Kanatas G. C. Alexandropoulos 151 1 0 19 Mar 2025
Don't lie to your friends: Learning what you know from collaborative self-play Jacob Eisenstein Reza Aghajani Adam Fisch Dheeru Dua Fantine Huot Mirella Lapata Vicky Zayats Jonathan Berant 356 5 0 18 Mar 2025
A Generalist Hanabi AgentInternational Conference on Learning Representations (ICLR), 2025 Arjun Vaithilingam Sudhakar Hadi Nekoei Mathieu Reymond Miao Liu Janarthanan Rajendran Sarath Chandar 889 1 0 17 Mar 2025
VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play Zelai Xu Chao Yu Chao Yu Huining Yuan Xiangmin Yi ... Wenhao Tang Yu Wang Wenbo Ding Xiusi Chen Yu Wang 954 1 0 04 Feb 2025
BALROG: Benchmarking Agentic LLM and VLM Reasoning On GamesInternational Conference on Learning Representations (ICLR), 2024 Davide Paglieri Bartłomiej Cupiał Samuel Coward Ulyana Piterbarg Maciej Wolczyk ... Lerrel Pinto Rob Fergus Jakob Foerster Jack Parker-Holder Tim Rocktaschel LLMAG LRM 518 65 0 20 Nov 2024
Noisy Zero-Shot Coordination: Breaking The Common Knowledge Assumption In Zero-Shot Coordination Games Usman Anwar Ashish Pandian Jia Wan David M. Krueger Jakob N. Foerster 259 0 0 07 Nov 2024
Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence ChallengeNeural Information Processing Systems (NeurIPS), 2024 Weihua Du Qiushi Lyu Jiaming Shan Zhenting Qi Hongxin Zhang ... Andi Peng Tianmin Shu Kwonjoon Lee Behzad Dariush Chuang Gan 362 9 0 04 Nov 2024
Learning to Coordinate without Communication under Incomplete Information Shenghui Chen Shufang Zhu Giuseppe De Giacomo Ufuk Topcu 297 0 0 19 Sep 2024
HOLA-Drone: Hypergraphic Open-ended Learning for Zero-Shot Multi-Drone Cooperative Pursuit Yang Li Dengyu Zhang Junfan Chen Ying Wen Qingrui Zhang Shaoshuai Mou Wei Pan 258 1 0 13 Sep 2024
Cooperative Decision-Making for CAVs at Unsignalized Intersections: A MARL Approach with Attention and Hierarchical Game Priors Jiaqi Liu Peng Hang Xiaoxiang Na Chao Huang Jian Sun 297 23 0 09 Sep 2024
Learning in Games with Progressive HidingAdaptive Agents and Multi-Agent Systems (AAMAS), 2024 Benjamin Heymann Marc Lanctot 231 0 0 05 Sep 2024
ELO-Rated Sequence Rewards: Advancing Reinforcement Learning Models Qi Ju Falin Hei Zhemei Fang Yunfeng Luo 396 1 0 05 Sep 2024
In-Context Exploiter for Extensive-Form Games Shuxin Li Chang Yang Youzhi Zhang Pengdeng Li Xinrun Wang Xiao Huang Hau Chan Bo An 209 0 0 10 Aug 2024
KnowPC: Knowledge-Driven Programmatic Reinforcement Learning for Zero-shot Coordination Yin Gu Qi Liu Zhi Li Kai Zhang 155 2 0 08 Aug 2024
LiteEFG: An Efficient Python Library for Solving Extensive-form Games Mingyang Liu Gabriele Farina Asuman Ozdaglar 147 2 0 29 Jul 2024
Simplifying Deep Temporal Difference Learning Matteo Gallici Mattie Fellows Benjamin Ellis B. Pou Ivan Masmitja Jakob Foerster Mario Martin OffRL 516 52 0 05 Jul 2024
Efficacy of Language Model Self-Play in Non-Zero-Sum Games Austen Liao Nicholas Tomlin Dan Klein 229 9 0 27 Jun 2024
The Overcooked Generalisation Challenge: Evaluating Cooperation with Novel Partners in Unknown Environments Using Unsupervised Environment Design Constantin Ruhdorfer Matteo Bortoletto Anna Penzkofer Andreas Bulling 358 8 0 25 Jun 2024
A Simple, Solid, and Reproducible Baseline for Bridge Bidding AI Haruka Kita Sotetsu Koyamada Yotaro Yamaguchi Shin Ishii 186 1 0 14 Jun 2024
Mini Honor of Kings: A Lightweight Environment for Multi-Agent Reinforcement Learning Lin Liu Jian Zhao Cheng Hu Zhengtao Cao Youpeng Zhao ... Wenjun Wang Zhaofeng He Houqiang Li Xia Lin Lanxiao Huang OffRL SyDa 192 0 0 06 Jun 2024
FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning Wenzhe Li Zihan Ding Seth Karten Chi Jin 312 9 0 04 Jun 2024
PyTAG: Tabletop Games for Multi-Agent Reinforcement Learning Martin Balla G. E. Long J. Goodman Raluca D. Gaina Diego Perez-Liebana OffRL GP 126 3 0 28 May 2024
Human-Agent Cooperation in Games under Incomplete Information through Natural Language CommunicationInternational Joint Conference on Artificial Intelligence (IJCAI), 2024 Shenghui Chen Daniel Fried Ufuk Topcu LLMAG 239 3 0 23 May 2024
Efficient Multi-agent Reinforcement Learning by Planning Qihan Liu Jianing Ye Xiaoteng Ma Jun Yang Bin Liang Chongjie Zhang 191 15 0 20 May 2024
Configurable Mirror Descent: Towards a Unification of Decision Making Pengdeng Li Shuxin Li Chang Yang Xinrun Wang Shuyue Hu Yi-Ju Chang Hau Chan Bo An 246 1 0 20 May 2024
A Design Trajectory Map of Human-AI Collaborative Reinforcement Learning Systems: Survey and Taxonomy Zhaoxing Li 156 2 0 16 May 2024
Harmonizing Human Insights and AI Precision: Hand in Hand for Advancing Knowledge Graph TaskIEEE International Conference on Systems, Man and Cybernetics (SMC), 2024 Shurong Wang Yufei Zhang Xuliang Huang Hongwei Wang 125 0 0 15 May 2024
Designing Skill-Compatible AI: Methodologies and Frameworks in ChessInternational Conference on Learning Representations (ICLR), 2024 Karim Hamade Reid McIlroy-Young Siddhartha Sen Jon M. Kleinberg Ashton Anderson 176 10 0 08 May 2024
Imitation Learning: A Survey of Learning Methods, Environments and Metrics Nathan Gavenski Odinaldo Rodrigues Michael Luck 255 131 0 30 Apr 2024
COMBO: Compositional World Models for Embodied Multi-Agent Cooperation Hongxin Zhang Zeyuan Wang Qiushi Lyu Zheyuan Zhang Sunli Chen Tianmin Shu Yilun Du Kwonjoon Lee Yilun Du Chuang Gan 373 33 0 16 Apr 2024
Laser Learning Environment: A new environment for coordination-critical multi-agent tasks Yannick Molinghen Raphael Avalos Mark Van Achter A. Nowé Tom Lenaerts 172 1 0 04 Apr 2024