Title
SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022 Benjamin Ellis Jonathan Cook S. Moalla Mikayel Samvelyan Mingfei Sun Anuj Mahajan Jakob N. Foerster Shimon Whiteson 375 131 0 14 Dec 2022
Credit-cognisant reinforcement learning for multi-agent cooperation F. Bredell S. M. I. H. A. Engelbrecht M. I. J. C. Schoeman 75 0 0 18 Nov 2022
Knowing the Past to Predict the Future: Reinforcement Virtual Learning Peng Zhang Yawen Huang Bingzhang Hu Shizheng Wang Haoran Duan Noura Al Moubayed Yefeng Zheng Yang Long OffRL 136 1 0 02 Nov 2022
Coordination with Humans via Strategy MatchingIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2022 Michelle Zhao Reid G. Simmons H. Admoni 188 13 0 27 Oct 2022
Equivariant Networks for Zero-Shot CoordinationNeural Information Processing Systems (NeurIPS), 2022 Darius Muglich Christian Schroeder de Witt Elise van der Pol Shimon Whiteson Jakob N. Foerster 269 19 0 21 Oct 2022
Human-AI Coordination via Human-Regularized Search and Learning Hengyuan Hu David J. Wu Adam Lerer Jakob N. Foerster Noam Brown 194 11 0 11 Oct 2022
MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library Siyi Hu Yifan Zhong Minquan Gao Weixun Wang Hao Dong Xiaodan Liang Zhihui Li Xiaojun Chang Yaodong Yang 123 27 0 11 Oct 2022
Combining Theory of Mind and Abduction for Cooperation under Imperfect InformationEuropean Workshop on Multi-Agent Systems (EUMAS), 2022 Nieves Montes Nardine Osman Carles Sierra 115 5 0 30 Sep 2022
Supervised and Reinforcement Learning from Observations in Reconnaissance Blind Chess T. Bertram Johannes Furnkranz Martin Müller SSL OnRL 243 8 0 03 Aug 2022
Mimetic Models: Ethical Implications of AI that Acts Like YouAAAI/ACM Conference on AI, Ethics, and Society (AIES), 2022 Reid McIlroy-Young Jon M. Kleinberg S. Sen Solon Barocas Ashton Anderson 171 22 0 19 Jul 2022
K-level Reasoning for Zero-Shot Coordination in HanabiNeural Information Processing Systems (NeurIPS), 2022 Brandon Cui Hengyuan Hu Luis Pineda Jakob N. Foerster OffRL LRM 142 40 0 14 Jul 2022
Self-Explaining Deviations for CoordinationNeural Information Processing Systems (NeurIPS), 2022 Hengyuan Hu Samuel Sokota David J. Wu A. Bakhtin Andrei Lupu Brandon Cui Jakob N. Foerster 166 2 0 13 Jul 2022
Generalized Beliefs for Cooperative AIInternational Conference on Machine Learning (ICML), 2022 Darius Muglich L. Zintgraf Christian Schroeder de Witt Shimon Whiteson Jakob N. Foerster 181 9 0 26 Jun 2022
Nocturne: a scalable driving benchmark for bringing multi-agent learning one step closer to the real worldNeural Information Processing Systems (NeurIPS), 2022 Eugene Vinitsky Nathan Lichtlé Xiaomeng Yang Brandon Amos Jakob N. Foerster OffRL 431 65 0 20 Jun 2022
Policy Optimization for Markov Games: Unified Framework and Faster ConvergenceNeural Information Processing Systems (NeurIPS), 2022 Runyu Zhang Qinghua Liu Haiquan Wang Caiming Xiong Na Li Yu Bai 354 30 0 06 Jun 2022
Policy Diagnosis via Measuring Role Diversity in Cooperative Multi-agent RLInternational Conference on Machine Learning (ICML), 2022 Siyi Hu Chuanlong Xie Xiaodan Liang Xiaojun Chang 144 26 0 01 Jun 2022
Continuously Discovering Novel Strategies via Reward-Switching Policy OptimizationInternational Conference on Learning Representations (ICLR), 2022 Zihan Zhou Wei Fu Bingliang Zhang Yi Wu 188 34 0 04 Apr 2022
Mind the gap: Challenges of deep learning approaches to Theory of MindArtificial Intelligence Review (Artif Intell Rev), 2022 Jaan Aru Aqeel Labash Oriol Corcoll Raul Vicente 220 32 0 30 Mar 2022
Is Vanilla Policy Gradient Overlooked? Analyzing Deep Reinforcement Learning for Hanabi Bram Grooten Jelle Wemmenhove Maurice Poot J. Portegies 96 4 0 22 Mar 2022
Model-based Multi-agent Reinforcement Learning: Recent Progress and Prospects Xihuai Wang Zhicheng Zhang Weinan Zhang 238 34 0 20 Mar 2022
On-the-fly Strategy Adaptation for ad-hoc Agent CoordinationAdaptive Agents and Multi-Agent Systems (AAMAS), 2022 Jaleh Zand Jack Parker-Holder Stephen J. Roberts 143 14 0 08 Mar 2022
Learning Robust Real-Time Cultural Transmission without Human Data Cultural General Intelligence Team Avishkar Bhoopchand Bethanie Brownfield Adrian Collister Agustin Dal Lago ... Alex Platonov Evan Senter Sukhdeep Singh Alexander Zacherl Lei M. Zhang VLM 234 12 0 01 Mar 2022
Reinforcement Learning in Practice: Opportunities and Challenges Yuxi Li OffRL 227 23 0 23 Feb 2022
Compute Trends Across Three Eras of Machine LearningIEEE International Joint Conference on Neural Network (IJCNN), 2022 J. Sevilla Lennart Heim A. Ho T. Besiroglu Marius Hobbhahn Pablo Villalobos 477 352 0 11 Feb 2022
Learning Intuitive Policies Using Action FeaturesInternational Conference on Machine Learning (ICML), 2022 Mingwei Ma Jizhou Liu Samuel Sokota Max Kleiman-Weiner Jakob N. Foerster 247 4 0 29 Jan 2022
Any-Play: An Intrinsic Augmentation for Zero-Shot CoordinationAdaptive Agents and Multi-Agent Systems (AAMAS), 2022 Keane Lucas R. Allen 183 33 0 28 Jan 2022
Conditional Imitation Learning for Multi-Agent GamesIEEE/ACM International Conference on Human-Robot Interaction (HRI), 2022 Andy Shih Stefano Ermon Dorsa Sadigh 210 14 0 05 Jan 2022
Towards Controllable Agent in MOBA Games with Generative Modeling Shubao Zhang 125 0 0 15 Dec 2021
Modeling Strong and Human-Like Gameplay with KL-Regularized Search Athul Paul Jacob David J. Wu Gabriele Farina Adam Lerer Hengyuan Hu A. Bakhtin Jacob Andreas Noam Brown 221 60 0 14 Dec 2021
Student of Games: A unified learning algorithm for both perfect and imperfect information gamesScience Advances (Sci Adv), 2021 Martin Schmid Matej Moravcík Neil Burch Rudolf Kadlec Josh Davidson ... Marc Lanctot G. Z. Holland Elnaz Davoodi Alden Christianson Michael Bowling 250 28 0 06 Dec 2021
Reinforcement Learning on Human Decision Models for Uniquely Collaborative AI Teammates Nicholas Kantack 114 2 0 18 Nov 2021
Variational Automatic Curriculum Learning for Sparse-Reward Cooperative Multi-Agent Problems Jiayu Chen Yuanxin Zhang Yuanfan Xu Huimin Ma Huazhong Yang Jiaming Song Yu Wang Yi Wu VLM DRL 184 39 0 08 Nov 2021
Learning to Cooperate with Unseen Agent via Meta-Reinforcement LearningAdaptive Agents and Multi-Agent Systems (AAMAS), 2021 Rujikorn Charakorn P. Manoonpong Nat Dilokthanakul 159 6 0 05 Nov 2021
Instructive artificial intelligence (AI) for human training, assistance, and explainability Nicholas Kantack Nina Cohen Nathan D. Bos Corey Lowman James Everett Timothy Endres 104 4 0 02 Nov 2021
ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind Yuan-Fang Wang Fangwei Zhong Jing Xu Yizhou Wang LLMAG 219 89 0 15 Oct 2021
Collaborating with Humans without Human Data D. Strouse Kevin R. McKee M. Botvinick Edward Hughes Richard Everett 346 196 0 15 Oct 2021
Scalable Online Planning via Reinforcement Learning Fine-Tuning Arnaud Fickinger Hengyuan Hu Brandon Amos Stuart J. Russell Noam Brown 225 23 0 30 Sep 2021
Two Approaches to Building Collaborative, Task-Oriented Dialog Agents through Self-Play Arkady Arkhangorodsky Scot Fang Victoria F. Knight Ajay Nagesh Maria Ryskina Kevin Knight LLMAG 108 0 0 20 Sep 2021
Evaluation of Human-AI Teams for Learned and Rule-Based Agents in HanabiNeural Information Processing Systems (NeurIPS), 2021 H. Siu Jaime D. Peña Edenna Chen Yutai Zhou Victor J. Lopez Kyle Palko K. Chang R. Allen 183 61 0 15 Jul 2021
Centralized Model and Exploration Policy for Multi-Agent RL Qizhen Zhang Chris Xiaoxuan Lu Animesh Garg Jakob N. Foerster 152 19 0 14 Jul 2021
Cogment: Open Source Framework For Distributed Multi-actor Training, Deployment & Operations AI Redefined S. Gottipati Sagar Kurandwad Clodéric Mars Gregory Szriftgiser Franccois Chabot 151 9 0 21 Jun 2021
Multi-Agent Curricula and Emergent Implicit SignalingAdaptive Agents and Multi-Agent Systems (AAMAS), 2021 Niko A. Grupen Daniel D. Lee B. Selman 226 9 0 21 Jun 2021
Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings Hengyuan Hu Adam Lerer Noam Brown Jakob N. Foerster 243 21 0 16 Jun 2021
On the Critical Role of Conventions in Adaptive Human-AI CollaborationInternational Conference on Learning Representations (ICLR), 2021 Andy Shih Arjun Sawhney J. Kondic Stefano Ermon Dorsa Sadigh 162 43 0 07 Apr 2021
Esports Agents with a Theory of Mind: Towards Better Engagement, Education, and Engineering Murtuza N. Shergadwala M. S. El-Nasr 98 7 0 08 Mar 2021
Off-Belief LearningInternational Conference on Machine Learning (ICML), 2021 Hengyuan Hu Adam Lerer Brandon Cui David J. Wu Luis Pineda Noam Brown Jakob N. Foerster OffRL 353 82 0 06 Mar 2021
Continuous Coordination As a Realistic Scenario for Lifelong LearningInternational Conference on Machine Learning (ICML), 2021 Hadi Nekoei Akilesh Badrinaaraayanan Aaron Courville Sarath Chandar CLL OffRL 151 50 0 04 Mar 2021
The Surprising Effectiveness of PPO in Cooperative, Multi-Agent GamesNeural Information Processing Systems (NeurIPS), 2021 Chao Yu Akash Velu Eugene Vinitsky Jiaxuan Gao Yu Wang Alexandre M. Bayen Yi Wu OffRL 400 1,765 0 02 Mar 2021
Diverse Auto-Curriculum is Critical for Successful Real-World Multiagent Learning SystemsAdaptive Agents and Multi-Agent Systems (AAMAS), 2021 Yaodong Yang Jun Luo Ying Wen Oliver Slumbers D. Graves H. Ammar Jun Wang Matthew E. Taylor 136 39 0 15 Feb 2021
Neural Recursive Belief States in Multi-Agent Reinforcement Learning Pol Moreno Edward Hughes Kevin R. McKee Bernardo Avila-Pires T. Weber 122 28 0 03 Feb 2021