2103.04000
Off-Belief Learning
International Conference on Machine Learning (ICML), 2021
6 March 2021
Hengyuan Hu
Adam Lerer
Brandon Cui
David J. Wu
Luis Pineda
Noam Brown
Jakob N. Foerster
OffRL
Papers citing "Off-Belief Learning"
50 / 54 papers shown
High entropy leads to symmetry equivariant policies in Dec-POMDPs
Johannes Forkel
Andreas Bulling
Jakob Foerster
115
1
0
27 Nov 2025
Remembering the Markov Property in Cooperative MARL
Kale-ab Abebe Tessera
Leonard Hinckeldey
Riccardo Zamboni
David Abel
Amos Storkey
232
0
0
24 Jul 2025
CooT: Learning to Coordinate In-Context with Coordination Transformers
Huai-Chih Wang
Hsiang-Chun Chuang
Hsi-Chun Cheng
Dai-Jie Wu
Shao-Hua Sun
OffRL
186
0
0
30 Jun 2025
Efficient Generation of Diverse Cooperative Agents with World Models
Yi Loo
Akshunn Trivedi
Malika Meghjani
144
1
0
09 Jun 2025
ROTATE: Regret-driven Open-ended Training for Ad Hoc Teamwork
Caroline Wang
Arrasy Rahman
Jiaxun Cui
Yoonchang Sung
Peter Stone
462
4
0
29 May 2025
Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration
Andreas Kontogiannis
Konstantinos Papathanasiou
Yi Shen
Giorgos Stamou
Michael M. Zavlanos
G. Vouros
464
4
0
08 May 2025
Implicitly Aligning Humans and Autonomous Agents through Shared Task Abstractions
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Stéphane Aroca-Ouellette
Miguel Aroca-Ouellette
Katharina von der Wense
Alessandro Roncone
283
1
0
07 May 2025
AssistanceZero: Scalably Solving Assistance Games
Cassidy Laidlaw
Eli Bronstein
Timothy Guo
Dylan Feng
Lukas Berglund
Justin Svegliato
Stuart J. Russell
Anca Dragan
440
4
0
09 Apr 2025
OvercookedV2: Rethinking Overcooked for Zero-Shot Coordination
International Conference on Learning Representations (ICLR), 2025
Tobias Gessler
Tin Dizdarevic
Ani Calinescu
Benjamin Ellis
Andrei Lupu
Jakob Foerster
462
11
0
22 Mar 2025
A Generalist Hanabi Agent
International Conference on Learning Representations (ICLR), 2025
Arjun Vaithilingam Sudhakar
Hadi Nekoei
Mathieu Reymond
Miao Liu
Janarthanan Rajendran
Sarath Chandar
977
4
0
17 Mar 2025
Heterogeneous Multi-agent Zero-Shot Coordination by Coevolution
IEEE Transactions on Evolutionary Computation (TEVC), 2022
Ke Xue
Yutong Wang
Cong Guan
Lei Yuan
Haobo Fu
Qiang Fu
Chao Qian
Yang Yu
663
25
0
03 Jan 2025
Noisy Zero-Shot Coordination: Breaking The Common Knowledge Assumption In Zero-Shot Coordination Games
Usman Anwar
Ashish Pandian
Jia Wan
David M. Krueger
Jakob N. Foerster
426
0
0
07 Nov 2024
Conceptual Belief-Informed Reinforcement Learning
Xingrui Gu
Chuyi Jiang
OffRL
426
0
0
02 Oct 2024
Strategy Game-Playing with Size-Constrained State Abstraction
Linjie Xu
Diego Perez-Liebana
Alexander Dockhorn
444
1
0
12 Aug 2024
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
745
66
0
05 Jul 2024
LLM-Based Cooperative Agents using Information Relevance and Plan Validation
SeungWon Seo
Junhyeok Lee
SeongRae Noh
HyeongYeop Kang
205
0
0
27 May 2024
Human-compatible driving partners through data-regularized self-play reinforcement learning
Daphne Cornelisse
Eugene Vinitsky
429
13
0
28 Mar 2024
Learning Translations: Emergent Communication Pretraining for Cooperative Language Acquisition
Dylan R. Cope
Peter McBurney
247
1
0
26 Feb 2024
Refining Minimax Regret for Unsupervised Environment Design
Michael Beukman
Samuel Coward
Michael T. Matthews
Mattie Fellows
Minqi Jiang
Michael Dennis
Jakob Foerster
396
15
0
19 Feb 2024
Symmetry-Breaking Augmentations for Ad Hoc Teamwork
Ravi Hammond
Dustin Craggs
Mingyu Guo
Jakob Foerster
Ian Reid
411
2
0
15 Feb 2024
Secret Collusion among AI Agents: Multi-Agent Deception via Steganography
Neural Information Processing Systems (NeurIPS), 2024
S. Motwani
Mikhail Baranchuk
Martin Strohmeier
Vijay Bolina
Juil Sock
Lewis Hammond
Christian Schroeder de Witt
1.2K
4
0
12 Feb 2024
minimax: Efficient Baselines for Autocurricula in JAX
Minqi Jiang
Michael Dennis
Edward Grefenstette
Tim Rocktäschel
393
11
0
21 Nov 2023
Cooperative AI via Decentralized Commitment Devices
Xinyuan Sun
Davide Crapis
Matt Stephenson
B. Monnot
Thomas Thiery
Jonathan Passerat-Palmbach
285
13
0
14 Nov 2023
Efficient Human-AI Coordination via Preparatory Language-based Convention
Cong Guan
Lichao Zhang
Chunpeng Fan
Yi-Chen Li
Feng Chen
Lihe Li
Yunjia Tian
Lei Yuan
Yang Yu
LM&Ro
296
10
0
01 Nov 2023
Towards A Natural Language Interface for Flexible Multi-Agent Task Assignment
Jake Brawer
Kayleigh Bishop
Bradley Hayes
Alessandro Roncone
260
5
0
31 Oct 2023
LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Saaket Agashe
Yue Fan
Anthony Reyna
Xin Eric Wang
LLMAG
LRM
590
53
0
05 Oct 2023
Stabilizing Unsupervised Environment Design with a Learned Adversary
Ishita Mediratta
Minqi Jiang
Jack Parker-Holder
Michael Dennis
Eugene Vinitsky
Tim Rocktäschel
370
20
0
21 Aug 2023
Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi
Hadi Nekoei
Xutong Zhao
Janarthanan Rajendran
Miao Liu
Sarath Chandar
200
10
0
20 Aug 2023
Adaptive Coordination in Social Embodied Rearrangement
International Conference on Machine Learning (ICML), 2023
Andrew Szot
Unnat Jain
Dhruv Batra
Z. Kira
Ruta Desai
Akshara Rai
273
20
0
31 May 2023
A Hierarchical Approach to Population Training for Human-AI Collaboration
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Yi Loo
Chen Gong
Malika Meghjani
278
9
0
26 May 2023
Heterogeneous Social Value Orientation Leads to Meaningful Diversity in Sequential Social Dilemmas
Udari Madhushani
Kevin R. McKee
J. Agapiou
Joel Z Leibo
Richard Everett
Thomas W. Anthony
Edward Hughes
K. Tuyls
Edgar A. Duéñez-Guzmán
212
4
0
01 May 2023
Language Instructed Reinforcement Learning for Human-AI Coordination
International Conference on Machine Learning (ICML), 2023
Hengyuan Hu
Dorsa Sadigh
LM&Ro
415
85
0
13 Apr 2023
Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement Learning
International Conference on Learning Representations (ICLR), 2023
Y. Lo
Christian Schroeder de Witt
Samuel Sokota
Jakob N. Foerster
Shimon Whiteson
OffRL
286
6
0
19 Mar 2023
Improving Zero-Shot Coordination Performance Based on Policy Similarity
International Conference on Automated Planning and Scheduling (ICAPS), 2023
Lebin Yu
Yunbo Qiu
Quanming Yao
Xudong Zhang
Jian Wang
314
2
0
10 Feb 2023
Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased
International Conference on Learning Representations (ICLR), 2023
Chao Yu
Jiaxuan Gao
Weiling Liu
Bo Xu
Hao Tang
Jiaqi Yang
Yu Wang
Yi Wu
338
53
0
03 Feb 2023
Equivariant Networks for Zero-Shot Coordination
Neural Information Processing Systems (NeurIPS), 2022
Darius Muglich
Christian Schroeder de Witt
Elise van der Pol
Shimon Whiteson
Jakob N. Foerster
457
21
0
21 Oct 2022
Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning
International Conference on Learning Representations (ICLR), 2022
A. Bakhtin
David J. Wu
Adam Lerer
Jonathan Gray
Athul Paul Jacob
Gabriele Farina
Alexander H. Miller
Noam Brown
339
63
0
11 Oct 2022
Human-AI Coordination via Human-Regularized Search and Learning
Hengyuan Hu
David J. Wu
Adam Lerer
Jakob N. Foerster
Noam Brown
338
13
0
11 Oct 2022
Ad Hoc Teamwork in the Presence of Adversaries
Ted Fujimoto
Samrat Chatterjee
A. Ganguly
320
4
0
09 Aug 2022
Mimetic Models: Ethical Implications of AI that Acts Like You
AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2022
Reid McIlroy-Young
Jon M. Kleinberg
S. Sen
Solon Barocas
Ashton Anderson
308
27
0
19 Jul 2022
K-level Reasoning for Zero-Shot Coordination in Hanabi
Neural Information Processing Systems (NeurIPS), 2022
Brandon Cui
Hengyuan Hu
Luis Pineda
Jakob N. Foerster
OffRL
LRM
248
43
0
14 Jul 2022
Self-Explaining Deviations for Coordination
Neural Information Processing Systems (NeurIPS), 2022
Hengyuan Hu
Samuel Sokota
David J. Wu
A. Bakhtin
Andrei Lupu
Brandon Cui
Jakob N. Foerster
205
2
0
13 Jul 2022
Grounding Aleatoric Uncertainty for Unsupervised Environment Design
Neural Information Processing Systems (NeurIPS), 2022
Minqi Jiang
Michael Dennis
Jack Parker-Holder
Andrei Lupu
Heinrich Küttler
Edward Grefenstette
Tim Rocktäschel
Jakob N. Foerster
423
17
0
11 Jul 2022
Generalized Beliefs for Cooperative AI
International Conference on Machine Learning (ICML), 2022
Darius Muglich
L. Zintgraf
Christian Schroeder de Witt
Shimon Whiteson
Jakob N. Foerster
258
11
0
26 Jun 2022
On the Impossibility of Learning to Cooperate with Adaptive Partner Strategies in Repeated Games
International Conference on Machine Learning (ICML), 2022
R. Loftin
F. Oliehoek
178
4
0
20 Jun 2022
The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models
International Conference on Learning Representations (ICLR), 2022
Cassidy Laidlaw
Anca Dragan
OffRL
223
45
0
22 Apr 2022
On-the-fly Strategy Adaptation for ad-hoc Agent Coordination
Adaptive Agents and Multi-Agent Systems (AAMAS), 2022
Jaleh Zand
Jack Parker-Holder
Stephen J. Roberts
248
14
0
08 Mar 2022
Learning Intuitive Policies Using Action Features
International Conference on Machine Learning (ICML), 2022
Mingwei Ma
Jizhou Liu
Samuel Sokota
Max Kleiman-Weiner
Jakob N. Foerster
342
4
0
29 Jan 2022
Any-Play: An Intrinsic Augmentation for Zero-Shot Coordination
Adaptive Agents and Multi-Agent Systems (AAMAS), 2022
Keane Lucas
R. Allen
243
36
0
28 Jan 2022
Attention Based Communication and Control for Multi-UAV Path Planning
IEEE Wireless Communications Letters (WCL), 2021
Hamid Shiri
Hyowoon Seo
Jihong Park
M. Bennis
261
21
0
20 Dec 2021