v1v2v3v4v5 (latest)

Off-Belief Learning

International Conference on Machine Learning (ICML), 2021

6 March 2021

ArXiv (abs)PDF HTML Github

Papers citing "Off-Belief Learning"

50 / 54 papers shown

High entropy leads to symmetry equivariant policies in Dec-POMDPs

Johannes Forkel

Jakob Foerster

Andreas Bulling

Jakob Foerster

115

27 Nov 2025

Remembering the Markov Property in Cooperative MARL

Kale-ab Abebe Tessera

232

24 Jul 2025

CooT: Learning to Coordinate In-Context with Coordination Transformers

186

30 Jun 2025

Efficient Generation of Diverse Cooperative Agents with World Models

Yi Loo

Akshunn Trivedi

Malika Meghjani

144

09 Jun 2025

ROTATE: Regret-driven Open-ended Training for Ad Hoc Teamwork

462

29 May 2025

Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration

Andreas Kontogiannis

Konstantinos Papathanasiou

464

08 May 2025

Implicitly Aligning Humans and Autonomous Agents through Shared Task AbstractionsInternational Joint Conference on Artificial Intelligence (IJCAI), 2025

Stéphane Aroca-Ouellette

Miguel Aroca-Ouellette

Katharina von der Wense

Alessandro Roncone

283

07 May 2025

AssistanceZero: Scalably Solving Assistance Games

440

09 Apr 2025

OvercookedV2: Rethinking Overcooked for Zero-Shot CoordinationInternational Conference on Learning Representations (ICLR), 2025

462

22 Mar 2025

A Generalist Hanabi AgentInternational Conference on Learning Representations (ICLR), 2025

Arjun Vaithilingam Sudhakar

Hadi Nekoei

Mathieu Reymond

Miao Liu

Janarthanan Rajendran

Sarath Chandar

977

17 Mar 2025

Heterogeneous Multi-agent Zero-Shot Coordination by CoevolutionIEEE Transactions on Evolutionary Computation (TEVC), 2022

663

03 Jan 2025

Noisy Zero-Shot Coordination: Breaking The Common Knowledge Assumption In Zero-Shot Coordination Games

426

07 Nov 2024

Conceptual Belief-Informed Reinforcement Learning

Xingrui Gu

Chuyi Jiang

OffRL

426

02 Oct 2024

Strategy Game-Playing with Size-Constrained State Abstraction

Linjie Xu

Diego Perez-Liebana

Alexander Dockhorn

444

12 Aug 2024

Simplifying Deep Temporal Difference Learning

745

05 Jul 2024

LLM-Based Cooperative Agents using Information Relevance and Plan Validation

205

27 May 2024

Human-compatible driving partners through data-regularized self-play reinforcement learning

Daphne Cornelisse

Eugene Vinitsky

429

28 Mar 2024

Learning Translations: Emergent Communication Pretraining for Cooperative Language Acquisition

Dylan R. Cope

Peter McBurney

247

26 Feb 2024

Refining Minimax Regret for Unsupervised Environment Design

396

19 Feb 2024

Symmetry-Breaking Augmentations for Ad Hoc Teamwork

411

15 Feb 2024

Secret Collusion among AI Agents: Multi-Agent Deception via SteganographyNeural Information Processing Systems (NeurIPS), 2024

Christian Schroeder de Witt

1.2K

12 Feb 2024

minimax: Efficient Baselines for Autocurricula in JAX

393

21 Nov 2023

Cooperative AI via Decentralized Commitment Devices

Jonathan Passerat-Palmbach

285

14 Nov 2023

Efficient Human-AI Coordination via Preparatory Language-based Convention

296

01 Nov 2023

Towards A Natural Language Interface for Flexible Multi-Agent Task Assignment

260

31 Oct 2023

LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023

590

05 Oct 2023

Stabilizing Unsupervised Environment Design with a Learned Adversary

370

21 Aug 2023

Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi

Hadi Nekoei

Xutong Zhao

Janarthanan Rajendran

Miao Liu

Sarath Chandar

200

20 Aug 2023

Adaptive Coordination in Social Embodied RearrangementInternational Conference on Machine Learning (ICML), 2023

Ruta Desai

273

31 May 2023

A Hierarchical Approach to Population Training for Human-AI CollaborationInternational Joint Conference on Artificial Intelligence (IJCAI), 2023

Yi Loo

Chen Gong

Malika Meghjani

278

26 May 2023

Heterogeneous Social Value Orientation Leads to Meaningful Diversity in Sequential Social Dilemmas

Edgar A. Duénez-Guzmán

212

01 May 2023

Language Instructed Reinforcement Learning for Human-AI CoordinationInternational Conference on Machine Learning (ICML), 2023

Hengyuan Hu

Dorsa Sadigh

LM&Ro

415

13 Apr 2023

Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2023

Y. Lo

Christian Schroeder de Witt

286

19 Mar 2023

Improving Zero-Shot Coordination Performance Based on Policy SimilarityInternational Conference on Automated Planning and Scheduling (ICAPS), 2023

314

10 Feb 2023

Learning Zero-Shot Cooperation with Humans, Assuming Humans Are BiasedInternational Conference on Learning Representations (ICLR), 2023

Chao Yu

338

03 Feb 2023

Equivariant Networks for Zero-Shot CoordinationNeural Information Processing Systems (NeurIPS), 2022

Darius Muglich

Christian Schroeder de Witt

Elise van der Pol

Shimon Whiteson

Jakob N. Foerster

457

21 Oct 2022

Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and PlanningInternational Conference on Learning Representations (ICLR), 2022

339

11 Oct 2022

Human-AI Coordination via Human-Regularized Search and Learning

338

11 Oct 2022

Ad Hoc Teamwork in the Presence of Adversaries

Ted Fujimoto

Samrat Chatterjee

A. Ganguly

320

09 Aug 2022

Mimetic Models: Ethical Implications of AI that Acts Like YouAAAI/ACM Conference on AI, Ethics, and Society (AIES), 2022

308

19 Jul 2022

K-level Reasoning for Zero-Shot Coordination in HanabiNeural Information Processing Systems (NeurIPS), 2022

248

14 Jul 2022

Self-Explaining Deviations for CoordinationNeural Information Processing Systems (NeurIPS), 2022

205

13 Jul 2022

Grounding Aleatoric Uncertainty for Unsupervised Environment DesignNeural Information Processing Systems (NeurIPS), 2022

423

11 Jul 2022

Generalized Beliefs for Cooperative AIInternational Conference on Machine Learning (ICML), 2022

Darius Muglich

L. Zintgraf

Christian Schroeder de Witt

Shimon Whiteson

Jakob N. Foerster

258

26 Jun 2022

On the Impossibility of Learning to Cooperate with Adaptive Partner Strategies in Repeated GamesInternational Conference on Machine Learning (ICML), 2022

R. Loftin

F. Oliehoek

178

20 Jun 2022

The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human ModelsInternational Conference on Learning Representations (ICLR), 2022

Cassidy Laidlaw

Anca Dragan

OffRL

223

22 Apr 2022

On-the-fly Strategy Adaptation for ad-hoc Agent CoordinationAdaptive Agents and Multi-Agent Systems (AAMAS), 2022

Jaleh Zand

Jack Parker-Holder

Stephen J. Roberts

248

08 Mar 2022

Learning Intuitive Policies Using Action FeaturesInternational Conference on Machine Learning (ICML), 2022

342

29 Jan 2022

Any-Play: An Intrinsic Augmentation for Zero-Shot CoordinationAdaptive Agents and Multi-Agent Systems (AAMAS), 2022

Keane Lucas

R. Allen

243

28 Jan 2022

Attention Based Communication and Control for Multi-UAV Path PlanningIEEE Wireless Communications Letters (WCL), 2021

261

20 Dec 2021