ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2207.07166
  4. Cited By
K-level Reasoning for Zero-Shot Coordination in Hanabi

K-level Reasoning for Zero-Shot Coordination in Hanabi

14 July 2022
Brandon Cui
Hengyuan Hu
Luis Pineda
Jakob N. Foerster
    OffRL
    LRM
ArXivPDFHTML

Papers citing "K-level Reasoning for Zero-Shot Coordination in Hanabi"

11 / 11 papers shown
Title
Implicitly Aligning Humans and Autonomous Agents through Shared Task Abstractions
Implicitly Aligning Humans and Autonomous Agents through Shared Task Abstractions
Stéphane Aroca-Ouellette
Miguel Aroca-Ouellette
K. Wense
A. Roncone
39
0
0
07 May 2025
A Generalist Hanabi Agent
A Generalist Hanabi Agent
Arjun Vaithilingam Sudhakar
Hadi Nekoei
Mathieu Reymond
Miao Liu
Janarthanan Rajendran
Sarath Chandar
160
0
0
17 Mar 2025
Strategy Game-Playing with Size-Constrained State Abstraction
Strategy Game-Playing with Size-Constrained State Abstraction
Linjie Xu
Diego Perez-Liebana
Alexander Dockhorn
35
0
0
12 Aug 2024
On the Complexity of Learning to Cooperate with Populations of Socially
  Rational Agents
On the Complexity of Learning to Cooperate with Populations of Socially Rational Agents
R. Loftin
Saptarashmi Bandyopadhyay
M. Çelikok
26
0
0
29 Jun 2024
Improving Zero-Shot Coordination Performance Based on Policy Similarity
Improving Zero-Shot Coordination Performance Based on Policy Similarity
Lebin Yu
Yunbo Qiu
Quanming Yao
Xudong Zhang
Jian Wang
16
1
0
10 Feb 2023
Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased
Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased
Chao Yu
Jiaxuan Gao
Weiling Liu
Bo Xu
Hao Tang
Jiaqi Yang
Yu Wang
Yi Wu
31
39
0
03 Feb 2023
Coordination with Humans via Strategy Matching
Coordination with Humans via Strategy Matching
Michelle Zhao
Reid G. Simmons
H. Admoni
11
8
0
27 Oct 2022
Learning General World Models in a Handful of Reward-Free Deployments
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktaschel
Edward Grefenstette
OffRL
55
8
0
23 Oct 2022
Mastering the Game of No-Press Diplomacy via Human-Regularized
  Reinforcement Learning and Planning
Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning
A. Bakhtin
David J. Wu
Adam Lerer
Jonathan Gray
Athul Paul Jacob
Gabriele Farina
Alexander H. Miller
Noam Brown
12
41
0
11 Oct 2022
Human-AI Coordination via Human-Regularized Search and Learning
Human-AI Coordination via Human-Regularized Search and Learning
Hengyuan Hu
David J. Wu
Adam Lerer
Jakob N. Foerster
Noam Brown
11
7
0
11 Oct 2022
"Other-Play" for Zero-Shot Coordination
"Other-Play" for Zero-Shot Coordination
Hengyuan Hu
Adam Lerer
A. Peysakhovich
Jakob N. Foerster
VLM
OffRL
136
218
0
06 Mar 2020
1