Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.02923
Cited By
v1
v2 (latest)
Human-Level Performance in No-Press Diplomacy via Equilibrium Search
6 October 2020
Jonathan Gray
Adam Lerer
A. Bakhtin
Noam Brown
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Human-Level Performance in No-Press Diplomacy via Equilibrium Search"
34 / 34 papers shown
Title
Dynamic Search for Inference-Time Alignment in Diffusion Models
Xiner Li
Masatoshi Uehara
Xingyu Su
Gabriele Scalia
Tommaso Biancalani
Aviv Regev
Sergey Levine
Shuiwang Ji
96
4
0
03 Mar 2025
Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL
Wichayaporn Wongkamjan
Yanze Wang
Feng Gu
Denis Peskoff
Jonathan K. Kummerfeld
Jonathan May
Jordan Lee Boyd-Graber
207
0
0
18 Feb 2025
Personalized Help for Optimizing Low-Skilled Users' Strategy
Feng Gu
Wichayaporn Wongkamjan
Jordan Lee Boyd-Graber
Jonathan K. Kummerfeld
Denis Peskoff
Jonathan May
100
0
0
14 Nov 2024
Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents
Pranav Putta
Edmund Mills
Naman Garg
S. Motwani
Chelsea Finn
Divyansh Garg
Rafael Rafailov
LLMAG
LRM
97
88
0
13 Aug 2024
Evaluating and Enhancing LLMs Agent based on Theory of Mind in Guandan: A Multi-Player Cooperative Game under Imperfect Information
Yauwai Yim
Chunkit Chan
Tianyu Shi
Zheye Deng
Wei Fan
Tianshi Zheng
Yangqiu Song
LLMAG
98
13
0
05 Aug 2024
Richelieu: Self-Evolving LLM-Based Agents for AI Diplomacy
Zhenyu Guan
Xiangyu Kong
Fangwei Zhong
Yizhou Wang
78
12
0
09 Jul 2024
Tree Search for Language Model Agents
Jing Yu Koh
Stephen Marcus McAleer
Daniel Fried
Ruslan Salakhutdinov
LM&Ro
LLMAG
LRM
131
75
0
01 Jul 2024
More Victories, Less Cooperation: Assessing Cicero's Diplomacy Play
Wichayaporn Wongkamjan
Feng Gu
Yanze Wang
Ulf Hermjakob
Jonathan May
Brandon M. Stewart
Jonathan K. Kummerfeld
Denis Peskoff
Jordan L. Boyd-Graber
90
6
0
07 Jun 2024
Designing Skill-Compatible AI: Methodologies and Frameworks in Chess
Karim Hamade
Reid McIlroy-Young
Siddhartha Sen
Jon M. Kleinberg
Ashton Anderson
51
6
0
08 May 2024
MARL-LNS: Cooperative Multi-agent Reinforcement Learning via Large Neighborhoods Search
Weizhe Chen
Sven Koenig
B. Dilkina
78
0
0
03 Apr 2024
Evaluating Language Model Agency through Negotiations
Tim R. Davidson
V. Veselovsky
Martin Josifoski
Maxime Peyrard
Antoine Bosselut
Michal Kosinski
Robert West
LLMAG
89
29
0
09 Jan 2024
Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4
Jiaxian Guo
Bo Yang
Paul D. Yoo
Bill Yuchen Lin
Yusuke Iwasawa
Yutaka Matsuo
LLMAG
118
45
0
29 Sep 2023
Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations
Yongyuan Liang
Yanchao Sun
Ruijie Zheng
Xiangyu Liu
Benjamin Eysenbach
Tuomas Sandholm
Furong Huang
Stephen Marcus McAleer
OOD
82
0
0
22 Jul 2023
Function Approximation for Solving Stackelberg Equilibrium in Large Perfect Information Games
Chun Kai Ling
J. Zico Kolter
Fei Fang
60
0
0
29 Dec 2022
Safe Subgame Resolving for Extensive Form Correlated Equilibrium
Chun Kai Ling
Fei Fang
43
0
0
29 Dec 2022
Discovering Latent Knowledge in Language Models Without Supervision
Collin Burns
Haotian Ye
Dan Klein
Jacob Steinhardt
163
386
0
07 Dec 2022
AutoReply: Detecting Nonsense in Dialogue Introspectively with Discriminative Replies
Weiyan Shi
Emily Dinan
Adithya Renduchintala
Daniel Fried
Athul Paul Jacob
Zhou Yu
M. Lewis
AAML
106
2
0
22 Nov 2022
Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning
A. Bakhtin
David J. Wu
Adam Lerer
Jonathan Gray
Athul Paul Jacob
Gabriele Farina
Alexander H. Miller
Noam Brown
122
47
0
11 Oct 2022
Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments
I. Gemp
Thomas W. Anthony
Yoram Bachrach
Avishkar Bhoopchand
Kalesha Bullard
...
Florian Strub
Andrea Tacchetti
Eugene Tarassov
Zhe Wang
K. Tuyls
LLMAG
AI4CE
88
3
0
22 Sep 2022
ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret
Stephen Marcus McAleer
Gabriele Farina
Marc Lanctot
Tuomas Sandholm
174
26
0
08 Jun 2022
Any-Play: An Intrinsic Augmentation for Zero-Shot Coordination
Keane Lucas
R. Allen
103
26
0
28 Jan 2022
Conditional Imitation Learning for Multi-Agent Games
Andy Shih
Stefano Ermon
Dorsa Sadigh
88
11
0
05 Jan 2022
Modeling Strong and Human-Like Gameplay with KL-Regularized Search
Athul Paul Jacob
David J. Wu
Gabriele Farina
Adam Lerer
Hengyuan Hu
A. Bakhtin
Jacob Andreas
Noam Brown
60
54
0
14 Dec 2021
Student of Games: A unified learning algorithm for both perfect and imperfect information games
Martin Schmid
Matej Moravcík
Neil Burch
Rudolf Kadlec
Josh Davidson
...
Marc Lanctot
G. Z. Holland
Elnaz Davoodi
Alden Christianson
Michael Bowling
86
22
0
06 Dec 2021
Normative Disagreement as a Challenge for Cooperative AI
J. Stastny
Maxime Riché
Alexander Lyzhov
Johannes Treutlein
Allan Dafoe
Jesse Clifton
51
10
0
27 Nov 2021
No-Press Diplomacy from Scratch
A. Bakhtin
David J. Wu
Adam Lerer
Noam Brown
178
44
0
06 Oct 2021
Temporal Induced Self-Play for Stochastic Bayesian Games
Weizhe Chen
Zihan Zhou
Yi Wu
Fei Fang
25
4
0
21 Aug 2021
Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi
H. Siu
Jaime D. Peña
Edenna Chen
Yutai Zhou
Victor J. Lopez
Kyle Palko
K. Chang
R. Allen
138
58
0
15 Jul 2021
Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers
Luke Marris
Paul Muller
Marc Lanctot
K. Tuyls
T. Graepel
93
36
0
17 Jun 2021
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning
Daochen Zha
Jingru Xie
Wenye Ma
Sheng Zhang
Xiangru Lian
Helen Zhou
Ji Liu
71
117
0
11 Jun 2021
Learning to Play General-Sum Games Against Multiple Boundedly Rational Agents
Eric Zhao
Alexander R. Trott
Caiming Xiong
Stephan Zheng
OffRL
47
1
0
10 Jun 2021
Solving Common-Payoff Games with Approximate Policy Iteration
Samuel Sokota
Edward Lockhart
Finbarr Timbers
Elnaz Davoodi
Ryan DÓrazio
Neil Burch
Martin Schmid
Michael Bowling
Marc Lanctot
93
22
0
11 Jan 2021
OpenHoldem: A Benchmark for Large-Scale Imperfect-Information Game Research
Kai Li
Hang Xu
Enmin Zhao
Zhe Wu
Junliang Xing
VLM
59
0
0
11 Dec 2020
DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker
Matej Moravcík
Martin Schmid
Neil Burch
Viliam Lisý
Dustin Morrill
Nolan Bard
Trevor Davis
Kevin Waugh
Michael Bradley Johanson
Michael Bowling
BDL
261
913
0
06 Jan 2017
1