Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1912.02318
Cited By
Improving Policies via Search in Cooperative Partially Observable Games
5 December 2019
Adam Lerer
Hengyuan Hu
Jakob N. Foerster
Noam Brown
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Improving Policies via Search in Cooperative Partially Observable Games"
22 / 22 papers shown
Title
Beyond Autoregression: Fast LLMs via Self-Distillation Through Time
Justin Deschenaux
Çağlar Gülçehre
44
2
0
28 Oct 2024
Adaptation Procedure in Misinformation Games
Konstantinos Varsos
Merkouris Papamichail
G. Flouris
M. Bitsaki
24
0
0
07 Sep 2024
Abstracting Imperfect Information Away from Two-Player Zero-Sum Games
Samuel Sokota
Ryan DÓrazio
Chun Kai Ling
David J. Wu
J. Zico Kolter
Noam Brown
27
4
0
22 Jan 2023
Safe Subgame Resolving for Extensive Form Correlated Equilibrium
Chun Kai Ling
Fei Fang
13
0
0
29 Dec 2022
Towards automating Codenames spymasters with deep reinforcement learning
Sherman Siu
14
2
0
28 Dec 2022
What is the Solution for State-Adversarial Multi-Agent Reinforcement Learning?
Songyang Han
Sanbao Su
Sihong He
Shuo Han
Haizhao Yang
Shaofeng Zou
Fei Miao
AAML
27
22
0
06 Dec 2022
Human-AI Coordination via Human-Regularized Search and Learning
Hengyuan Hu
David J. Wu
Adam Lerer
Jakob N. Foerster
Noam Brown
11
7
0
11 Oct 2022
Combining Theory of Mind and Abduction for Cooperation under Imperfect Information
Nieves Montes
Nardine Osman
Carles Sierra
26
4
0
30 Sep 2022
Self-Explaining Deviations for Coordination
Hengyuan Hu
Samuel Sokota
David J. Wu
A. Bakhtin
Andrei Lupu
Brandon Cui
Jakob N. Foerster
19
2
0
13 Jul 2022
PerfectDou: Dominating DouDizhu with Perfect Information Distillation
Yang Guan
Minghuan Liu
Weijun Hong
Weinan Zhang
Fei Fang
Guangjun Zeng
Yue Lin
25
26
0
30 Mar 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
36
9
0
23 Feb 2022
Common Information based Approximate State Representations in Multi-Agent Reinforcement Learning
Shitao Xiao
V. Subramanian
23
9
0
25 Oct 2021
Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi
H. Siu
Jaime D. Peña
Edenna Chen
Yutai Zhou
Victor J. Lopez
Kyle Palko
K. Chang
R. Allen
13
57
0
15 Jul 2021
Communicating Natural Programs to Humans and Machines
Samuel Acquaviva
Yewen Pu
Marta Kryven
Theo Sechopoulos
Catherine Wong
Gabrielle Ecanow
Maxwell Nye
Michael Henry Tessler
J. Tenenbaum
30
40
0
15 Jun 2021
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning
Daochen Zha
Jingru Xie
Wenye Ma
Sheng Zhang
Xiangru Lian
Xia Hu
Ji Liu
14
116
0
11 Jun 2021
Solving Common-Payoff Games with Approximate Policy Iteration
Samuel Sokota
Edward Lockhart
Finbarr Timbers
Elnaz Davoodi
Ryan DÓrazio
Neil Burch
Martin Schmid
Michael Bowling
Marc Lanctot
42
22
0
11 Jan 2021
Open Problems in Cooperative AI
Allan Dafoe
Edward Hughes
Yoram Bachrach
Tantum Collins
Kevin R. McKee
Joel Z Leibo
Kate Larson
T. Graepel
34
199
0
15 Dec 2020
Joint Policy Search for Multi-agent Collaboration with Imperfect Information
Yuandong Tian
Qucheng Gong
Tina Jiang
29
19
0
14 Aug 2020
Combining Deep Reinforcement Learning and Search for Imperfect-Information Games
Noam Brown
A. Bakhtin
Adam Lerer
Qucheng Gong
15
133
0
27 Jul 2020
Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Thomas W. Anthony
Tom Eccles
Andrea Tacchetti
János Kramár
I. Gemp
...
Richard Everett
Roman Werpachowski
Satinder Singh
T. Graepel
Yoram Bachrach
11
42
0
08 Jun 2020
Evaluating the Rainbow DQN Agent in Hanabi with Unseen Partners
Rodrigo Canaan
Xianbo Gao
Youjin Chung
Julian Togelius
Andy Nealen
Stefan Menzel
13
4
0
28 Apr 2020
Rethinking Formal Models of Partially Observable Multiagent Decision Making
Vojtěch Kovařík
Martin Schmid
Neil Burch
Michael Bowling
Viliam Lisý
OffRL
14
54
0
26 Jun 2019
1