Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2206.15378
Cited By
Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
30 June 2022
Julien Perolat
Bart De Vylder
Daniel Hennes
Eugene Tarassov
Florian Strub
V. D. Boer
Paul Muller
Jerome T. Connor
Neil Burch
Thomas W. Anthony
Stephen Marcus McAleer
Romuald Elie
Sarah H. Cen
Zhe Wang
A. Gruslys
Aleksandra Malysheva
Mina Khan
Sherjil Ozair
Finbarr Timbers
Tobias Pohlen
Tom Eccles
Mark Rowland
Marc Lanctot
Jean-Baptiste Lespiau
Bilal Piot
Shayegan Omidshafiei
Edward Lockhart
Laurent Sifre
Nathalie Beauguerlange
Rémi Munos
David Silver
Satinder Singh
Demis Hassabis
K. Tuyls
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning"
35 / 85 papers shown
Title
Towards practical reinforcement learning for tokamak magnetic control
Brendan D. Tracey
Andrea Michi
Yuri Chervonyi
Ian Davies
Cosmin Paduraru
...
Jonathan Evens
Paula Kurylowicz
D. Mankowitz
Martin Riedmiller
The Tcv Team
AI4CE
43
10
0
21 Jul 2023
Glamour muscles: why having a body is not what it means to be embodied
Shawn L. E. Beaulieu
Sam Kriegman
AI4CE
32
0
0
17 Jul 2023
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
Zhiyu Mei
Wei Fu
Jiaxuan Gao
Guang Wang
Huanchen Zhang
Yi Wu
OffRL
LRM
29
5
0
29 Jun 2023
The Manipulation Problem: Conversational AI as a Threat to Epistemic Agency
Louis B. Rosenberg
20
4
0
19 Jun 2023
Composing Efficient, Robust Tests for Policy Selection
Dustin Morrill
Thomas J. Walsh
D. Hernández
Peter R. Wurman
Peter Stone
22
0
0
12 Jun 2023
Potential-based Credit Assignment for Cooperative RL-based Testing of Autonomous Vehicles
Utku Ayvaz
Chih-Hong Cheng
Hao Shen
16
0
0
28 May 2023
Zero-sum Polymatrix Markov Games: Equilibrium Collapse and Efficient Computation of Nash Equilibria
Fivos Kalogiannis
Ioannis Panageas
34
8
0
23 May 2023
Hardness of Independent Learning and Sparse Equilibrium Computation in Markov Games
Dylan J. Foster
Noah Golowich
Sham Kakade
36
10
0
22 Mar 2023
Generating synthetic multi-dimensional molecular-mediator time series data for artificial intelligence-based disease trajectory forecasting and drug development digital twins: Considerations
G. An
Chase Cockrell
26
2
0
16 Mar 2023
Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious Play
Wei Xi
Yongxin Zhang
Changnan Xiao
Xuefeng Huang
Shihong Deng
Haowei Liang
Jie Chen
Peng Sun
OffRL
50
8
0
07 Mar 2023
Uncoupled and Convergent Learning in Two-Player Zero-Sum Markov Games with Bandit Feedback
Yang Cai
Haipeng Luo
Chen-Yu Wei
Weiqiang Zheng
29
17
0
05 Mar 2023
Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning
Marc Lanctot
John Schultz
Neil Burch
Max O. Smith
Daniel Hennes
Thomas W. Anthony
Julien Perolat
OffRL
20
4
0
02 Mar 2023
Learning not to Regret
David Sychrovský
Michal Sustr
Elnaz Davoodi
Michael Bowling
Marc Lanctot
Martin Schmid
34
3
0
02 Mar 2023
ASP: Learn a Universal Neural Solver!
Chenguang Wang
Zhouliang Yu
Stephen Marcus McAleer
Tianshu Yu
Yao-Chun Yang
AAML
32
24
0
01 Mar 2023
Auxiliary Task-based Deep Reinforcement Learning for Quantum Control
Shumin Zhou
Hailan Ma
S. Kuang
Daoyi Dong
29
5
0
28 Feb 2023
Price of Anarchy in a Double-Sided Critical Distribution System
David Sychrovský
Jakub Cerny
Sylvain Lichau
M. Loebl
18
1
0
20 Feb 2023
Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
Lukas Schafer
Oliver Slumbers
Stephen Marcus McAleer
Yali Du
Stefano V. Albrecht
D. Mguni
79
7
0
07 Feb 2023
Doubly Optimal No-Regret Learning in Monotone Games
Yang Cai
Weiqiang Zheng
46
11
0
30 Jan 2023
Abstracting Imperfect Information Away from Two-Player Zero-Sum Games
Samuel Sokota
Ryan DÓrazio
Chun Kai Ling
David J. Wu
J. Zico Kolter
Noam Brown
27
4
0
22 Jan 2023
ApproxED: Approximate exploitability descent via learned best responses
Carlos Martin
T. Sandholm
32
0
0
20 Jan 2023
Function Approximation for Solving Stackelberg Equilibrium in Large Perfect Information Games
Chun Kai Ling
J. Zico Kolter
Fei Fang
35
0
0
29 Dec 2022
Adapting to game trees in zero-sum imperfect information games
Côme Fiegel
Pierre Ménard
Tadashi Kozuno
Rémi Munos
Vianney Perchet
Michal Valko
32
9
0
23 Dec 2022
Decoding surface codes with deep reinforcement learning and probabilistic policy reuse
E. S. Matekole
Esther Ye
Ramya Iyer
Samuel Yen-Chi Chen
26
2
0
22 Dec 2022
Beyond CAGE: Investigating Generalization of Learned Autonomous Network Defense Policies
M. Wolk
A. Applebaum
Camron Dennler
P. Dwyer
M. Moskowitz
...
N. Nichols
Nicole Park
Paul Rachwalski
Frank Rau
A. Webster
OffRL
AAML
24
17
0
28 Nov 2022
Value-based CTDE Methods in Symmetric Two-team Markov Game: from Cooperation to Team Competition
Pascal Leroy
J. Pisane
D. Ernst
22
3
0
21 Nov 2022
General Intelligence Requires Rethinking Exploration
Minqi Jiang
Tim Rocktaschel
Edward Grefenstette
LRM
29
18
0
15 Nov 2022
Adversarial Policies Beat Superhuman Go AIs
T. T. Wang
Adam Gleave
Tom Tseng
Kellin Pelrine
Nora Belrose
...
Michael Dennis
Yawen Duan
V. Pogrebniak
Sergey Levine
Stuart Russell
AAML
13
21
0
01 Nov 2022
Classifying Ambiguous Identities in Hidden-Role Stochastic Games with Multi-Agent Reinforcement Learning
Shijie Han
Siyuan Li
Bo An
Wei Zhao
P. Liu
35
0
0
24 Oct 2022
RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
Wei Qiu
Xiao Ma
Bo An
S. Obraztsova
Shuicheng Yan
Zhongwen Xu
27
1
0
18 Oct 2022
Efficiently Computing Nash Equilibria in Adversarial Team Markov Games
Fivos Kalogiannis
Ioannis Anagnostides
Ioannis Panageas
Emmanouil-Vasileios Vlatakis-Gkaragkounis
Vaggos Chatziafratis
S. Stavroulakis
39
13
0
03 Aug 2022
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Stephen Marcus McAleer
JB Lanier
Kevin A. Wang
Pierre Baldi
Roy Fox
T. Sandholm
35
18
0
13 Jul 2022
Approximating Discontinuous Nash Equilibrial Values of Two-Player General-Sum Differential Games
Lei Zhang
Mukesh Ghimire
Wenlong Zhang
Zhenni Xu
Yi Ren
27
7
0
05 Jul 2022
ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret
Stephen Marcus McAleer
Gabriele Farina
Marc Lanctot
T. Sandholm
32
24
0
08 Jun 2022
A Game-Theoretic Framework for Managing Risk in Multi-Agent Systems
Oliver Slumbers
D. Mguni
Stephen Marcus McAleer
Stefano B. Blumberg
Jun Wang
Yaodong Yang
32
9
0
30 May 2022
Student of Games: A unified learning algorithm for both perfect and imperfect information games
Martin Schmid
Matej Moravcík
Neil Burch
Rudolf Kadlec
Josh Davidson
...
Marc Lanctot
G. Z. Holland
Elnaz Davoodi
Alden Christianson
Michael Bowling
29
20
0
06 Dec 2021
Previous
1
2