1903.05614
Computing Approximate Equilibria in Sequential Adversarial Games by Exploitability Descent
13 March 2019
Edward Lockhart
Marc Lanctot
Julien Pérolat
Jean-Baptiste Lespiau
Dustin Morrill
Finbarr Timbers
K. Tuyls
Papers citing
"Computing Approximate Equilibria in Sequential Adversarial Games by Exploitability Descent"
44 / 44 papers shown
Title
Meta-Learning in Self-Play Regret Minimization
David Sychrovský
Martin Schmid
Michal Sustr
Michael Bowling
71
0
0
26 Apr 2025
Solving Infinite-Player Games with Player-to-Strategy Networks
Carlos Martin
Tuomas Sandholm
99
0
0
17 Jan 2025
Approximating N-Player Nash Equilibrium through Gradient Descent
Dongge Wang
Xiang Yan
Zehao Dou
Wenhan Huang
Yaodong Yang
Xiaotie Deng
48
0
0
06 Jan 2025
Joint-perturbation simultaneous pseudo-gradient
Carlos Martin
Tuomas Sandholm
58
2
0
17 Aug 2024
Population-aware Online Mirror Descent for Mean-Field Games by Deep Reinforcement Learning
Zida Wu
Mathieu Lauriere
Samuel Jia Cong Chua
Matthieu Geist
Olivier Pietquin
Ankur M. Mehta
85
5
0
06 Mar 2024
Deep Reinforcement Learning for Autonomous Cyber Operations: A Survey
Gregory Palmer
Chris Parry
Daniel J.B. Harrold
Chris Willis
AI4CE
84
1
0
11 Oct 2023
Composing Efficient, Robust Tests for Policy Selection
Dustin Morrill
Thomas J. Walsh
D. Hernández
Peter R. Wurman
Peter Stone
59
1
0
12 Jun 2023
Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL
Xiangyu Liu
Souradip Chakraborty
Yanchao Sun
Furong Huang
AAML
66
5
0
27 May 2023
Safe Multi-agent Learning via Trapping Regions
A. Czechowski
F. Oliehoek
26
0
0
27 Feb 2023
Combining Deep Reinforcement Learning and Search with Generative Models for Game-Theoretic Opponent Modeling
Zun Li
Marc Lanctot
Kevin R. McKee
Luke Marris
I. Gemp
Daniel Hennes
Paul Muller
Kate Larson
Yoram Bachrach
Michael P. Wellman
66
11
0
01 Feb 2023
Are Equivariant Equilibrium Approximators Beneficial?
Zhijian Duan
Yunxuan Ma
Xiaotie Deng
75
4
0
27 Jan 2023
ApproxED: Approximate exploitability descent via learned best responses
Carlos Martin
Tuomas Sandholm
88
0
0
20 Jan 2023
Anticipatory Fictitious Play
Alex Cloud
Albert Wang
W. Kerr
39
1
0
20 Dec 2022
Finding mixed-strategy equilibria of continuous-action games without gradients using randomized policy networks
Carlos Martin
Tuomas Sandholm
49
11
0
29 Nov 2022
Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations
N. Vadori
Leo Ardon
Sumitra Ganesh
Thomas Spooner
Selim Amrouni
Jared Vann
Mengda Xu
Zeyu Zheng
T. Balch
Manuela Veloso
68
16
0
13 Oct 2022
Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments
I. Gemp
Thomas W. Anthony
Yoram Bachrach
Avishkar Bhoopchand
Kalesha Bullard
...
Florian Strub
Andrea Tacchetti
Eugene Tarassov
Zhe Wang
K. Tuyls
LLMAG
AI4CE
88
3
0
22 Sep 2022
Last-Iterate Convergence with Full and Noisy Feedback in Two-Player Zero-Sum Games
Kenshi Abe
Kaito Ariu
Mitsuki Sakamoto
Kenta Toyoshima
Atsushi Iwasaki
84
12
0
21 Aug 2022
Mutation-Driven Follow the Regularized Leader for Last-Iterate Convergence in Zero-Sum Games
Kenshi Abe
Mitsuki Sakamoto
Atsushi Iwasaki
65
18
0
18 Jun 2022
Student of Games: A unified learning algorithm for both perfect and imperfect information games
Martin Schmid
Matej Moravcík
Neil Burch
Rudolf Kadlec
Josh Davidson
...
Marc Lanctot
G. Z. Holland
Elnaz Davoodi
Alden Christianson
Michael Bowling
86
22
0
06 Dec 2021
Learning to Be Cautious
Montaser Mohammedalamen
Dustin Morrill
Alexander Sieusahai
Yash Satsangi
Michael Bowling
66
3
0
29 Oct 2021
Is Nash Equilibrium Approximator Learnable?
Zhijian Duan
Wenhan Huang
Dinghuai Zhang
Yali Du
Jun Wang
Yaodong Yang
Xiaotie Deng
76
6
0
17 Aug 2021
A Game-Theoretic Approach to Multi-Agent Trust Region Optimization
Ying Wen
Hui Chen
Yaodong Yang
Zheng Tian
Minne Li
Xu Chen
Jun Wang
94
11
0
12 Jun 2021
Multi-agent Reinforcement Learning in OpenSpiel: A Reproduction Report
Michael Walton
Viliam Lisý
33
5
0
27 Feb 2021
Deep Latent Competition: Learning to Race Using Visual Control Policies in Latent Space
Wilko Schwarting
Tim Seyde
Igor Gilitschenski
Lucas Liebenwein
Ryan M Sander
S. Karaman
Daniela Rus
BDL
83
37
0
19 Feb 2021
Provably Efficient Policy Optimization for Two-Player Zero-Sum Markov Games
Yulai Zhao
Yuandong Tian
Jason D. Lee
S. Du
OffRL
76
18
0
17 Feb 2021
Optimizing αμ
Tristan Cazenave
Swann Legras
V. Ventos
30
59
0
29 Jan 2021
Independent Policy Gradient Methods for Competitive Reinforcement Learning
C. Daskalakis
Dylan J. Foster
Noah Golowich
241
163
0
11 Jan 2021
Model-free Neural Counterfactual Regret Minimization with Bootstrap Learning
Weiming Liu
Bin Li
Julian Togelius
83
8
0
03 Dec 2020
Complexity and Algorithms for Exploiting Quantal Opponents in Large Two-Player Games
David Milec
Jakub Cerny
Viliam Lisý
Bo An
10
12
0
30 Sep 2020
The Advantage Regret-Matching Actor-Critic
A. Gruslys
Marc Lanctot
Rémi Munos
Finbarr Timbers
Martin Schmid
...
Jean-Baptiste Lespiau
John Schultz
M. G. Azar
Michael Bowling
K. Tuyls
OffRL
70
28
0
27 Aug 2020
Calibration of Shared Equilibria in General Sum Partially Observable Markov Games
N. Vadori
Sumitra Ganesh
P. Reddy
Manuela Veloso
132
15
0
23 Jun 2020
Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Thomas W. Anthony
Tom Eccles
Andrea Tacchetti
János Kramár
I. Gemp
...
Richard Everett
Roman Werpachowski
Satinder Singh
T. Graepel
Yoram Bachrach
99
43
0
08 Jun 2020
Discovering Imperfectly Observable Adversarial Actions using Anomaly Detection
Olga Petrova
K. Durkota
Galina Alperovich
Karel Horak
Michal Najman
B. Bosanský
Viliam Lisý
AAML
13
1
0
22 Apr 2020
From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization
Julien Perolat
Rémi Munos
Jean-Baptiste Lespiau
Shayegan Omidshafiei
Mark Rowland
...
David Balduzzi
Bart De Vylder
Georgios Piliouras
Marc Lanctot
K. Tuyls
75
85
0
19 Feb 2020
Alternative Function Approximation Parameterizations for Solving Games: An Analysis of f-Regression Counterfactual Regret Minimization
Ryan D'Orazio
Dustin Morrill
J. R. Wright
Michael Bowling
92
11
0
06 Dec 2019
Improving Fictitious Play Reinforcement Learning with Expanding Models
Rongjun Qin
Jing-Cheng Pang
Yang Yu
21
1
0
27 Nov 2019
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Kai Zhang
Zhuoran Yang
Tamer Basar
231
1,228
0
24 Nov 2019
The αμ Search Algorithm for the Game of Bridge
Tristan Cazenave
V. Ventos
21
10
0
18 Nov 2019
Bounds for Approximate Regret-Matching Algorithms
Scott Fujimoto
Dustin Morrill
J. R. Wright
72
3
0
03 Oct 2019
OpenSpiel: A Framework for Reinforcement Learning in Games
Marc Lanctot
Edward Lockhart
Jean-Baptiste Lespiau
V. Zambaldi
Satyaki Upadhyay
...
Julian Schrittwieser
Thomas W. Anthony
Edward Hughes
Ivo Danihelka
Jonah Ryan-Davis
OffRL
133
254
0
26 Aug 2019
Neural Replicator Dynamics
Daniel Hennes
Dustin Morrill
Shayegan Omidshafiei
Rémi Munos
Julien Perolat
...
A. Gruslys
Jean-Baptiste Lespiau
Paavo Parmas
Edgar A. Duénez-Guzmán
K. Tuyls
74
16
0
01 Jun 2019
Double Neural Counterfactual Regret Minimization
Hui Li
Kailiang Hu
Zhibang Ge
Tao Jiang
Yuan Qi
Le Song
71
52
0
27 Dec 2018
A Survey and Critique of Multiagent Deep Reinforcement Learning
Pablo Hernandez-Leal
Bilal Kartal
Matthew E. Taylor
OffRL
119
569
0
12 Oct 2018
DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker
Matej Moravcík
Martin Schmid
Neil Burch
Viliam Lisý
Dustin Morrill
Nolan Bard
Trevor Davis
Kevin Waugh
Michael Bradley Johanson
Michael Bowling
BDL
261
913
0
06 Jan 2017