Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2006.08555
Cited By
v1
v2 (latest)
Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
15 June 2020
Alexander Shmakov
John Lanier
Roy Fox
Pierre Baldi
Re-assign community
ArXiv (abs)
PDF
HTML
Github (51★)
Papers citing
"Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games"
42 / 42 papers shown
Nash Policy Gradient: A Policy Gradient Method with Iteratively Refined Regularization for Finding Nash Equilibria
Eason Yu
Tzu Hao Liu
Yunke Wang
Clément L. Canonne
Nguyen H. Tran
Chang Xu
207
0
0
21 Oct 2025
Generative Evolutionary Meta-Solver (GEMS): Scalable Surrogate-Free Multi-Agent Reinforcement Learning
Alakh Sharma
Gaurish Trivedi
Kartikey Singh Bhandari
Yash Sinha
Dhruv Kumar
Pratik Narang
Jagat Sesh Challa
147
0
0
27 Sep 2025
PoolFlip: A Multi-Agent Reinforcement Learning Security Environment for Cyber Defense
Xavier Cadet
Simona Boboila
Sie Hendrata Dharmawan
Alina Oprea
Peter Chin
AAML
139
1
0
27 Aug 2025
Computing Ex Ante Equilibrium in Heterogeneous Zero-Sum Team Games
Naming Liu
Mingzhi Wang
Xihuai Wang
Weinan Zhang
Yaodong Yang
Youzhi Zhang
Bo An
Ying Wen
267
1
0
02 Oct 2024
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Wenbo Ding
Yu Wang
Yu Wang
SyDa
SSL
OnRL
770
31
0
02 Aug 2024
Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles
Jiesong Lian
Yucong Huang
Chengdong Ma
Mingzhi Wang
Ying Wen
Long Hu
Yixue Hao
711
5
0
31 May 2024
Attaining Human`s Desirable Outcomes in Human-AI Interaction via Structural Causal Games
Anjie Liu
Jianhong Wang
Haoxuan Li
Xu Chen
Jun Wang
Samuel Kaski
Mengyue Yang
318
0
0
26 May 2024
Self-adaptive PSRO: Towards an Automatic Population-based Game Solver
Pengdeng Li
Shuxin Li
Chang Yang
Xinrun Wang
Yi-Ju Chang
Hau Chan
Bo An
233
3
0
17 Apr 2024
Bridging the Gap between Discrete Agent Strategies in Game Theory and Continuous Motion Planning in Dynamic Environments
Hongrui Zheng
Zhijun Zhuang
Stephanie Wu
Shuo Yang
Rahul Mangharam
388
3
0
17 Mar 2024
Policy Space Response Oracles: A Survey
Ariyan Bighashdel
Yongzhao Wang
Alexander Shmakov
Rahul Savani
F. Oliehoek
441
18
0
04 Mar 2024
Neural Population Learning beyond Symmetric Zero-sum Games
Adaptive Agents and Multi-Agent Systems (AAMAS), 2024
Siqi Liu
Luke Marris
Marc Lanctot
Georgios Piliouras
Joel Z Leibo
N. Heess
MLT
307
6
0
10 Jan 2024
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Yang Li
Kun Xiong
Yingping Zhang
Jiangcheng Zhu
Alexander Shmakov
Wei Pan
Jun Wang
Zonghong Dai
Yaodong Yang
351
3
0
09 Aug 2023
Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations
Yongyuan Liang
Yanchao Sun
Ruijie Zheng
Xiangyu Liu
Benjamin Eysenbach
Tuomas Sandholm
Furong Huang
Alexander Shmakov
OOD
262
0
0
22 Jul 2023
Policy Space Diversity for Non-Transitive Games
Neural Information Processing Systems (NeurIPS), 2023
Jian Yao
Weiming Liu
Haobo Fu
Yaodong Yang
Alexander Shmakov
Qiang Fu
Wei Yang
346
23
0
29 Jun 2023
Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination
Journal of Artificial Intelligence Research (JAIR), 2023
Yang Li
Shao Zhang
Jichen Sun
Wenhao Zhang
Yali Du
Ying Wen
Xinbing Wang
Wei Pan
437
25
0
05 Jun 2023
Networked Communication for Decentralised Agents in Mean-Field Games
Patrick Benjamin
Alessandro Abate
FedML
570
2
0
05 Jun 2023
Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL
International Conference on Learning Representations (ICLR), 2023
Xiangyu Liu
Souradip Chakraborty
Yanchao Sun
Furong Huang
AAML
396
10
0
27 May 2023
Learning Diverse Risk Preferences in Population-based Self-play
AAAI Conference on Artificial Intelligence (AAAI), 2023
Y. Jiang
Qihan Liu
Xiaoteng Ma
Chenghao Li
Yiqin Yang
Jun Yang
Bin Liang
Qianchuan Zhao
470
8
0
19 May 2023
An Empirical Study on Google Research Football Multi-agent Scenarios
Machine Intelligence Research (MIR), 2023
Yan Song
He Jiang
Zheng Tian
Haifeng Zhang
Yingping Zhang
Jiangcheng Zhu
Zonghong Dai
Weinan Zhang
Jun Wang
296
10
0
16 May 2023
Cooperative Open-ended Learning Framework for Zero-shot Coordination
International Conference on Machine Learning (ICML), 2023
Yang Li
Shao Zhang
Jichen Sun
Yali Du
Ying Wen
Xinbing Wang
Wei Pan
582
35
0
09 Feb 2023
Combining Tree-Search, Generative Models, and Nash Bargaining Concepts in Game-Theoretic Reinforcement Learning
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Zun Li
Marc Lanctot
Kevin R. McKee
Luke Marris
I. Gemp
Daniel Hennes
Paul Muller
Kate Larson
Yoram Bachrach
Michael P. Wellman
386
11
0
01 Feb 2023
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox
Machine Intelligence Research (MIR), 2022
Qiyue Yin
Tongtong Yu
S. Shen
Jun Yang
Meijing Zhao
Kaiqi Huang
Bin Liang
Liangsheng Wang
OffRL
261
34
0
01 Dec 2022
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Alexander Shmakov
JB Lanier
Kevin A. Wang
Pierre Baldi
Roy Fox
Tuomas Sandholm
296
23
0
13 Jul 2022
Offline Equilibrium Finding
Shuxin Li
Xinrun Wang
Youzhi Zhang
Jakub Cerny
Pengdeng Li
Hau Chan
Bo An
OffRL
377
3
0
12 Jul 2022
ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret
International Conference on Learning Representations (ICLR), 2022
Alexander Shmakov
Gabriele Farina
Marc Lanctot
Tuomas Sandholm
461
31
0
08 Jun 2022
A Game-Theoretic Framework for Managing Risk in Multi-Agent Systems
International Conference on Machine Learning (ICML), 2022
Oliver Slumbers
D. Mguni
Alexander Shmakov
Stefano B. Blumberg
Jun Wang
Yaodong Yang
417
25
0
30 May 2022
NeuPL: Neural Population Learning
International Conference on Learning Representations (ICLR), 2022
Siqi Liu
Luke Marris
Daniel Hennes
J. Merel
N. Heess
T. Graepel
289
19
0
15 Feb 2022
Efficient Policy Space Response Oracles
Ming Zhou
Jingxiao Chen
Ying Wen
Weinan Zhang
Yaodong Yang
Yong Yu
Jun Wang
360
13
0
28 Jan 2022
Anytime PSRO for Two-Player Zero-Sum Games
Alexander Shmakov
Kevin A. Wang
John Lanier
Marc Lanctot
Pierre Baldi
Tuomas Sandholm
Roy Fox
396
18
0
19 Jan 2022
A Game-Theoretic Approach for Improving Generalization Ability of TSP Solvers
Chenguang Wang
Yaodong Yang
Oliver Slumbers
Congying Han
Tiande Guo
Haifeng Zhang
Jun Wang
285
20
0
28 Oct 2021
Independent Natural Policy Gradient Always Converges in Markov Potential Games
International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Roy Fox
Alexander Shmakov
W. Overman
Ioannis Panageas
257
57
0
20 Oct 2021
No-Press Diplomacy from Scratch
A. Bakhtin
David J. Wu
Adam Lerer
Noam Brown
288
49
0
06 Oct 2021
Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers
International Conference on Machine Learning (ICML), 2021
Luke Marris
Paul Muller
Marc Lanctot
K. Tuyls
T. Graepel
510
42
0
17 Jun 2021
Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games
Xiangyu Liu
Hangtian Jia
Ying Wen
Yaodong Yang
Yujing Hu
Yingfeng Chen
Changjie Fan
Zhipeng Hu
228
20
0
09 Jun 2021
Improving Social Welfare While Preserving Autonomy via a Pareto Mediator
Alexander Shmakov
John Lanier
Michael Dennis
Pierre Baldi
Roy Fox
258
5
0
07 Jun 2021
MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning
Journal of machine learning research (JMLR), 2021
Ming Zhou
Bo Liu
Hanjing Wang
Muning Wen
Runzhe Wu
Ying Wen
Yaodong Yang
Weinan Zhang
Jun Wang
OffRL
223
56
0
05 Jun 2021
Neural Auto-Curricula
Xidong Feng
Oliver Slumbers
Bo Liu
Bo Liu
Alexander Shmakov
Ying Wen
Jun Wang
Yaodong Yang
303
4
0
04 Jun 2021
Iterative Empirical Game Solving via Single Policy Best Response
International Conference on Learning Representations (ICLR), 2021
Max O. Smith
Thomas W. Anthony
Michael P. Wellman
268
23
0
03 Jun 2021
Modelling Behavioural Diversity for Learning in Open-Ended Games
International Conference on Machine Learning (ICML), 2021
Nicolas Perez Nieves
Yaodong Yang
Oliver Slumbers
D. Mguni
Ying Wen
Jun Wang
373
79
0
14 Mar 2021
Online Double Oracle
Le Cong Dinh
Yaodong Yang
Alexander Shmakov
Zheng Tian
Nicolas Perez Nieves
Oliver Slumbers
D. Mguni
Haitham Bou-Ammar
Jun Wang
570
35
0
13 Mar 2021
XDO: A Double Oracle Algorithm for Extensive-Form Games
Neural Information Processing Systems (NeurIPS), 2021
Alexander Shmakov
John Lanier
Kevin A. Wang
Pierre Baldi
Roy Fox
252
58
0
11 Mar 2021
Fictitious Play for Mean Field Games: Continuous Time Analysis and Applications
Sarah Perrin
Julien Perolat
Mathieu Laurière
Matthieu Geist
Romuald Elie
Olivier Pietquin
390
138
0
05 Jul 2020
1
Page 1 of 1