Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games

15 June 2020

Papers citing "Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games"

Nash Policy Gradient: A Policy Gradient Method with Iteratively Refined Regularization for Finding Nash Equilibria

207

21 Oct 2025

Generative Evolutionary Meta-Solver (GEMS): Scalable Surrogate-Free Multi-Agent Reinforcement Learning

Alakh Sharma

Gaurish Trivedi

Kartikey Singh Bhandari

147

27 Sep 2025

PoolFlip: A Multi-Agent Reinforcement Learning Security Environment for Cyber Defense

Xavier Cadet

Simona Boboila

Sie Hendrata Dharmawan

Alina Oprea

Peter Chin

AAML

139

27 Aug 2025

Computing Ex Ante Equilibrium in Heterogeneous Zero-Sum Team Games

Naming Liu

Mingzhi Wang

Xihuai Wang

Weinan Zhang

Yaodong Yang

Youzhi Zhang

Bo An

Ying Wen

267

02 Oct 2024

A Survey on Self-play Methods in Reinforcement Learning

...

770

02 Aug 2024

Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles

711

31 May 2024

Attaining Human's Desirable Outcomes in Human-AI Interaction via Structural Causal Games

Jianhong Wang

318

26 May 2024

Self-adaptive PSRO: Towards an Automatic Population-based Game Solver

233

17 Apr 2024

Bridging the Gap between Discrete Agent Strategies in Game Theory and Continuous Motion Planning in Dynamic Environments

388

17 Mar 2024

Policy Space Response Oracles: A Survey

441

04 Mar 2024

Neural Population Learning beyond Symmetric Zero-sum Games

Adaptive Agents and Multi-Agent Systems (AAMAS), 2024

307

10 Jan 2024

JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games

Wei Pan

351

09 Aug 2023

Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations

Furong Huang

262

22 Jul 2023

Policy Space Diversity for Non-Transitive Games

Neural Information Processing Systems (NeurIPS), 2023

346

29 Jun 2023

Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination

Journal of Artificial Intelligence Research (JAIR), 2023

Shao Zhang

Wenhao Zhang

Xinbing Wang

Wei Pan

437

05 Jun 2023

Networked Communication for Decentralised Agents in Mean-Field Games

Patrick Benjamin

Alessandro Abate

FedML

570

05 Jun 2023

Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL

International Conference on Learning Representations (ICLR), 2023

Furong Huang

396

27 May 2023

Learning Diverse Risk Preferences in Population-based Self-play

AAAI Conference on Artificial Intelligence (AAAI), 2023

470

19 May 2023

An Empirical Study on Google Research Football Multi-agent Scenarios

Machine Intelligence Research (MIR), 2023

Yan Song

296

16 May 2023

Cooperative Open-ended Learning Framework for Zero-shot Coordination

International Conference on Machine Learning (ICML), 2023

Shao Zhang

Xinbing Wang

Wei Pan

582

09 Feb 2023

Combining Tree-Search, Generative Models, and Nash Bargaining Concepts in Game-Theoretic Reinforcement Learning

International Joint Conference on Artificial Intelligence (IJCAI), 2023

386

01 Feb 2023

Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox

Machine Intelligence Research (MIR), 2022

Kaiqi Huang

Bin Liang

Liangsheng Wang

OffRL

261

01 Dec 2022

Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games

296

13 Jul 2022

Offline Equilibrium Finding

377

12 Jul 2022

ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret

International Conference on Learning Representations (ICLR), 2022

461

08 Jun 2022

A Game-Theoretic Framework for Managing Risk in Multi-Agent Systems

International Conference on Machine Learning (ICML), 2022

Jun Wang

417

30 May 2022

NeuPL: Neural Population Learning

International Conference on Learning Representations (ICLR), 2022

289

15 Feb 2022

Efficient Policy Space Response Oracles

Jingxiao Chen

Yong Yu

360

28 Jan 2022

Anytime PSRO for Two-Player Zero-Sum Games

396

19 Jan 2022

A Game-Theoretic Approach for Improving Generalization Ability of TSP Solvers

Congying Han

Jun Wang

285

28 Oct 2021

Independent Natural Policy Gradient Always Converges in Markov Potential Games

International Conference on Artificial Intelligence and Statistics (AISTATS), 2021

257

20 Oct 2021

No-Press Diplomacy from Scratch

288

06 Oct 2021

Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers

International Conference on Machine Learning (ICML), 2021

510

17 Jun 2021

Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games

Changjie Fan

228

09 Jun 2021

Improving Social Welfare While Preserving Autonomy via a Pareto Mediator

258

07 Jun 2021

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning

Journal of Machine Learning Research (JMLR), 2021

Muning Wen

Jun Wang

223

05 Jun 2021

Neural Auto-Curricula

Jun Wang

303

04 Jun 2021

Iterative Empirical Game Solving via Single Policy Best Response

International Conference on Learning Representations (ICLR), 2021

Max O. Smith

Thomas W. Anthony

Michael P. Wellman

268

03 Jun 2021

Modelling Behavioural Diversity for Learning in Open-Ended Games

International Conference on Machine Learning (ICML), 2021

Jun Wang

373

14 Mar 2021

Jun Wang

570

13 Mar 2021

XDO: A Double Oracle Algorithm for Extensive-Form Games

Neural Information Processing Systems (NeurIPS), 2021

252

11 Mar 2021

Fictitious Play for Mean Field Games: Continuous Time Analysis and Applications

Sarah Perrin

Julien Perolat

Mathieu Laurière

Matthieu Geist

Romuald Elie

Olivier Pietquin

390

138

05 Jul 2020