2006.08555
Cited By
Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
15 June 2020
Stephen Marcus McAleer
John Lanier
Roy Fox
Pierre Baldi
Papers citing
"Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games"
13 / 13 papers shown
Title
A Survey on Self-play Methods in Reinforcement Learning
Ruize Zhang
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
SyDa
SSL
OnRL
44
8
0
02 Aug 2024
Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles
Jiesong Lian
Yucong Huang
Chengdong Ma
Mingzhi Wang
Ying Wen
Long Hu
Yixue Hao
57
0
0
31 May 2024
Bridging the Gap between Discrete Agent Strategies in Game Theory and Continuous Motion Planning in Dynamic Environments
Hongrui Zheng
Zhijun Zhuang
Stephanie Wu
Shuo Yang
Rahul Mangharam
30
1
0
17 Mar 2024
Networked Communication for Decentralised Agents in Mean-Field Games
Patrick Benjamin
Alessandro Abate
FedML
38
2
0
05 Jun 2023
Cooperative Open-ended Learning Framework for Zero-shot Coordination
Yang Li
Shao Zhang
Jichen Sun
Yali Du
Ying Wen
Xinbing Wang
Wei Pan
24
21
0
09 Feb 2023
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox
Qiyue Yin
Tongtong Yu
S. Shen
Jun Yang
Meijing Zhao
Kaiqi Huang
Bin Liang
Liangsheng Wang
OffRL
11
13
0
01 Dec 2022
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Stephen Marcus McAleer
JB Lanier
Kevin A. Wang
Pierre Baldi
Roy Fox
T. Sandholm
19
18
0
13 Jul 2022
Offline Equilibrium Finding
Shuxin Li
Xinrun Wang
Youzhi Zhang
Jakub Cerny
Pengdeng Li
Hau Chan
Bo An
OffRL
39
2
0
12 Jul 2022
NeuPL: Neural Population Learning
Siqi Liu
Luke Marris
Daniel Hennes
J. Merel
N. Heess
T. Graepel
28
17
0
15 Feb 2022
Anytime PSRO for Two-Player Zero-Sum Games
Stephen Marcus McAleer
Kevin A. Wang
John Lanier
Marc Lanctot
Pierre Baldi
T. Sandholm
Roy Fox
14
12
0
19 Jan 2022
Independent Natural Policy Gradient Always Converges in Markov Potential Games
Roy Fox
Stephen Marcus McAleer
W. Overman
Ioannis Panageas
24
49
0
20 Oct 2021
Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers
Luke Marris
Paul Muller
Marc Lanctot
K. Tuyls
T. Graepel
25
36
0
17 Jun 2021
Soft Actor-Critic for Discrete Action Settings
Petros Christodoulou
OffRL
104
292
0
16 Oct 2019