Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2103.06426
Cited By
XDO: A Double Oracle Algorithm for Extensive-Form Games
11 March 2021
Stephen Marcus McAleer
John Lanier
Kevin A. Wang
Pierre Baldi
Roy Fox
Re-assign community
ArXiv
PDF
HTML
Papers citing
"XDO: A Double Oracle Algorithm for Extensive-Form Games"
8 / 8 papers shown
Title
A Survey on Self-play Methods in Reinforcement Learning
Ruize Zhang
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDa
SSL
OnRL
46
8
0
02 Aug 2024
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Yang Li
Kun Xiong
Yingping Zhang
Jiangcheng Zhu
Stephen Marcus McAleer
Wei Pan
J. Wang
Zonghong Dai
Yaodong Yang
24
2
0
09 Aug 2023
A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games
Zihan Ding
DiJia Su
Qinghua Liu
Chi Jin
30
3
0
18 Jul 2022
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Stephen Marcus McAleer
JB Lanier
Kevin A. Wang
Pierre Baldi
Roy Fox
T. Sandholm
27
18
0
13 Jul 2022
Offline Equilibrium Finding
Shuxin Li
Xinrun Wang
Youzhi Zhang
Jakub Cerny
Pengdeng Li
Hau Chan
Bo An
OffRL
41
2
0
12 Jul 2022
Anytime PSRO for Two-Player Zero-Sum Games
Stephen Marcus McAleer
Kevin A. Wang
John Lanier
Marc Lanctot
Pierre Baldi
T. Sandholm
Roy Fox
22
12
0
19 Jan 2022
Independent Natural Policy Gradient Always Converges in Markov Potential Games
Roy Fox
Stephen Marcus McAleer
W. Overman
Ioannis Panageas
24
49
0
20 Oct 2021
Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers
Luke Marris
Paul Muller
Marc Lanctot
K. Tuyls
T. Graepel
35
36
0
17 Jun 2021
1