ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.06426
  4. Cited By
XDO: A Double Oracle Algorithm for Extensive-Form Games

XDO: A Double Oracle Algorithm for Extensive-Form Games

11 March 2021
Stephen Marcus McAleer
John Lanier
Kevin A. Wang
Pierre Baldi
Roy Fox
ArXivPDFHTML

Papers citing "XDO: A Double Oracle Algorithm for Extensive-Form Games"

8 / 8 papers shown
Title
A Survey on Self-play Methods in Reinforcement Learning
A Survey on Self-play Methods in Reinforcement Learning
Ruize Zhang
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDa
SSL
OnRL
46
8
0
02 Aug 2024
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player
  Zero-Sum Games
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Yang Li
Kun Xiong
Yingping Zhang
Jiangcheng Zhu
Stephen Marcus McAleer
Wei Pan
J. Wang
Zonghong Dai
Yaodong Yang
24
2
0
09 Aug 2023
A Deep Reinforcement Learning Approach for Finding Non-Exploitable
  Strategies in Two-Player Atari Games
A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games
Zihan Ding
DiJia Su
Qinghua Liu
Chi Jin
30
3
0
18 Jul 2022
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Stephen Marcus McAleer
JB Lanier
Kevin A. Wang
Pierre Baldi
Roy Fox
T. Sandholm
27
18
0
13 Jul 2022
Offline Equilibrium Finding
Offline Equilibrium Finding
Shuxin Li
Xinrun Wang
Youzhi Zhang
Jakub Cerny
Pengdeng Li
Hau Chan
Bo An
OffRL
41
2
0
12 Jul 2022
Anytime PSRO for Two-Player Zero-Sum Games
Anytime PSRO for Two-Player Zero-Sum Games
Stephen Marcus McAleer
Kevin A. Wang
John Lanier
Marc Lanctot
Pierre Baldi
T. Sandholm
Roy Fox
22
12
0
19 Jan 2022
Independent Natural Policy Gradient Always Converges in Markov Potential
  Games
Independent Natural Policy Gradient Always Converges in Markov Potential Games
Roy Fox
Stephen Marcus McAleer
W. Overman
Ioannis Panageas
24
49
0
20 Oct 2021
Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium
  Meta-Solvers
Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers
Luke Marris
Paul Muller
Marc Lanctot
K. Tuyls
T. Graepel
35
36
0
17 Jun 2021
1