ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2201.07700
  4. Cited By
Anytime PSRO for Two-Player Zero-Sum Games
v1v2 (latest)

Anytime PSRO for Two-Player Zero-Sum Games

19 January 2022
Alexander Shmakov
Kevin A. Wang
John Lanier
Marc Lanctot
Pierre Baldi
Tuomas Sandholm
Roy Fox
ArXiv (abs)PDFHTMLGithub

Papers citing "Anytime PSRO for Two-Player Zero-Sum Games"

8 / 8 papers shown
Generative Evolutionary Meta-Solver (GEMS): Scalable Surrogate-Free Multi-Agent Reinforcement Learning
Generative Evolutionary Meta-Solver (GEMS): Scalable Surrogate-Free Multi-Agent Reinforcement Learning
Alakh Sharma
Gaurish Trivedi
Kartikey Singh Bhandari
Yash Sinha
Dhruv Kumar
Pratik Narang
Jagat Sesh Challa
146
0
0
27 Sep 2025
Robust Multi-Objective Controlled Decoding of Large Language Models
Robust Multi-Objective Controlled Decoding of Large Language Models
Seongho Son
William Bankes
Sangwoong Yoon
Shyam Sundhar Ramesh
Xiaohang Tang
Ilija Bogunovic
454
8
0
11 Mar 2025
A Survey on Self-play Methods in Reinforcement Learning
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Wenbo Ding
Yu Wang
Yu Wang
SyDaSSLOnRL
770
31
0
02 Aug 2024
Fictitious Cross-Play: Learning Global Nash Equilibrium in Mixed
  Cooperative-Competitive Games
Fictitious Cross-Play: Learning Global Nash Equilibrium in Mixed Cooperative-Competitive GamesAdaptive Agents and Multi-Agent Systems (AAMAS), 2023
Zelai Xu
Yancheng Liang
Chao Yu
Yu Wang
Yi Wu
331
12
0
05 Oct 2023
Composing Efficient, Robust Tests for Policy Selection
Composing Efficient, Robust Tests for Policy SelectionConference on Uncertainty in Artificial Intelligence (UAI), 2023
Dustin Morrill
Thomas J. Walsh
D. Hernández
Peter R. Wurman
Peter Stone
204
1
0
12 Jun 2023
ApproxED: Approximate exploitability descent via learned best responses
ApproxED: Approximate exploitability descent via learned best responsesAdaptive Agents and Multi-Agent Systems (AAMAS), 2023
Carlos Martin
Tuomas Sandholm
493
2
0
20 Jan 2023
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Alexander Shmakov
JB Lanier
Kevin A. Wang
Pierre Baldi
Roy Fox
Tuomas Sandholm
296
23
0
13 Jul 2022
Simplex Neural Population Learning: Any-Mixture Bayes-Optimality in
  Symmetric Zero-sum Games
Simplex Neural Population Learning: Any-Mixture Bayes-Optimality in Symmetric Zero-sum GamesInternational Conference on Machine Learning (ICML), 2022
Siqi Liu
Marc Lanctot
Luke Marris
N. Heess
MLT
1.0K
12
0
31 May 2022
1
Page 1 of 1