ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1901.08106
  4. Cited By
Open-ended Learning in Symmetric Zero-sum Games

Open-ended Learning in Symmetric Zero-sum Games

23 January 2019
David Balduzzi
M. Garnelo
Yoram Bachrach
Wojciech M. Czarnecki
Julien Perolat
Max Jaderberg
T. Graepel
ArXivPDFHTML

Papers citing "Open-ended Learning in Symmetric Zero-sum Games"

47 / 47 papers shown
Title
Enhancing Aerial Combat Tactics through Hierarchical Multi-Agent Reinforcement Learning
Enhancing Aerial Combat Tactics through Hierarchical Multi-Agent Reinforcement Learning
Ardian Selmonaj
Oleg Szehr
Giacomo Del Rio
Alessandro Antonucci
Adrian Schneider
Michael Rüegsegger
29
0
0
13 May 2025
Investigating Non-Transitivity in LLM-as-a-Judge
Investigating Non-Transitivity in LLM-as-a-Judge
Yi Xu
Laura Ruis
Tim Rocktaschel
Robert Kirk
43
0
0
19 Feb 2025
A Survey on Self-play Methods in Reinforcement Learning
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDa
SSL
OnRL
60
8
0
02 Aug 2024
Efficient Adaptation in Mixed-Motive Environments via Hierarchical
  Opponent Modeling and Planning
Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning
Yizhe Huang
Guy Van den Broeck
Fanqi Kong
Yaodong Yang
Song-Chun Zhu
Xue Feng
39
3
0
12 Jun 2024
Carbon Market Simulation with Adaptive Mechanism Design
Carbon Market Simulation with Adaptive Mechanism Design
Han Wang
Wenhao Li
Hongyuan Zha
Baoxiang Wang
35
3
0
12 Jun 2024
Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles
Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles
Jiesong Lian
Yucong Huang
Chengdong Ma
Mingzhi Wang
Ying Wen
Long Hu
Yixue Hao
65
0
0
31 May 2024
A social path to human-like artificial intelligence
A social path to human-like artificial intelligence
Edgar A. Duénez-Guzmán
Suzanne Sadedin
Jane X. Wang
Kevin R. McKee
Joel Z Leibo
GNN
31
28
0
22 May 2024
Bridging the Gap between Discrete Agent Strategies in Game Theory and
  Continuous Motion Planning in Dynamic Environments
Bridging the Gap between Discrete Agent Strategies in Game Theory and Continuous Motion Planning in Dynamic Environments
Hongrui Zheng
Zhijun Zhuang
Stephanie Wu
Shuo Yang
Rahul Mangharam
30
1
0
17 Mar 2024
Building Open-Ended Embodied Agent via Language-Policy Bidirectional
  Adaptation
Building Open-Ended Embodied Agent via Language-Policy Bidirectional Adaptation
Shaopeng Zhai
Jie Wang
Tianyi Zhang
Fuxian Huang
Qi Zhang
Ming Zhou
Jing Hou
Yu Qiao
Yu Liu
LLMAG
LM&Ro
37
1
0
12 Dec 2023
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player
  Zero-Sum Games
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Yang Li
Kun Xiong
Yingping Zhang
Jiangcheng Zhu
Stephen Marcus McAleer
Wei Pan
Jun Wang
Zonghong Dai
Yaodong Yang
39
2
0
09 Aug 2023
Robust Driving Policy Learning with Guided Meta Reinforcement Learning
Robust Driving Policy Learning with Guided Meta Reinforcement Learning
Kanghoon Lee
Jiachen Li
David Isele
Jinkyoo Park
K. Fujimura
Mykel J. Kochenderfer
29
5
0
19 Jul 2023
Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination
Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination
Yang Li
Shao Zhang
Jichen Sun
Wenhao Zhang
Yali Du
Ying Wen
Xinbing Wang
Wei Pan
32
13
0
05 Jun 2023
Heterogeneous Social Value Orientation Leads to Meaningful Diversity in
  Sequential Social Dilemmas
Heterogeneous Social Value Orientation Leads to Meaningful Diversity in Sequential Social Dilemmas
Udari Madhushani
Kevin R. McKee
J. Agapiou
Joel Z Leibo
Richard Everett
Thomas W. Anthony
Edward Hughes
K. Tuyls
Edgar A. Duénez-Guzmán
46
2
0
01 May 2023
Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement
  Learning
Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning
Tuomas Haarnoja
Ben Moran
Guy Lever
Sandy H. Huang
Dhruva Tirumala
...
Andrea Huber
N. Hurley
F. Nori
R. Hadsell
N. Heess
50
143
0
26 Apr 2023
Mastering Asymmetrical Multiplayer Game with Multi-Agent
  Asymmetric-Evolution Reinforcement Learning
Mastering Asymmetrical Multiplayer Game with Multi-Agent Asymmetric-Evolution Reinforcement Learning
Chenglu Sun
Yi-cui Zhang
Yu Zhang
Ziling Lu
Jingbin Liu
Si-Qi Xu
Weidong Zhang
27
0
0
20 Apr 2023
TiZero: Mastering Multi-Agent Football with Curriculum Learning and
  Self-Play
TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play
Fanqing Lin
Shiyu Huang
Tim Pearce
Wenze Chen
Weijuan Tu
26
17
0
15 Feb 2023
Cooperative Open-ended Learning Framework for Zero-shot Coordination
Cooperative Open-ended Learning Framework for Zero-shot Coordination
Yang Li
Shao Zhang
Jichen Sun
Yali Du
Ying Wen
Xinbing Wang
Wei Pan
32
22
0
09 Feb 2023
Learning Representations that Enable Generalization in Assistive Tasks
Learning Representations that Enable Generalization in Assistive Tasks
Jerry Zhi-Yang He
Aditi Raghunathan
Daniel S. Brown
Zackory M. Erickson
Anca Dragan
OOD
39
20
0
05 Dec 2022
Melting Pot 2.0
Melting Pot 2.0
J. Agapiou
A. Vezhnevets
Edgar A. Duénez-Guzmán
Jayd Matyas
Yiran Mao
...
Sukhdeep Singh
Julia Haas
Igor Mordatch
D. Mobbs
Joel Z Leibo
45
31
0
24 Nov 2022
Adversarial Policies Beat Superhuman Go AIs
Adversarial Policies Beat Superhuman Go AIs
T. T. Wang
Adam Gleave
Tom Tseng
Kellin Pelrine
Nora Belrose
...
Michael Dennis
Yawen Duan
V. Pogrebniak
Sergey Levine
Stuart Russell
AAML
13
21
0
01 Nov 2022
Towards Multi-Agent Reinforcement Learning driven Over-The-Counter
  Market Simulations
Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations
N. Vadori
Leo Ardon
Sumitra Ganesh
Thomas Spooner
Selim Amrouni
Jared Vann
Mengda Xu
Zeyu Zheng
T. Balch
Manuela Veloso
18
16
0
13 Oct 2022
Multi-AI Complex Systems in Humanitarian Response
Multi-AI Complex Systems in Humanitarian Response
Joseph Aylett-Bullock
M. Luengo-Oroz
21
0
0
24 Aug 2022
Revisiting Some Common Practices in Cooperative Multi-Agent
  Reinforcement Learning
Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning
Wei Fu
Chao Yu
Zelai Xu
Jiaqi Yang
Yi Wu
34
32
0
15 Jun 2022
NeuPL: Neural Population Learning
NeuPL: Neural Population Learning
Siqi Liu
Luke Marris
Daniel Hennes
J. Merel
N. Heess
T. Graepel
35
17
0
15 Feb 2022
Anytime PSRO for Two-Player Zero-Sum Games
Anytime PSRO for Two-Player Zero-Sum Games
Stephen Marcus McAleer
Kevin A. Wang
John Lanier
Marc Lanctot
Pierre Baldi
T. Sandholm
Roy Fox
24
12
0
19 Jan 2022
Maximum Entropy Population-Based Training for Zero-Shot Human-AI
  Coordination
Maximum Entropy Population-Based Training for Zero-Shot Human-AI Coordination
Rui Zhao
Jinming Song
Yufeng Yuan
Haifeng Hu
Yang Gao
Yi Wu
Zhongqian Sun
Yang Wei
32
63
0
22 Dec 2021
Which priors matter? Benchmarking models for learning latent dynamics
Which priors matter? Benchmarking models for learning latent dynamics
Aleksandar Botev
Andrew Jaegle
Peter Wirnsberger
Daniel Hennes
I. Higgins
AI4CE
38
28
0
09 Nov 2021
A Game-Theoretic Approach for Improving Generalization Ability of TSP
  Solvers
A Game-Theoretic Approach for Improving Generalization Ability of TSP Solvers
Chenguang Wang
Yaodong Yang
Oliver Slumbers
Congying Han
Tiande Guo
Haifeng Zhang
Jun Wang
24
17
0
28 Oct 2021
Measuring the Non-Transitivity in Chess
Measuring the Non-Transitivity in Chess
R. Sanjaya
Jun Wang
Yaodong Yang
19
22
0
22 Oct 2021
Pick Your Battles: Interaction Graphs as Population-Level Objectives for
  Strategic Diversity
Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity
M. Garnelo
Wojciech M. Czarnecki
Siqi Liu
Dhruva Tirumala
Junhyuk Oh
Gauthier Gidel
H. V. Hasselt
David Balduzzi
34
25
0
08 Oct 2021
Open-Ended Learning Leads to Generally Capable Agents
Open-Ended Learning Leads to Generally Capable Agents
Open-Ended Learning Team
Adam Stooke
Anuj Mahajan
Catarina Barros
Charlie Deck
...
Nicolas Porcel
Roberta Raileanu
Steph Hughes-Fitt
Valentin Dalibard
Wojciech M. Czarnecki
55
181
0
27 Jul 2021
Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium
  Meta-Solvers
Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers
Luke Marris
Paul Muller
Marc Lanctot
K. Tuyls
T. Graepel
37
36
0
17 Jun 2021
A Game-Theoretic Approach to Multi-Agent Trust Region Optimization
A Game-Theoretic Approach to Multi-Agent Trust Region Optimization
Ying Wen
Hui Chen
Yaodong Yang
Zheng Tian
Minne Li
Xu Chen
Jun Wang
38
11
0
12 Jun 2021
From Motor Control to Team Play in Simulated Humanoid Football
From Motor Control to Team Play in Simulated Humanoid Football
Siqi Liu
Guy Lever
Zhe Wang
J. Merel
S. M. Ali Eslami
...
Tuomas Haarnoja
Brendan D. Tracey
K. Tuyls
T. Graepel
N. Heess
31
130
0
25 May 2021
Modelling Behavioural Diversity for Learning in Open-Ended Games
Modelling Behavioural Diversity for Learning in Open-Ended Games
Nicolas Perez Nieves
Yaodong Yang
Oliver Slumbers
D. Mguni
Ying Wen
Jun Wang
22
67
0
14 Mar 2021
Evolutionary Game Theory Squared: Evolving Agents in Endogenously
  Evolving Zero-Sum Games
Evolutionary Game Theory Squared: Evolving Agents in Endogenously Evolving Zero-Sum Games
Stratis Skoulakis
Tanner Fiez
Ryan Sim
Georgios Piliouras
Lillian J. Ratliff
18
13
0
15 Dec 2020
TLeague: A Framework for Competitive Self-Play based Distributed
  Multi-Agent Reinforcement Learning
TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning
Peng Sun
Jiechao Xiong
Lei Han
Xinghai Sun
Shuxing Li
Jiawei Xu
Meng Fang
Zhengyou Zhang
OffRL
LRM
33
19
0
25 Nov 2020
EigenGame: PCA as a Nash Equilibrium
EigenGame: PCA as a Nash Equilibrium
I. Gemp
Brian McWilliams
Claire Vernade
T. Graepel
32
46
0
01 Oct 2020
Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Thomas W. Anthony
Tom Eccles
Andrea Tacchetti
János Kramár
I. Gemp
...
Richard Everett
Roman Werpachowski
Satinder Singh
T. Graepel
Yoram Bachrach
24
42
0
08 Jun 2020
The AI Economist: Improving Equality and Productivity with AI-Driven Tax
  Policies
The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies
Stephan Zheng
Alexander R. Trott
Sunil Srinivasa
Nikhil Naik
Melvin Gruesbeck
David C. Parkes
R. Socher
31
131
0
28 Apr 2020
Enhanced POET: Open-Ended Reinforcement Learning through Unbounded
  Invention of Learning Challenges and their Solutions
Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions
Rui Wang
Joel Lehman
Aditya Rawal
Jiale Zhi
Yulun Li
Jeff Clune
Kenneth O. Stanley
22
125
0
19 Mar 2020
A Limited-Capacity Minimax Theorem for Non-Convex Games or: How I
  Learned to Stop Worrying about Mixed-Nash and Love Neural Nets
A Limited-Capacity Minimax Theorem for Non-Convex Games or: How I Learned to Stop Worrying about Mixed-Nash and Love Neural Nets
Gauthier Gidel
David Balduzzi
Wojciech M. Czarnecki
M. Garnelo
Yoram Bachrach
13
7
0
14 Feb 2020
Dota 2 with Large Scale Deep Reinforcement Learning
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
46
1,799
0
13 Dec 2019
A Generalized Training Approach for Multiagent Learning
A Generalized Training Approach for Multiagent Learning
Paul Muller
Shayegan Omidshafiei
Mark Rowland
K. Tuyls
Julien Perolat
...
Zhe Wang
Guy Lever
N. Heess
T. Graepel
Rémi Munos
22
89
0
27 Sep 2019
OpenSpiel: A Framework for Reinforcement Learning in Games
OpenSpiel: A Framework for Reinforcement Learning in Games
Marc Lanctot
Edward Lockhart
Jean-Baptiste Lespiau
V. Zambaldi
Satyaki Upadhyay
...
Julian Schrittwieser
Thomas W. Anthony
Edward Hughes
Ivo Danihelka
Jonah Ryan-Davis
OffRL
30
248
0
26 Aug 2019
Adversarial Policies: Attacking Deep Reinforcement Learning
Adversarial Policies: Attacking Deep Reinforcement Learning
Adam Gleave
Michael Dennis
Cody Wild
Neel Kant
Sergey Levine
Stuart J. Russell
AAML
27
349
0
25 May 2019
Arena: A General Evaluation Platform and Building Toolkit for
  Multi-Agent Intelligence
Arena: A General Evaluation Platform and Building Toolkit for Multi-Agent Intelligence
Yuhang Song
Andrzej Wojcicki
Thomas Lukasiewicz
Jianyi Wang
Abi Aryan
Zhenghua Xu
Mai Xu
Zihan Ding
Lianlong Wu
AI4CE
ELM
27
33
0
17 May 2019
1