Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2103.07927
Cited By
Modelling Behavioural Diversity for Learning in Open-Ended Games
14 March 2021
Nicolas Perez Nieves
Yaodong Yang
Oliver Slumbers
D. Mguni
Ying Wen
Jun Wang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Modelling Behavioural Diversity for Learning in Open-Ended Games"
39 / 39 papers shown
Title
A Survey on Self-play Methods in Reinforcement Learning
Ruize Zhang
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDa
SSL
OnRL
49
8
0
02 Aug 2024
Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles
Jiesong Lian
Yucong Huang
Chengdong Ma
Mingzhi Wang
Ying Wen
Long Hu
Yixue Hao
57
0
0
31 May 2024
Controlling Behavioral Diversity in Multi-Agent Reinforcement Learning
Matteo Bettini
Ryan Kortvelesy
Amanda Prorok
27
4
0
23 May 2024
Self-adaptive PSRO: Towards an Automatic Population-based Game Solver
Pengdeng Li
Shuxin Li
Chang Yang
Xinrun Wang
Xiao Huang
Hau Chan
Bo An
29
1
0
17 Apr 2024
Policy Space Response Oracles: A Survey
Ariyan Bighashdel
Yongzhao Wang
Stephen Marcus McAleer
Rahul Savani
F. Oliehoek
25
6
0
04 Mar 2024
Feint in Multi-Player Games
Junyu Liu
Wangkai Jin
Xiangjun Peng
OffRL
25
0
0
04 Mar 2024
Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum Learning
Jiayu Chen
Zelai Xu
Yunfei Li
Chao Yu
Jiaming Song
Huazhong Yang
Fei Fang
Yu Wang
Yi Wu
24
4
0
07 Oct 2023
Fictitious Cross-Play: Learning Global Nash Equilibrium in Mixed Cooperative-Competitive Games
Zelai Xu
Yancheng Liang
Chao Yu
Yu Wang
Yi Wu
17
8
0
05 Oct 2023
Diversifying AI: Towards Creative Chess with AlphaZero
Tom Zahavy
Vivek Veeriah
Shaobo Hou
Kevin Waugh
Matthew Lai
Edouard Leurent
Nenad Tomašev
Lisa Schut
Demis Hassabis
Satinder Singh
29
15
0
17 Aug 2023
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Yang Li
Kun Xiong
Yingping Zhang
Jiangcheng Zhu
Stephen Marcus McAleer
Wei Pan
J. Wang
Zonghong Dai
Yaodong Yang
31
2
0
09 Aug 2023
Policy Space Diversity for Non-Transitive Games
Jian Yao
Weiming Liu
Haobo Fu
Yaodong Yang
Stephen Marcus McAleer
Qiang Fu
Wei Yang
35
9
0
29 Jun 2023
Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL
Xiangyu Liu
Souradip Chakraborty
Yanchao Sun
Furong Huang
AAML
26
4
0
27 May 2023
Learning Diverse Risk Preferences in Population-based Self-play
Y. Jiang
Qihan Liu
Xiaoteng Ma
Chenghao Li
Yiqin Yang
Jun Yang
Bin Liang
Qianchuan Zhao
54
3
0
19 May 2023
Mixture of personality improved Spiking actor network for efficient multi-agent cooperation
Xiyun Li
Ziyi Ni
Jingqing Ruan
Linghui Meng
Jing Shi
Tielin Zhang
Bo Xu
53
4
0
10 May 2023
Heterogeneous Social Value Orientation Leads to Meaningful Diversity in Sequential Social Dilemmas
Udari Madhushani
Kevin R. McKee
J. Agapiou
Joel Z. Leibo
Richard Everett
Thomas W. Anthony
Edward Hughes
K. Tuyls
Edgar A. Duénez-Guzmán
36
2
0
01 May 2023
ASP: Learn a Universal Neural Solver!
Chenguang Wang
Zhouliang Yu
Stephen Marcus McAleer
Tianshu Yu
Yao-Chun Yang
AAML
32
24
0
01 Mar 2023
Policy Dispersion in Non-Markovian Environment
B. Qu
Xiaofeng Cao
Jielong Yang
Hechang Chen
Chang Yi
Ivor W.Tsang
Yew-Soon Ong
14
0
0
28 Feb 2023
Diverse Policy Optimization for Structured Action Space
Wenhao Li
Baoxiang Wang
Shanchao Yang
H. Zha
OffRL
29
1
0
23 Feb 2023
A Unified Algorithm Framework for Unsupervised Discovery of Skills based on Determinantal Point Process
Jiayu Chen
Vaneet Aggarwal
Tian-Shing Lan
16
1
0
01 Dec 2022
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Stephen Marcus McAleer
JB Lanier
Kevin A. Wang
Pierre Baldi
Roy Fox
T. Sandholm
27
18
0
13 Jul 2022
A Game-Theoretic Framework for Managing Risk in Multi-Agent Systems
Oliver Slumbers
D. Mguni
Stephen Marcus McAleer
Stefano B. Blumberg
Jun Wang
Yaodong Yang
30
9
0
30 May 2022
On the Convergence of Fictitious Play: A Decomposition Approach
Yurong Chen
Xiaotie Deng
Chenchen Li
D. Mguni
Jun Wang
Xiang Yan
Yaodong Yang
19
4
0
03 May 2022
Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization
Zihan Zhou
Wei Fu
Bingliang Zhang
Yi Wu
25
28
0
04 Apr 2022
Efficient Policy Space Response Oracles
Ming Zhou
Jingxiao Chen
Ying Wen
Weinan Zhang
Yaodong Yang
Yong Yu
Jun Wang
46
10
0
28 Jan 2022
Maximum Entropy Population-Based Training for Zero-Shot Human-AI Coordination
Rui Zhao
Jinming Song
Yufeng Yuan
Haifeng Hu
Yang Gao
Yi Wu
Zhongqian Sun
Yang Wei
24
63
0
22 Dec 2021
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks
Linghui Meng
Muning Wen
Yaodong Yang
Chenyang Le
Xiyun Li
Weinan Zhang
Ying Wen
Haifeng Zhang
Jun Wang
Bo Xu
OffRL
26
38
0
06 Dec 2021
Online MAP Inference and Learning for Nonsymmetric Determinantal Point Processes
Aravind Reddy
Ryan A. Rossi
Zhao-quan Song
Anup B. Rao
Tung Mai
Nedim Lipka
Gang Wu
Eunyee Koh
Nesreen Ahmed
27
2
0
29 Nov 2021
A Game-Theoretic Approach for Improving Generalization Ability of TSP Solvers
Chenguang Wang
Yaodong Yang
Oliver Slumbers
Congying Han
Tiande Guo
Haifeng Zhang
Jun Wang
19
17
0
28 Oct 2021
Measuring the Non-Transitivity in Chess
R. Sanjaya
Jun Wang
Yaodong Yang
11
22
0
22 Oct 2021
Online Markov Decision Processes with Non-oblivious Strategic Adversary
Le Cong Dinh
D. Mguni
Long Tran-Thanh
Jun Wang
Yaodong Yang
17
5
0
07 Oct 2021
On the Complexity of Computing Markov Perfect Equilibrium in General-Sum Stochastic Games
Xiaotie Deng
Ningyuan Li
D. Mguni
Jun Wang
Yaodong Yang
21
46
0
04 Sep 2021
Is Nash Equilibrium Approximator Learnable?
Zhijian Duan
Wenhan Huang
Dinghuai Zhang
Yali Du
Jun Wang
Yaodong Yang
Xiaotie Deng
6
6
0
17 Aug 2021
Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games
Xiangyu Liu
Hangtian Jia
Ying Wen
Yaodong Yang
Yujing Hu
Yingfeng Chen
Changjie Fan
Zhipeng Hu
20
18
0
09 Jun 2021
MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning
Ming Zhou
Ziyu Wan
Hanjing Wang
Muning Wen
Runzhe Wu
Ying Wen
Yaodong Yang
Weinan Zhang
Jun Wang
OffRL
19
46
0
05 Jun 2021
Neural Auto-Curricula
Xidong Feng
Oliver Slumbers
Ziyu Wan
Bo Liu
Stephen Marcus McAleer
Ying Wen
Jun Wang
Yaodong Yang
15
1
0
04 Jun 2021
Online Double Oracle
Le Cong Dinh
Yaodong Yang
Stephen Marcus McAleer
Zheng Tian
Nicolas Perez Nieves
Oliver Slumbers
D. Mguni
Haitham Bou-Ammar
Jun Wang
26
30
0
13 Mar 2021
Quantifying the effects of environment and population diversity in multi-agent reinforcement learning
Kevin R. McKee
Joel Z. Leibo
Charlie Beattie
Richard Everett
42
31
0
16 Feb 2021
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving
Ming Zhou
Jun-Jie Luo
Julian Villela
Yaodong Yang
David Rusu
...
H. Ammar
Hongbo Zhang
Wulong Liu
Jianye Hao
Jun Wang
136
193
0
19 Oct 2020
Determinantal point processes for machine learning
Alex Kulesza
B. Taskar
162
1,122
0
25 Jul 2012
1