2408.01072
Cited By
A Survey on Self-play Methods in Reinforcement Learning
2 August 2024
Ruize Zhang, Zelai Xu, Chengdong Ma, Chao Yu, Weijuan Tu, Wenhao Tang, Deheng Ye, Wenbo Ding, Yaodong Yang, Yu Wang
SyDa, SSL, OnRL
Papers citing "A Survey on Self-play Methods in Reinforcement Learning"
10 / 10 papers shown
Title
VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play
Zelai Xu, Chao Yu, Ruize Zhang, Huining Yuan, Xiangmin Yi, Shilong Ji, Chuqi Wang, Wenhao Tang, Yu-Xiang Wang
123 | 0 | 0 | 04 Feb 2025

On-line Policy Improvement using Monte-Carlo Search
Gerald Tesauro, Gregory R. Galperin
62 | 270 | 0 | 09 Jan 2025

Self-Rewarding Language Models
Weizhe Yuan, Richard Yuanzhe Pang, Kyunghyun Cho, Xian Li, Sainbayar Sukhbaatar, Jing Xu, Jason Weston
ReLM, SyDa, ALM, LRM
215 | 291 | 0 | 18 Jan 2024

Neural Population Learning beyond Symmetric Zero-sum Games
Siqi Liu, Luke Marris, Marc Lanctot, Georgios Piliouras, Joel Z. Leibo, N. Heess
MLT
59 | 3 | 0 | 10 Jan 2024

A survey on algorithms for Nash equilibria in finite normal-form games
Hanyu Li, Wenhan Huang, Zhijian Duan, D. Mguni, Kun Shao, Jun Wang, Xiaotie Deng
42 | 4 | 0 | 18 Dec 2023

Simplex Neural Population Learning: Any-Mixture Bayes-Optimality in Symmetric Zero-sum Games
Siqi Liu, Marc Lanctot, Luke Marris, N. Heess
MLT
35 | 12 | 0 | 31 May 2022

Training language models to follow instructions with human feedback
Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, ..., Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan J. Lowe
OSLM, ALM
301 | 11,730 | 0 | 04 Mar 2022

Efficient Policy Space Response Oracles
Ming Zhou, Jingxiao Chen, Ying Wen, Weinan Zhang, Yaodong Yang, Yong Yu, Jun Wang
38 | 10 | 0 | 28 Jan 2022

MAVEN: Multi-Agent Variational Exploration
Anuj Mahajan, Tabish Rashid, Mikayel Samvelyan, Shimon Whiteson
DRL
126 | 350 | 0 | 16 Oct 2019

Determinantal point processes for machine learning
Alex Kulesza, B. Taskar
146 | 1,045 | 0 | 25 Jul 2012