Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2006.10410
Cited By
DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
18 June 2020
Eric Steinberger
Adam Lerer
Noam Brown
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DREAM: Deep Regret minimization with Advantage baselines and Model-free learning"
4 / 4 papers shown
Title
A Survey on Self-play Methods in Reinforcement Learning
Ruize Zhang
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDa
SSL
OnRL
46
8
0
02 Aug 2024
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Yang Li
Kun Xiong
Yingping Zhang
Jiangcheng Zhu
Stephen Marcus McAleer
Wei Pan
J. Wang
Zonghong Dai
Yaodong Yang
24
2
0
09 Aug 2023
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Stephen Marcus McAleer
JB Lanier
Kevin A. Wang
Pierre Baldi
Roy Fox
T. Sandholm
27
18
0
13 Jul 2022
Learning to Be Cautious
Montaser Mohammedalamen
Dustin Morrill
Alexander Sieusahai
Yash Satsangi
Michael H. Bowling
13
3
0
29 Oct 2021
1