Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2008.12234
Cited By
The Advantage Regret-Matching Actor-Critic
27 August 2020
A. Gruslys
Marc Lanctot
Rémi Munos
Finbarr Timbers
Martin Schmid
Julien Perolat
Dustin Morrill
V. Zambaldi
Jean-Baptiste Lespiau
John Schultz
M. G. Azar
Michael Bowling
K. Tuyls
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The Advantage Regret-Matching Actor-Critic"
9 / 9 papers shown
Title
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDa
SSL
OnRL
168
9
0
02 Aug 2024
A Survey of Decision Making in Adversarial Games
Xiuxian Li
Min Meng
Yiguang Hong
Jie-bin Chen
AAML
97
15
0
16 Jul 2022
ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret
Stephen Marcus McAleer
Gabriele Farina
Marc Lanctot
Tuomas Sandholm
174
26
0
08 Jun 2022
PerfectDou: Dominating DouDizhu with Perfect Information Distillation
Yang Guan
Minghuan Liu
Weijun Hong
Weinan Zhang
Fei Fang
Guangjun Zeng
Yue Lin
119
28
0
30 Mar 2022
Equivalence Analysis between Counterfactual Regret Minimization and Online Mirror Descent
Weiming Liu
Huacong Jiang
Bin Li
Houqiang Li
54
10
0
11 Oct 2021
Multi-agent Reinforcement Learning in OpenSpiel: A Reproduction Report
Michael Walton
Viliam Lisý
33
5
0
27 Feb 2021
Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
Stephen Marcus McAleer
John Lanier
Roy Fox
Pierre Baldi
63
77
0
15 Jun 2020
Approximate exploitability: Learning a best response in large games
Finbarr Timbers
Nolan Bard
Edward Lockhart
Marc Lanctot
Martin Schmid
Neil Burch
Julian Schrittwieser
Thomas Hubert
Michael Bowling
AAML
80
27
0
20 Apr 2020
DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker
Matej Moravcík
Martin Schmid
Neil Burch
Viliam Lisý
Dustin Morrill
Nolan Bard
Trevor Davis
Kevin Waugh
Michael Bradley Johanson
Michael Bowling
BDL
261
913
0
06 Jan 2017
1