ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2008.12234
  4. Cited By
The Advantage Regret-Matching Actor-Critic

The Advantage Regret-Matching Actor-Critic

27 August 2020
A. Gruslys
Marc Lanctot
Rémi Munos
Finbarr Timbers
Martin Schmid
Julien Perolat
Dustin Morrill
V. Zambaldi
Jean-Baptiste Lespiau
John Schultz
M. G. Azar
Michael Bowling
K. Tuyls
    OffRL
ArXiv (abs)PDFHTML

Papers citing "The Advantage Regret-Matching Actor-Critic"

9 / 9 papers shown
Title
A Survey on Self-play Methods in Reinforcement Learning
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
Yu Wang
SyDaSSLOnRL
168
9
0
02 Aug 2024
A Survey of Decision Making in Adversarial Games
A Survey of Decision Making in Adversarial Games
Xiuxian Li
Min Meng
Yiguang Hong
Jie-bin Chen
AAML
97
15
0
16 Jul 2022
ESCHER: Eschewing Importance Sampling in Games by Computing a History
  Value Function to Estimate Regret
ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret
Stephen Marcus McAleer
Gabriele Farina
Marc Lanctot
Tuomas Sandholm
174
26
0
08 Jun 2022
PerfectDou: Dominating DouDizhu with Perfect Information Distillation
PerfectDou: Dominating DouDizhu with Perfect Information Distillation
Yang Guan
Minghuan Liu
Weijun Hong
Weinan Zhang
Fei Fang
Guangjun Zeng
Yue Lin
119
28
0
30 Mar 2022
Equivalence Analysis between Counterfactual Regret Minimization and
  Online Mirror Descent
Equivalence Analysis between Counterfactual Regret Minimization and Online Mirror Descent
Weiming Liu
Huacong Jiang
Bin Li
Houqiang Li
54
10
0
11 Oct 2021
Multi-agent Reinforcement Learning in OpenSpiel: A Reproduction Report
Multi-agent Reinforcement Learning in OpenSpiel: A Reproduction Report
Michael Walton
Viliam Lisý
33
5
0
27 Feb 2021
Pipeline PSRO: A Scalable Approach for Finding Approximate Nash
  Equilibria in Large Games
Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
Stephen Marcus McAleer
John Lanier
Roy Fox
Pierre Baldi
63
77
0
15 Jun 2020
Approximate exploitability: Learning a best response in large games
Approximate exploitability: Learning a best response in large games
Finbarr Timbers
Nolan Bard
Edward Lockhart
Marc Lanctot
Martin Schmid
Neil Burch
Julian Schrittwieser
Thomas Hubert
Michael Bowling
AAML
80
27
0
20 Apr 2020
DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker
DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker
Matej Moravcík
Martin Schmid
Neil Burch
Viliam Lisý
Dustin Morrill
Nolan Bard
Trevor Davis
Kevin Waugh
Michael Bradley Johanson
Michael Bowling
BDL
261
913
0
06 Jan 2017
1