1411.7974
Cited By
v1
v2 (latest)
Solving Games with Functional Regret Estimation
28 November 2014
Kevin Waugh
Dustin Morrill
J. Andrew Bagnell
Michael Bowling
OffRL
Papers citing "Solving Games with Functional Regret Estimation"
24 / 24 papers shown
Title
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
SyDa
SSL
OnRL
168
9
0
02 Aug 2024
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections
Dustin Morrill
Ryan D'Orazio
Marc Lanctot
J. R. Wright
Michael Bowling
Amy Greenwald
124
21
0
24 May 2022
PerfectDou: Dominating DouDizhu with Perfect Information Distillation
Yang Guan
Minghuan Liu
Weijun Hong
Weinan Zhang
Fei Fang
Guangjun Zeng
Yue Lin
119
28
0
30 Mar 2022
Near-Optimal Learning of Extensive-Form Games with Imperfect Information
Yunru Bai
Chi Jin
Song Mei
Tiancheng Yu
104
26
0
03 Feb 2022
Learning to Be Cautious
Montaser Mohammedalamen
Dustin Morrill
Alexander Sieusahai
Yash Satsangi
Michael Bowling
68
3
0
29 Oct 2021
Equivalence Analysis between Counterfactual Regret Minimization and Online Mirror Descent
Weiming Liu
Huacong Jiang
Bin Li
Houqiang Li
57
10
0
11 Oct 2021
Multi-agent Reinforcement Learning in OpenSpiel: A Reproduction Report
Michael Walton
Viliam Lisý
43
5
0
27 Feb 2021
The Advantage Regret-Matching Actor-Critic
A. Gruslys
Marc Lanctot
Rémi Munos
Finbarr Timbers
Martin Schmid
...
Jean-Baptiste Lespiau
John Schultz
M. G. Azar
Michael Bowling
K. Tuyls
OffRL
73
28
0
27 Aug 2020
Unlocking the Potential of Deep Counterfactual Value Networks
Ryan Zarick
Bryan Pellegrino
Noam Brown
Caleb Banister
OffRL
66
18
0
20 Jul 2020
DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
Eric Steinberger
Adam Lerer
Noam Brown
132
54
0
18 Jun 2020
Algorithms in Multi-Agent Systems: A Holistic Perspective from Reinforcement Learning and Game Theory
Yunlong Lu
Kai Yan
AI4CE
172
13
0
17 Jan 2020
Alternative Function Approximation Parameterizations for Solving Games: An Analysis of f-Regression Counterfactual Regret Minimization
Ryan D'Orazio
Dustin Morrill
J. R. Wright
Michael Bowling
104
11
0
06 Dec 2019
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Kai Zhang
Zhuoran Yang
Tamer Basar
231
1,228
0
24 Nov 2019
Combining No-regret and Q-learning
Ian A. Kash
Michael Sullins
Katja Hofmann
OffRL
105
17
0
07 Oct 2019
Bounds for Approximate Regret-Matching Algorithms
Scott Fujimoto
Dustin Morrill
J. R. Wright
72
3
0
03 Oct 2019
OpenSpiel: A Framework for Reinforcement Learning in Games
Marc Lanctot
Edward Lockhart
Jean-Baptiste Lespiau
V. Zambaldi
Satyaki Upadhyay
...
Julian Schrittwieser
Thomas W. Anthony
Edward Hughes
Ivo Danihelka
Jonah Ryan-Davis
OffRL
135
254
0
26 Aug 2019
Computing Approximate Equilibria in Sequential Adversarial Games by Exploitability Descent
Edward Lockhart
Marc Lanctot
Julien Pérolat
Jean-Baptiste Lespiau
Dustin Morrill
Finbarr Timbers
K. Tuyls
179
82
0
13 Mar 2019
Double Neural Counterfactual Regret Minimization
Hui Li
Kailiang Hu
Zhibang Ge
Tao Jiang
Yuan Qi
Le Song
71
52
0
27 Dec 2018
Deep Counterfactual Regret Minimization
Noam Brown
Adam Lerer
Sam Gross
Tuomas Sandholm
183
215
0
01 Nov 2018
Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
S. Srinivasan
Marc Lanctot
V. Zambaldi
Julien Pérolat
K. Tuyls
Rémi Munos
Michael Bowling
81
149
0
21 Oct 2018
Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Martin Schmid
Neil Burch
Marc Lanctot
Matej Moravčík
Rudolf Kadlec
Michael Bowling
158
64
0
09 Sep 2018
Regret Minimization for Partially Observable Deep Reinforcement Learning
Peter H. Jin
Kurt Keutzer
Sergey Levine
83
51
0
31 Oct 2017
Deep Reinforcement Learning from Self-Play in Imperfect-Information Games
Johannes Heinrich
David Silver
SSL
97
400
0
03 Mar 2016
Imperfect-Recall Abstractions with Bounds in Games
Christian Kroer
Tuomas Sandholm
80
33
0
11 Sep 2014