Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1207.1411
Cited By
Bayes' Bluff: Opponent Modelling in Poker
4 July 2012
F. Southey
Michael Bowling
Bryce Larson
Carmelo Piccione
Neil Burch
Darse Billings
D. C. Rayner
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Bayes' Bluff: Opponent Modelling in Poker"
32 / 82 papers shown
Title
Coordination in Adversarial Sequential Team Games via Multi-Agent Deep Reinforcement Learning
A. Celli
Marco Ciccone
Raffaele Bongo
N. Gatti
61
12
0
16 Dec 2019
Alternative Function Approximation Parameterizations for Solving Games: An Analysis of
f
f
f
-Regression Counterfactual Regret Minimization
Ryan DÓrazio
Dustin Morrill
J. R. Wright
Michael Bowling
99
11
0
06 Dec 2019
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Kai Zhang
Zhuoran Yang
Tamer Basar
231
1,228
0
24 Nov 2019
Optimistic Regret Minimization for Extensive-Form Games via Dilated Distance-Generating Functions
Gabriele Farina
Christian Kroer
Tuomas Sandholm
81
46
0
24 Oct 2019
A Generalized Training Approach for Multiagent Learning
Paul Muller
Shayegan Omidshafiei
Mark Rowland
K. Tuyls
Julien Perolat
...
Zhe Wang
Guy Lever
N. Heess
T. Graepel
Rémi Munos
102
93
0
27 Sep 2019
OpenSpiel: A Framework for Reinforcement Learning in Games
Marc Lanctot
Edward Lockhart
Jean-Baptiste Lespiau
V. Zambaldi
Satyaki Upadhyay
...
Julian Schrittwieser
Thomas W. Anthony
Edward Hughes
Ivo Danihelka
Jonah Ryan-Davis
OffRL
133
254
0
26 Aug 2019
Low-Variance and Zero-Variance Baselines for Extensive-Form Games
Trevor Davis
Martin Schmid
Michael Bowling
OffRL
77
19
0
22 Jul 2019
Reasoning about Hypothetical Agent Behaviours and their Parameters
Stefano V. Albrecht
Peter Stone
88
63
0
26 Jun 2019
Neural Replicator Dynamics
Daniel Hennes
Dustin Morrill
Shayegan Omidshafiei
Rémi Munos
Julien Perolat
...
A. Gruslys
Jean-Baptiste Lespiau
Paavo Parmas
Edgar A. Duénez-Guzmán
K. Tuyls
74
16
0
01 Jun 2019
Value Functions for Depth-Limited Solving in Zero-Sum Imperfect-Information Games
Vojtěch Kovařík
Dominik Seitz
Viliam Lisý
Jan Rudolf
Shuo Sun
Karel Ha
FAtt
91
1
0
31 May 2019
Computing Approximate Equilibria in Sequential Adversarial Games by Exploitability Descent
Edward Lockhart
Marc Lanctot
Julien Pérolat
Jean-Baptiste Lespiau
Dustin Morrill
Finbarr Timbers
K. Tuyls
177
82
0
13 Mar 2019
α
α
α
-Rank: Multi-Agent Evaluation by Evolution
Shayegan Omidshafiei
Christos H. Papadimitriou
Georgios Piliouras
K. Tuyls
Mark Rowland
Jean-Baptiste Lespiau
Wojciech M. Czarnecki
Marc Lanctot
Julien Perolat
Rémi Munos
119
121
0
04 Mar 2019
Single Deep Counterfactual Regret Minimization
Eric Steinberger
BDL
52
40
0
22 Jan 2019
Double Neural Counterfactual Regret Minimization
Hui Li
Kailiang Hu
Zhibang Ge
Tao Jiang
Yuan Qi
Le Song
71
52
0
27 Dec 2018
Learning Sharing Behaviors with Arbitrary Numbers of Agents
Katherine Metcalf
B. Theobald
N. Apostoloff
23
2
0
10 Dec 2018
Solving Large Extensive-Form Games with Strategy Constraints
Trevor Davis
Kevin Waugh
Michael Bowling
45
12
0
20 Sep 2018
Online Convex Optimization for Sequential Decision Processes and Extensive-Form Games
Gabriele Farina
Christian Kroer
Tuomas Sandholm
106
59
0
10 Sep 2018
Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Martin Schmid
Neil Burch
Marc Lanctot
Matej Moravcík
Rudolf Kadlec
Michael Bowling
158
64
0
09 Sep 2018
ExIt-OOS: Towards Learning from Planning in Imperfect Information Games
Andy Kitchen
Michela Benedetti
OnRL
LRM
22
1
0
30 Aug 2018
A Generalised Method for Empirical Game Theoretic Analysis
K. Tuyls
Julien Perolat
Marc Lanctot
Joel Z Leibo
T. Graepel
52
57
0
16 Mar 2018
Symmetric Decomposition of Asymmetric Games
K. Tuyls
Julien Perolat
Marc Lanctot
Georg Ostrovski
Rahul Savani
Joel Z Leibo
Toby Ord
T. Graepel
Shane Legg
75
39
0
14 Nov 2017
Regret Minimization in Behaviorally-Constrained Zero-Sum Games
Gabriele Farina
Christian Kroer
Tuomas Sandholm
61
23
0
09 Nov 2017
Autonomous Agents Modelling Other Agents: A Comprehensive Survey and Open Problems
Stefano V. Albrecht
Peter Stone
158
475
0
23 Sep 2017
Theoretical and Practical Advances on Smoothing for Extensive-Form Games
Christian Kroer
Kevin Waugh
Fatma Kılınç Karzan
Tuomas Sandholm
101
24
0
16 Feb 2017
AIVAT: A New Variance Reduction Technique for Agent Evaluation in Imperfect Information Games
Neil Burch
Martin Schmid
Matej Moravcík
Michael Bowling
83
22
0
20 Dec 2016
Opponent Modeling in Deep Reinforcement Learning
He He
Jordan L. Boyd-Graber
Kevin Kwok
Hal Daumé III
BDL
86
327
0
18 Sep 2016
Reduced Space and Faster Convergence in Imperfect-Information Games via Regret-Based Pruning
Noam Brown
Tuomas Sandholm
39
5
0
12 Sep 2016
Bayesian Opponent Exploitation in Imperfect-Information Games
Sam Ganzfried
Qingyun Sun
20
16
0
10 Mar 2016
Deep Reinforcement Learning from Self-Play in Imperfect-Information Games
Johannes Heinrich
David Silver
SSL
88
400
0
03 Mar 2016
Belief and Truth in Hypothesised Behaviours
Stefano V. Albrecht
J. Crandall
S. Ramamoorthy
92
76
0
28 Jul 2015
Solving Games with Functional Regret Estimation
Kevin Waugh
Dustin Morrill
J. Andrew Bagnell
Michael Bowling
OffRL
92
58
0
28 Nov 2014
A Methodology for Player Modeling based on Machine Learning
Marlos C. Machado
43
0
0
13 Dec 2013
Previous
1
2