Bayes' Bluff: Opponent Modelling in Poker

4 July 2012

Papers citing "Bayes' Bluff: Opponent Modelling in Poker"

32 / 82 papers shown

Coordination in Adversarial Sequential Team Games via Multi-Agent Deep Reinforcement Learning A. Celli Marco Ciccone Raffaele Bongo N. Gatti 61 12 0 16 Dec 2019
Alternative Function Approximation Parameterizations for Solving Games: An Analysis of $f$-Regression Counterfactual Regret Minimization Ryan D'Orazio Dustin Morrill J. R. Wright Michael Bowling 99 11 0 06 Dec 2019
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms Kai Zhang Zhuoran Yang Tamer Basar 231 1,228 0 24 Nov 2019
Optimistic Regret Minimization for Extensive-Form Games via Dilated Distance-Generating Functions Gabriele Farina Christian Kroer Tuomas Sandholm 81 46 0 24 Oct 2019
A Generalized Training Approach for Multiagent Learning Paul Muller Shayegan Omidshafiei Mark Rowland K. Tuyls Julien Perolat ... Zhe Wang Guy Lever N. Heess T. Graepel Rémi Munos 102 93 0 27 Sep 2019
OpenSpiel: A Framework for Reinforcement Learning in Games Marc Lanctot Edward Lockhart Jean-Baptiste Lespiau V. Zambaldi Satyaki Upadhyay ... Julian Schrittwieser Thomas W. Anthony Edward Hughes Ivo Danihelka Jonah Ryan-Davis OffRL 133 254 0 26 Aug 2019
Low-Variance and Zero-Variance Baselines for Extensive-Form Games Trevor Davis Martin Schmid Michael Bowling OffRL 77 19 0 22 Jul 2019
Reasoning about Hypothetical Agent Behaviours and their Parameters Stefano V. Albrecht Peter Stone 88 63 0 26 Jun 2019
Neural Replicator Dynamics Daniel Hennes Dustin Morrill Shayegan Omidshafiei Rémi Munos Julien Perolat ... A. Gruslys Jean-Baptiste Lespiau Paavo Parmas Edgar A. Duéñez-Guzmán K. Tuyls 74 16 0 01 Jun 2019
Value Functions for Depth-Limited Solving in Zero-Sum Imperfect-Information Games Vojtěch Kovařík Dominik Seitz Viliam Lisý Jan Rudolf Shuo Sun Karel Ha FAtt 91 1 0 31 May 2019
Computing Approximate Equilibria in Sequential Adversarial Games by Exploitability Descent Edward Lockhart Marc Lanctot Julien Pérolat Jean-Baptiste Lespiau Dustin Morrill Finbarr Timbers K. Tuyls 177 82 0 13 Mar 2019
$α$-Rank: Multi-Agent Evaluation by Evolution Shayegan Omidshafiei Christos H. Papadimitriou Georgios Piliouras K. Tuyls Mark Rowland Jean-Baptiste Lespiau Wojciech M. Czarnecki Marc Lanctot Julien Perolat Rémi Munos 119 121 0 04 Mar 2019
Single Deep Counterfactual Regret Minimization Eric Steinberger BDL 52 40 0 22 Jan 2019
Double Neural Counterfactual Regret Minimization Hui Li Kailiang Hu Zhibang Ge Tao Jiang Yuan Qi Le Song 71 52 0 27 Dec 2018
Learning Sharing Behaviors with Arbitrary Numbers of Agents Katherine Metcalf B. Theobald N. Apostoloff 23 2 0 10 Dec 2018
Solving Large Extensive-Form Games with Strategy Constraints Trevor Davis Kevin Waugh Michael Bowling 45 12 0 20 Sep 2018
Online Convex Optimization for Sequential Decision Processes and Extensive-Form Games Gabriele Farina Christian Kroer Tuomas Sandholm 106 59 0 10 Sep 2018
Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines Martin Schmid Neil Burch Marc Lanctot Matej Moravčík Rudolf Kadlec Michael Bowling 158 64 0 09 Sep 2018
ExIt-OOS: Towards Learning from Planning in Imperfect Information Games Andy Kitchen Michela Benedetti OnRL LRM 22 1 0 30 Aug 2018
A Generalised Method for Empirical Game Theoretic Analysis K. Tuyls Julien Perolat Marc Lanctot Joel Z Leibo T. Graepel 52 57 0 16 Mar 2018
Symmetric Decomposition of Asymmetric Games K. Tuyls Julien Perolat Marc Lanctot Georg Ostrovski Rahul Savani Joel Z Leibo Toby Ord T. Graepel Shane Legg 75 39 0 14 Nov 2017
Regret Minimization in Behaviorally-Constrained Zero-Sum Games Gabriele Farina Christian Kroer Tuomas Sandholm 61 23 0 09 Nov 2017
Autonomous Agents Modelling Other Agents: A Comprehensive Survey and Open Problems Stefano V. Albrecht Peter Stone 158 475 0 23 Sep 2017
Theoretical and Practical Advances on Smoothing for Extensive-Form Games Christian Kroer Kevin Waugh Fatma Kılınç Karzan Tuomas Sandholm 101 24 0 16 Feb 2017
AIVAT: A New Variance Reduction Technique for Agent Evaluation in Imperfect Information Games Neil Burch Martin Schmid Matej Moravčík Michael Bowling 83 22 0 20 Dec 2016
Opponent Modeling in Deep Reinforcement Learning He He Jordan L. Boyd-Graber Kevin Kwok Hal Daumé III BDL 86 327 0 18 Sep 2016
Reduced Space and Faster Convergence in Imperfect-Information Games via Regret-Based Pruning Noam Brown Tuomas Sandholm 39 5 0 12 Sep 2016
Bayesian Opponent Exploitation in Imperfect-Information Games Sam Ganzfried Qingyun Sun 20 16 0 10 Mar 2016
Deep Reinforcement Learning from Self-Play in Imperfect-Information Games Johannes Heinrich David Silver SSL 88 400 0 03 Mar 2016
Belief and Truth in Hypothesised Behaviours Stefano V. Albrecht J. Crandall S. Ramamoorthy 92 76 0 28 Jul 2015
Solving Games with Functional Regret Estimation Kevin Waugh Dustin Morrill J. Andrew Bagnell Michael Bowling OffRL 92 58 0 28 Nov 2014
A Methodology for Player Modeling based on Machine Learning Marlos C. Machado 43 0 0 13 Dec 2013