Bayes' Bluff: Opponent Modelling in Poker

4 July 2012

Papers citing "Bayes' Bluff: Opponent Modelling in Poker"

50 / 82 papers shown

Title
Strategy-Augmented Planning for Large Language Models via Opponent Exploitation Shuai Xu Sijia Cui Yansen Wang Bo Xu Qi Wang RALM 113 0 0 13 May 2025
PolicyEvol-Agent: Evolving Policy via Environment Perception and Self-Awareness with Theory of Mind Yajie Yu Yue Feng LLMAG 81 0 0 20 Apr 2025
Learning in Games with Progressive Hiding Benjamin Heymann Marc Lanctot 72 0 0 05 Sep 2024
A Policy-Gradient Approach to Solving Imperfect-Information Games with Iterate Convergence Mingyang Liu Gabriele Farina Asuman Ozdaglar 76 3 0 01 Aug 2024
LiteEFG: An Efficient Python Library for Solving Extensive-form Games Mingyang Liu Gabriele Farina Asuman Ozdaglar 59 2 0 29 Jul 2024
Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles Jiesong Lian Yucong Huang Chengdong Ma Mingzhi Wang Ying Wen Long Hu Yixue Hao 154 1 0 31 May 2024
Mixture of Public and Private Distributions in Imperfect Information Games Jérôme Arjonilla Abdallah Saffidine Tristan Cazenave 146 1 0 23 May 2024
Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent Hang Xu Kai Li Bingyun Liu Haobo Fu Qiang Fu Junliang Xing Jian Cheng 70 3 0 22 Apr 2024
A Survey on Large Language Model-Based Game Agents Sihao Hu Tiansheng Huang Gaowen Liu Ramana Rao Kompella Gaowen Liu Selim Furkan Tekin Yichang Xu Zachary Yahn Ling Liu LLMAG LM&Ro AI4CE LM&MA 226 57 0 02 Apr 2024
Neural Population Learning beyond Symmetric Zero-sum Games Siqi Liu Luke Marris Marc Lanctot Georgios Piliouras Joel Z Leibo N. Heess MLT 89 3 0 10 Jan 2024
Efficient Learning in Polyhedral Games via Best Response Oracles Darshan Chakrabarti Gabriele Farina Christian Kroer 62 4 0 06 Dec 2023
Guarantees for Self-Play in Multiplayer Games via Polymatrix Decomposability Revan MacQueen James R. Wright 72 2 0 17 Oct 2023
Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4 Jiaxian Guo Bo Yang Paul D. Yoo Bill Yuchen Lin Yusuke Iwasawa Yutaka Matsuo LLMAG 118 45 0 29 Sep 2023
Local and adaptive mirror descents in extensive-form games Côme Fiegel Pierre Ménard Tadashi Kozuno Rémi Munos Vianney Perchet Michal Valko 104 2 0 01 Sep 2023
Block-Coordinate Methods and Restarting for Solving Extensive-Form Games D. Chakrabarti Jelena Diakonikolas Christian Kroer 63 8 0 31 Jul 2023
Policy Space Diversity for Non-Transitive Games Jian Yao Weiming Liu Haobo Fu Yaodong Yang Stephen Marcus McAleer Qiang Fu Wei Yang 109 11 0 29 Jun 2023
Hierarchical Deep Counterfactual Regret Minimization Jiayu Chen Tian-Shing Lan Vaneet Aggarwal 71 3 0 27 May 2023
Regret Matching+: (In)Stability and Fast Convergence in Games Gabriele Farina Julien Grand-Clément Christian Kroer Chung-Wei Lee Haipeng Luo 57 7 0 24 May 2023
$Equilibrium-Invariant Embedding, Metric Space, and Fundamental Set of $2\times2$ Normal-Form Games$ Equilibrium-Invariant Embedding, Metric Space, and Fundamental Set of $2\times2$ Normal-Form Games Luke Marris I. Gemp Georgios Piliouras 36 4 0 19 Apr 2023
Pgx: Hardware-Accelerated Parallel Game Simulators for Reinforcement Learning Sotetsu Koyamada Shinri Okano Soichiro Nishimori Y. Murata Keigo Habara Haruka Kita Shin Ishii 110 26 0 29 Mar 2023
Convergence analysis and acceleration of the smoothing methods for solving extensive-form games Keigo Habara E. H. Fukuda N. Yamashita 12 0 0 20 Mar 2023
Adapting to game trees in zero-sum imperfect information games Côme Fiegel Pierre Ménard Tadashi Kozuno Rémi Munos Vianney Perchet Michal Valko 383 10 0 23 Dec 2022
The Power of Regularization in Solving Extensive-Form Games Ming-Yuan Liu Asuman Ozdaglar Tiancheng Yu Kai Zhang 58 23 0 19 Jun 2022
A Marriage between Adversarial Team Games and 2-player Games: Enabling Abstractions, No-regret Learning, and Subgame Solving Luca Carminati Federico Cacciamani Marco Ciccone N. Gatti 56 16 0 18 Jun 2022
A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games Samuel Sokota Ryan DÓrazio J. Zico Kolter Nicolas Loizou Marc Lanctot Ioannis Mitliagkas Noam Brown Christian Kroer 72 1 0 12 Jun 2022
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections Dustin Morrill Ryan DÓrazio Marc Lanctot J. R. Wright Michael Bowling Amy Greenwald 124 21 0 24 May 2022
DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning Youpeng Zhao Jian Zhao Xu Hu Wen-gang Zhou Houqiang Li 65 15 0 06 Apr 2022
Optimal Correlated Equilibria in General-Sum Extensive-Form Games: Fixed-Parameter Algorithms, Hardness, and Two-Sided Column-Generation B. Zhang Gabriele Farina A. Celli Tuomas Sandholm 81 22 0 14 Mar 2022
Kernelized Multiplicative Weights for 0/1-Polyhedral Games: Bridging the Gap Between Learning in Extensive-Form and Normal-Form Games Gabriele Farina Chung-Wei Lee Haipeng Luo Christian Kroer 46 32 0 01 Feb 2022
Student of Games: A unified learning algorithm for both perfect and imperfect information games Martin Schmid Matej Moravcík Neil Burch Rudolf Kadlec Josh Davidson ... Marc Lanctot G. Z. Holland Elnaz Davoodi Alden Christianson Michael Bowling 86 22 0 06 Dec 2021
Equivalence Analysis between Counterfactual Regret Minimization and Online Mirror Descent Weiming Liu Huacong Jiang Bin Li Houqiang Li 57 10 0 11 Oct 2021
Last-iterate Convergence in Extensive-Form Games Chung-Wei Lee Christian Kroer Haipeng Luo 190 40 0 27 Jun 2021
Iterative Empirical Game Solving via Single Policy Best Response Max O. Smith Thomas W. Anthony Michael P. Wellman 63 18 0 03 Jun 2021
Better Regularization for Sequential Decision Spaces: Fast Convergence Rates for Nash, Correlated, and Team Equilibria Gabriele Farina Christian Kroer Tuomas Sandholm 61 26 0 27 May 2021
D2CFR: Minimize Counterfactual Regret with Deep Dueling Neural Network Huale Li Xuan Wang Zengyue Guo Jia-jia Zhang Shuhan Qi 36 1 0 26 May 2021
Online Double Oracle Le Cong Dinh Yaodong Yang Stephen Marcus McAleer Zheng Tian Nicolas Perez Nieves Oliver Slumbers D. Mguni Haitham Bou-Ammar Jun Wang 122 31 0 13 Mar 2021
Bandit Linear Optimization for Sequential Decision Making and Extensive-Form Games Gabriele Farina Robin Schmucker Tuomas Sandholm 173 21 0 08 Mar 2021
Model-Free Online Learning in Unknown Sequential Decision Making Problems and Games Gabriele Farina Tuomas Sandholm OffRL 83 18 0 08 Mar 2021
ScrofaZero: Mastering Trick-taking Poker Game Gongzhu by Deep Reinforcement Learning Naichen Shi Ruichen Li Sun Youran 40 0 0 15 Feb 2021
Safe Search for Stackelberg Equilibria in Extensive-Form Games Chun Kai Ling Noam Brown OffRL 31 3 0 02 Feb 2021
Deep Interactive Bayesian Reinforcement Learning via Meta-Learning L. Zintgraf Sam Devlin K. Ciosek Shimon Whiteson Katja Hofmann BDL 67 45 0 11 Jan 2021
Model-free Neural Counterfactual Regret Minimization with Bootstrap Learning Weiming Liu Bin Li Julian Togelius 83 8 0 03 Dec 2020
Faster Algorithms for Optimal Ex-Ante Coordinated Collusive Strategies in Extensive-Form Zero-Sum Games Gabriele Farina A. Celli N. Gatti Tuomas Sandholm 32 3 0 21 Sep 2020
Faster Game Solving via Predictive Blackwell Approachability: Connecting Regret Matching and Mirror Descent Gabriele Farina Christian Kroer Tuomas Sandholm 124 74 0 28 Jul 2020
DREAM: Deep Regret minimization with Advantage baselines and Model-free learning Eric Steinberger Adam Lerer Noam Brown 132 54 0 18 Jun 2020
Sparsified Linear Programming for Zero-Sum Equilibrium Finding B. Zhang Tuomas Sandholm 66 10 0 05 Jun 2020
No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium A. Celli A. Marchesi Gabriele Farina N. Gatti 146 47 0 01 Apr 2020
Robust Stochastic Bayesian Games for Behavior Space Coverage Julian Bernhard Alois Knoll 114 3 0 25 Mar 2020
Review, Analysis and Design of a Comprehensive Deep Reinforcement Learning Framework Ngoc Duy Nguyen Thanh Thi Nguyen Hai V. Nguyen Doug Creighton S. Nahavandi 163 4 0 27 Feb 2020
Stochastic Regret Minimization in Extensive-Form Games Gabriele Farina Christian Kroer Tuomas Sandholm 150 30 0 19 Feb 2020