Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1207.1411
Cited By
Bayes' Bluff: Opponent Modelling in Poker
4 July 2012
F. Southey
Michael Bowling
Bryce Larson
Carmelo Piccione
Neil Burch
Darse Billings
D. C. Rayner
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Bayes' Bluff: Opponent Modelling in Poker"
50 / 82 papers shown
Title
Strategy-Augmented Planning for Large Language Models via Opponent Exploitation
Shuai Xu
Sijia Cui
Yansen Wang
Bo Xu
Qi Wang
RALM
113
0
0
13 May 2025
PolicyEvol-Agent: Evolving Policy via Environment Perception and Self-Awareness with Theory of Mind
Yajie Yu
Yue Feng
LLMAG
81
0
0
20 Apr 2025
Learning in Games with Progressive Hiding
Benjamin Heymann
Marc Lanctot
72
0
0
05 Sep 2024
A Policy-Gradient Approach to Solving Imperfect-Information Games with Iterate Convergence
Mingyang Liu
Gabriele Farina
Asuman Ozdaglar
76
3
0
01 Aug 2024
LiteEFG: An Efficient Python Library for Solving Extensive-form Games
Mingyang Liu
Gabriele Farina
Asuman Ozdaglar
59
2
0
29 Jul 2024
Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles
Jiesong Lian
Yucong Huang
Chengdong Ma
Mingzhi Wang
Ying Wen
Long Hu
Yixue Hao
154
1
0
31 May 2024
Mixture of Public and Private Distributions in Imperfect Information Games
Jérôme Arjonilla
Abdallah Saffidine
Tristan Cazenave
146
1
0
23 May 2024
Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent
Hang Xu
Kai Li
Bingyun Liu
Haobo Fu
Qiang Fu
Junliang Xing
Jian Cheng
70
3
0
22 Apr 2024
A Survey on Large Language Model-Based Game Agents
Sihao Hu
Tiansheng Huang
Gaowen Liu
Ramana Rao Kompella
Gaowen Liu
Selim Furkan Tekin
Yichang Xu
Zachary Yahn
Ling Liu
LLMAG
LM&Ro
AI4CE
LM&MA
226
57
0
02 Apr 2024
Neural Population Learning beyond Symmetric Zero-sum Games
Siqi Liu
Luke Marris
Marc Lanctot
Georgios Piliouras
Joel Z Leibo
N. Heess
MLT
89
3
0
10 Jan 2024
Efficient Learning in Polyhedral Games via Best Response Oracles
Darshan Chakrabarti
Gabriele Farina
Christian Kroer
62
4
0
06 Dec 2023
Guarantees for Self-Play in Multiplayer Games via Polymatrix Decomposability
Revan MacQueen
James R. Wright
72
2
0
17 Oct 2023
Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4
Jiaxian Guo
Bo Yang
Paul D. Yoo
Bill Yuchen Lin
Yusuke Iwasawa
Yutaka Matsuo
LLMAG
118
45
0
29 Sep 2023
Local and adaptive mirror descents in extensive-form games
Côme Fiegel
Pierre Ménard
Tadashi Kozuno
Rémi Munos
Vianney Perchet
Michal Valko
104
2
0
01 Sep 2023
Block-Coordinate Methods and Restarting for Solving Extensive-Form Games
D. Chakrabarti
Jelena Diakonikolas
Christian Kroer
63
8
0
31 Jul 2023
Policy Space Diversity for Non-Transitive Games
Jian Yao
Weiming Liu
Haobo Fu
Yaodong Yang
Stephen Marcus McAleer
Qiang Fu
Wei Yang
109
11
0
29 Jun 2023
Hierarchical Deep Counterfactual Regret Minimization
Jiayu Chen
Tian-Shing Lan
Vaneet Aggarwal
71
3
0
27 May 2023
Regret Matching+: (In)Stability and Fast Convergence in Games
Gabriele Farina
Julien Grand-Clément
Christian Kroer
Chung-Wei Lee
Haipeng Luo
57
7
0
24 May 2023
Equilibrium-Invariant Embedding, Metric Space, and Fundamental Set of
2
×
2
2\times2
2
×
2
Normal-Form Games
Luke Marris
I. Gemp
Georgios Piliouras
36
4
0
19 Apr 2023
Pgx: Hardware-Accelerated Parallel Game Simulators for Reinforcement Learning
Sotetsu Koyamada
Shinri Okano
Soichiro Nishimori
Y. Murata
Keigo Habara
Haruka Kita
Shin Ishii
110
26
0
29 Mar 2023
Convergence analysis and acceleration of the smoothing methods for solving extensive-form games
Keigo Habara
E. H. Fukuda
N. Yamashita
12
0
0
20 Mar 2023
Adapting to game trees in zero-sum imperfect information games
Côme Fiegel
Pierre Ménard
Tadashi Kozuno
Rémi Munos
Vianney Perchet
Michal Valko
383
10
0
23 Dec 2022
The Power of Regularization in Solving Extensive-Form Games
Ming-Yuan Liu
Asuman Ozdaglar
Tiancheng Yu
Kai Zhang
58
23
0
19 Jun 2022
A Marriage between Adversarial Team Games and 2-player Games: Enabling Abstractions, No-regret Learning, and Subgame Solving
Luca Carminati
Federico Cacciamani
Marco Ciccone
N. Gatti
56
16
0
18 Jun 2022
A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games
Samuel Sokota
Ryan DÓrazio
J. Zico Kolter
Nicolas Loizou
Marc Lanctot
Ioannis Mitliagkas
Noam Brown
Christian Kroer
72
1
0
12 Jun 2022
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections
Dustin Morrill
Ryan DÓrazio
Marc Lanctot
J. R. Wright
Michael Bowling
Amy Greenwald
124
21
0
24 May 2022
DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning
Youpeng Zhao
Jian Zhao
Xu Hu
Wen-gang Zhou
Houqiang Li
65
15
0
06 Apr 2022
Optimal Correlated Equilibria in General-Sum Extensive-Form Games: Fixed-Parameter Algorithms, Hardness, and Two-Sided Column-Generation
B. Zhang
Gabriele Farina
A. Celli
Tuomas Sandholm
81
22
0
14 Mar 2022
Kernelized Multiplicative Weights for 0/1-Polyhedral Games: Bridging the Gap Between Learning in Extensive-Form and Normal-Form Games
Gabriele Farina
Chung-Wei Lee
Haipeng Luo
Christian Kroer
46
32
0
01 Feb 2022
Student of Games: A unified learning algorithm for both perfect and imperfect information games
Martin Schmid
Matej Moravcík
Neil Burch
Rudolf Kadlec
Josh Davidson
...
Marc Lanctot
G. Z. Holland
Elnaz Davoodi
Alden Christianson
Michael Bowling
86
22
0
06 Dec 2021
Equivalence Analysis between Counterfactual Regret Minimization and Online Mirror Descent
Weiming Liu
Huacong Jiang
Bin Li
Houqiang Li
57
10
0
11 Oct 2021
Last-iterate Convergence in Extensive-Form Games
Chung-Wei Lee
Christian Kroer
Haipeng Luo
190
40
0
27 Jun 2021
Iterative Empirical Game Solving via Single Policy Best Response
Max O. Smith
Thomas W. Anthony
Michael P. Wellman
63
18
0
03 Jun 2021
Better Regularization for Sequential Decision Spaces: Fast Convergence Rates for Nash, Correlated, and Team Equilibria
Gabriele Farina
Christian Kroer
Tuomas Sandholm
61
26
0
27 May 2021
D2CFR: Minimize Counterfactual Regret with Deep Dueling Neural Network
Huale Li
Xuan Wang
Zengyue Guo
Jia-jia Zhang
Shuhan Qi
36
1
0
26 May 2021
Online Double Oracle
Le Cong Dinh
Yaodong Yang
Stephen Marcus McAleer
Zheng Tian
Nicolas Perez Nieves
Oliver Slumbers
D. Mguni
Haitham Bou-Ammar
Jun Wang
122
31
0
13 Mar 2021
Bandit Linear Optimization for Sequential Decision Making and Extensive-Form Games
Gabriele Farina
Robin Schmucker
Tuomas Sandholm
173
21
0
08 Mar 2021
Model-Free Online Learning in Unknown Sequential Decision Making Problems and Games
Gabriele Farina
Tuomas Sandholm
OffRL
83
18
0
08 Mar 2021
ScrofaZero: Mastering Trick-taking Poker Game Gongzhu by Deep Reinforcement Learning
Naichen Shi
Ruichen Li
Sun Youran
40
0
0
15 Feb 2021
Safe Search for Stackelberg Equilibria in Extensive-Form Games
Chun Kai Ling
Noam Brown
OffRL
31
3
0
02 Feb 2021
Deep Interactive Bayesian Reinforcement Learning via Meta-Learning
L. Zintgraf
Sam Devlin
K. Ciosek
Shimon Whiteson
Katja Hofmann
BDL
67
45
0
11 Jan 2021
Model-free Neural Counterfactual Regret Minimization with Bootstrap Learning
Weiming Liu
Bin Li
Julian Togelius
83
8
0
03 Dec 2020
Faster Algorithms for Optimal Ex-Ante Coordinated Collusive Strategies in Extensive-Form Zero-Sum Games
Gabriele Farina
A. Celli
N. Gatti
Tuomas Sandholm
32
3
0
21 Sep 2020
Faster Game Solving via Predictive Blackwell Approachability: Connecting Regret Matching and Mirror Descent
Gabriele Farina
Christian Kroer
Tuomas Sandholm
124
74
0
28 Jul 2020
DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
Eric Steinberger
Adam Lerer
Noam Brown
132
54
0
18 Jun 2020
Sparsified Linear Programming for Zero-Sum Equilibrium Finding
B. Zhang
Tuomas Sandholm
66
10
0
05 Jun 2020
No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium
A. Celli
A. Marchesi
Gabriele Farina
N. Gatti
146
47
0
01 Apr 2020
Robust Stochastic Bayesian Games for Behavior Space Coverage
Julian Bernhard
Alois Knoll
114
3
0
25 Mar 2020
Review, Analysis and Design of a Comprehensive Deep Reinforcement Learning Framework
Ngoc Duy Nguyen
Thanh Thi Nguyen
Hai V. Nguyen
Doug Creighton
S. Nahavandi
163
4
0
27 Feb 2020
Stochastic Regret Minimization in Extensive-Form Games
Gabriele Farina
Christian Kroer
Tuomas Sandholm
150
30
0
19 Feb 2020
1
2
Next