1701.01724
Cited By
v1
v2
v3 (latest)
DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker
6 January 2017
Matej Moravčík
Martin Schmid
Neil Burch
Viliam Lisý
Dustin Morrill
Nolan Bard
Trevor Davis
Kevin Waugh
Michael Bradley Johanson
Michael Bowling
BDL
Papers citing "DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker"
50 / 306 papers shown
Title
Empirical Validation of the Independent Chip Model
Juho Kim
20
0
0
30 May 2025
Generalization in Monitored Markov Decision Processes (Mon-MDPs)
Montaser Mohammedalamen
Michael Bowling
97
0
0
13 May 2025
Meta-Learning in Self-Play Regret Minimization
David Sychrovský
Martin Schmid
Michal Sustr
Michael Bowling
71
0
0
26 Apr 2025
Approximating Nash Equilibria in General-Sum Games via Meta-Learning
David Sychrovský
Christopher Solinas
Revan MacQueen
Kevin Wang
James Wright
Nathan R Sturtevant
Michael Bowling
54
0
0
26 Apr 2025
Rethinking the Foundations for Continual Reinforcement Learning
Michael Bowling
Esraa Elelimy
CLL
OffRL
LRM
83
4
0
10 Apr 2025
Human-Level Competitive Pokémon via Scalable Offline Reinforcement Learning with Transformers
Jake Grigsby
Yuqi Xie
Justin Sasek
Steven Zheng
Yuke Zhu
OffRL
87
1
0
06 Apr 2025
Faster Rates for No-Regret Learning in General Games via Cautious Optimism
Ashkan Soleymani
Georgios Piliouras
Gabriele Farina
103
1
0
31 Mar 2025
Asynchronous Predictive Counterfactual Regret Minimization$^+$ Algorithm in Solving Extensive-Form Games
Linjian Meng
Youzhi Zhang
Zhenxing Ge
Tianpei Yang
Yang Gao
100
0
0
17 Mar 2025
Multi-Agent Q-Learning Dynamics in Random Networks: Convergence due to Exploration and Sparsity
A. Hussain
D. Leonte
Francesco Belardinelli
Raphael Huser
Dario Paccagnan
78
0
0
13 Mar 2025
Q-MARL: A quantum-inspired algorithm using neural message passing for large-scale multi-agent reinforcement learning
Kha Vo
Chin-Teng Lin
GNN
100
0
0
10 Mar 2025
On Separation Between Best-Iterate, Random-Iterate, and Last-Iterate Convergence of Learning in Games
Yang Cai
Gabriele Farina
Julien Grand-Clément
Christian Kroer
Chung-Wei Lee
Haipeng Luo
Weiqiang Zheng
77
1
0
04 Mar 2025
Two-Player Zero-Sum Differential Games with One-Sided Information
Mukesh Ghimire
Z. Xu
Yi Ren
SyDa
198
0
0
17 Feb 2025
A Survey on Large Language Model-Based Social Agents in Game-Theoretic Scenarios
Xiachong Feng
Longxu Dou
Ella Li
Qinghao Wang
Haoran Wang
Yu Guo
Chang Ma
Lingpeng Kong
LM&Ro
LM&MA
ELM
LLMAG
AI4CE
150
7
0
05 Dec 2024
Learning in Markov Games with Adaptive Adversaries: Policy Regret, Fundamental Barriers, and Efficient Algorithms
Thanh Nguyen-Tang
Raman Arora
113
1
0
01 Nov 2024
System 2 Reasoning via Generality and Adaptation
Sejin Kim
Sundong Kim
LRM
AI4CE
120
0
0
10 Oct 2024
Learning in Games with Progressive Hiding
Benjamin Heymann
Marc Lanctot
62
0
0
05 Sep 2024
GPU-Accelerated Counterfactual Regret Minimization
Juho Kim
78
0
0
27 Aug 2024
In-Context Exploiter for Extensive-Form Games
Shuxin Li
Chang Yang
Youzhi Zhang
Pengdeng Li
Xinrun Wang
Xiao Huang
Hau Chan
Bo An
76
0
0
10 Aug 2024
Evaluating and Enhancing LLMs Agent based on Theory of Mind in Guandan: A Multi-Player Cooperative Game under Imperfect Information
Yauwai Yim
Chunkit Chan
Tianyu Shi
Zheye Deng
Wei Fan
Tianshi Zheng
Yangqiu Song
LLMAG
98
13
0
05 Aug 2024
Perfect Information Monte Carlo with Postponing Reasoning
Jérôme Arjonilla
Abdallah Saffidine
Tristan Cazenave
66
0
0
05 Aug 2024
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
SyDa
SSL
OnRL
168
9
0
02 Aug 2024
Neural Network-based Information Set Weighting for Playing Reconnaissance Blind Chess
Timo Bertram
Johannes Fürnkranz
Martin Müller
108
1
0
08 Jul 2024
XQSV: A Structurally Variable Network to Imitate Human Play in Xiangqi
Chenliang Zhou
GNN
70
0
0
05 Jul 2024
A Simple, Solid, and Reproducible Baseline for Bridge Bidding AI
Haruka Kita
Sotetsu Koyamada
Yotaro Yamaguchi
Shin Ishii
73
0
0
14 Jun 2024
FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning
Wenzhe Li
Zihan Ding
Seth Karten
Chi Jin
103
2
0
04 Jun 2024
Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment
Chen Zhang
Qiang He
Zhou Yuan
Elvis S. Liu
Hong Wang
Jian Zhao
Yang-Feng Wang
116
2
0
03 Jun 2024
Mixture of Public and Private Distributions in Imperfect Information Games
Jérôme Arjonilla
Abdallah Saffidine
Tristan Cazenave
146
1
0
23 May 2024
Learning to Beat ByteRL: Exploitability of Collectible Card Game Agents
Radovan Haluška
Martin Schmid
LLMAG
81
0
0
25 Apr 2024
Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent
Hang Xu
Kai Li
Bingyun Liu
Haobo Fu
Qiang Fu
Junliang Xing
Jian Cheng
70
3
0
22 Apr 2024
Transformer Based Planning in the Observation Space with Applications to Trick Taking Card Games
Douglas Rebstock
Christopher Solinas
Nathan R Sturtevant
M. Buro
49
0
0
19 Apr 2024
HSVI-based Online Minimax Strategies for Partially Observable Stochastic Games with Neural Perception Mechanisms
R. Yan
G. Santos
G. Norman
David Parker
Marta Z. Kwiatkowska
69
2
0
16 Apr 2024
LookALike: Human Mimicry based collaborative decision making
Rabimba Karanjai
Weidong Shi
45
0
0
16 Mar 2024
Trust in AI: Progress, Challenges, and Future Directions
S. Afroogh
Ali Akbari
Evan Malone
Mohammadali Kargar
Hananeh Alambeigi
AI4TS
105
40
0
12 Mar 2024
Mastering the Game of Guandan with Deep Reinforcement Learning and Behavior Regulating
Yifan YangGong
Haojun Pan
Lei Wang
72
1
0
21 Feb 2024
Enabling Multi-Agent Transfer Reinforcement Learning via Scenario Independent Representation
Ayesha Siddika Nipu
Siming Liu
Anthony Harris
43
2
0
13 Feb 2024
A Reinforcement Learning Approach for Dynamic Rebalancing in Bike-Sharing System
Jiaqi Liang
Sanjay Dominik Jena
Defeng Liu
Andrea Lodi
103
1
0
05 Feb 2024
PokerGPT: An End-to-End Lightweight Solver for Multi-Player Texas Hold'em via Large Language Model
Chenghao Huang
Yanbo Cao
Yinlong Wen
Tao Zhou
Yanru Zhang
OffRL
LLMAG
81
7
0
04 Jan 2024
Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence Beyond the Minty Property
Ioannis Anagnostides
Ioannis Panageas
Gabriele Farina
Tuomas Sandholm
88
3
0
19 Dec 2023
Recording and Describing Poker Hands
Juho Kim
LMTD
57
0
0
18 Dec 2023
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
154
5
0
13 Dec 2023
Computing Perfect Bayesian Equilibria in Sequential Auctions with Verification
Vinzenz Thoma
Vitor Bosshard
Sven Seuken
155
1
0
07 Dec 2023
DanZero+: Dominating the GuanDan Game through Reinforcement Learning
Youpeng Zhao
Yudong Lu
Jian Zhao
Wen-gang Zhou
Houqiang Li
82
6
0
05 Dec 2023
History Filtering in Imperfect Information Games: Algorithms and Complexity
Christopher Solinas
Douglas Rebstock
Nathan R Sturtevant
M. Buro
72
0
0
24 Nov 2023
PcLast: Discovering Plannable Continuous Latent States
Anurag Koul
Shivakanth Sujit
Shaoru Chen
Ben Evans
Lili Wu
...
Yonathan Efroni
Lekan Molu
Miro Dudik
John Langford
Alex Lamb
OffRL
BDL
102
1
0
06 Nov 2023
Last-Iterate Convergence Properties of Regret-Matching Algorithms in Games
Yang Cai
Gabriele Farina
Julien Grand-Clément
Christian Kroer
Chung-Wei Lee
Haipeng Luo
Weiqiang Zheng
80
2
0
01 Nov 2023
Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game
Zelai Xu
Chao Yu
Fei Fang
Yu Wang
Yi Wu
LLMAG
125
95
0
29 Oct 2023
Partially Observable Stochastic Games with Neural Perception Mechanisms
R. Yan
G. Santos
G. Norman
David Parker
Marta Z. Kwiatkowska
80
4
0
17 Oct 2023
Guarantees for Self-Play in Multiplayer Games via Polymatrix Decomposability
Revan MacQueen
James R. Wright
72
2
0
17 Oct 2023
BridgeHand2Vec Bridge Hand Representation
Anna Sztyber-Betley
Filip Kolodziej
Jan Betley
Piotr Duszak
GAN
38
0
0
10 Oct 2023
$\mathcal{B}$-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis
Zishun Yu
Yunzhe Tao
Liyu Chen
Tao Sun
Hongxia Yang
83
13
0
04 Oct 2023