Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1701.01724
Cited By
v1
v2
v3 (latest)
DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker
6 January 2017
Matej Moravcík
Martin Schmid
Neil Burch
Viliam Lisý
Dustin Morrill
Nolan Bard
Trevor Davis
Kevin Waugh
Michael Bradley Johanson
Michael Bowling
BDL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker"
50 / 306 papers shown
Title
On Representation Complexity of Model-based and Model-free Reinforcement Learning
Hanlin Zhu
Baihe Huang
Stuart Russell
OffRL
76
4
0
03 Oct 2023
Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4
Jiaxian Guo
Bo Yang
Paul D. Yoo
Bill Yuchen Lin
Yusuke Iwasawa
Yutaka Matsuo
LLMAG
105
45
0
29 Sep 2023
Efficient Last-iterate Convergence Algorithms in Solving Games
Lin Meng
Zhenxing Ge
Wenbin Li
Bo An
Yang Gao
Wenbin Li
Tianpei Yang
Bo An
Yang Gao
70
0
0
22 Aug 2023
AI planning in the imagination: High-level planning on learned abstract search spaces
Carlos Martin
Tuomas Sandholm
68
0
0
16 Aug 2023
PokerKit: A Comprehensive Python Library for Fine-Grained Multi-Variant Poker Game Simulations
Juho Kim
21
3
0
08 Aug 2023
Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations
Yongyuan Liang
Yanchao Sun
Ruijie Zheng
Xiangyu Liu
Benjamin Eysenbach
Tuomas Sandholm
Furong Huang
Stephen Marcus McAleer
OOD
70
0
0
22 Jul 2023
PyTAG: Challenges and Opportunities for Reinforcement Learning in Tabletop Games
Martin Balla
G. E. Long
Dominik Jeurissen
J. Goodman
Raluca D. Gaina
Diego Perez-Liebana
LMTD
OffRL
OnRL
82
1
0
19 Jul 2023
Composing Efficient, Robust Tests for Policy Selection
Dustin Morrill
Thomas J. Walsh
D. Hernández
Peter R. Wurman
Peter Stone
59
0
0
12 Jun 2023
Dual policy as self-model for planning
J. Yoo
Fernanda De La Torre
G. R. Yang
22
1
0
07 Jun 2023
Strategic Reasoning with Language Models
Kanishk Gandhi
Dorsa Sadigh
Noah D. Goodman
LM&Ro
LRM
79
41
0
30 May 2023
Hierarchical Deep Counterfactual Regret Minimization
Jiayu Chen
Tian-Shing Lan
Vaneet Aggarwal
71
3
0
27 May 2023
Regret Matching+: (In)Stability and Fast Convergence in Games
Gabriele Farina
Julien Grand-Clément
Christian Kroer
Chung-Wei Lee
Haipeng Luo
57
7
0
24 May 2023
Zero-sum Polymatrix Markov Games: Equilibrium Collapse and Efficient Computation of Nash Equilibria
Fivos Kalogiannis
Ioannis Panageas
89
8
0
23 May 2023
Information Design in Multi-Agent Reinforcement Learning
Yue Lin
Wenhao Li
H. Zha
Baoxiang Wang
72
11
0
08 May 2023
The Update-Equivalence Framework for Decision-Time Planning
Samuel Sokota
Gabriele Farina
David J. Wu
Hengyuan Hu
Kevin A. Wang
J. Zico Kolter
Noam Brown
115
4
0
25 Apr 2023
Can Large Language Models Play Text Games Well? Current State-of-the-Art and Open Questions
Chen Feng Tsai
Xiaochen Zhou
Sierra S. Liu
Jing Li
Mo Yu
Hongyuan Mei
LLMAG
ELM
AI4MH
LM&MA
102
32
0
06 Apr 2023
Learning not to Regret
David Sychrovský
Michal Sustr
Elnaz Davoodi
Michael Bowling
Marc Lanctot
Martin Schmid
90
4
0
02 Mar 2023
Population-size-Aware Policy Optimization for Mean-Field Games
Pengdeng Li
Xinrun Wang
Shuxin Li
Hau Chan
Bo An
62
2
0
07 Feb 2023
Combining Deep Reinforcement Learning and Search with Generative Models for Game-Theoretic Opponent Modeling
Zun Li
Marc Lanctot
Kevin R. McKee
Luke Marris
I. Gemp
Daniel Hennes
Paul Muller
Kate Larson
Yoram Bachrach
Michael P. Wellman
66
11
0
01 Feb 2023
Reinforcement Learning from Diverse Human Preferences
Wanqi Xue
Bo An
Shuicheng Yan
Zhongwen Xu
75
25
0
27 Jan 2023
Abstracting Imperfect Information Away from Two-Player Zero-Sum Games
Samuel Sokota
Ryan DÓrazio
Chun Kai Ling
David J. Wu
J. Zico Kolter
Noam Brown
100
4
0
22 Jan 2023
Mutation Testing of Deep Reinforcement Learning Based on Real Faults
Florian Tambon
Vahid Majdinasab
Amin Nikanjam
Foutse Khomh
G. Antoniol
92
8
0
13 Jan 2023
Function Approximation for Solving Stackelberg Equilibrium in Large Perfect Information Games
Chun Kai Ling
J. Zico Kolter
Fei Fang
60
0
0
29 Dec 2022
Adapting to game trees in zero-sum imperfect information games
Côme Fiegel
Pierre Ménard
Tadashi Kozuno
Rémi Munos
Vianney Perchet
Michal Valko
376
10
0
23 Dec 2022
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox
Qiyue Yin
Tongtong Yu
S. Shen
Jun Yang
Meijing Zhao
Kaiqi Huang
Bin Liang
Liangsheng Wang
OffRL
74
13
0
01 Dec 2022
Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian
Paria Rashidinejad
Hanlin Zhu
Kunhe Yang
Stuart J. Russell
Jiantao Jiao
OffRL
178
31
0
01 Nov 2022
DanZero: Mastering GuanDan Game with Reinforcement Learning
Yudong Lu
Jian Zhao
Youpeng Zhao
Wen-gang Zhou
Houqiang Li
61
6
0
31 Oct 2022
Observable Perfect Equilibrium
Sam Ganzfried
10
0
0
29 Oct 2022
HSVI can solve zero-sum Partially Observable Stochastic Games
Aurélien Delage
Olivier Buffet
J. Dibangoye
Abdallah Saffidine
96
11
0
26 Oct 2022
On the convergence of policy gradient methods to Nash equilibria in general stochastic games
Angeliki Giannou
Kyriakos Lotidis
P. Mertikopoulos
Emmanouil-Vasileios Vlatakis-Gkaragkounis
124
18
0
17 Oct 2022
Activation Learning by Local Competitions
Hongchao Zhou
AAML
96
7
0
26 Sep 2022
Why Deep Learning's Performance Data Are Misleading
J. Weng
34
10
0
23 Aug 2022
Learning Correlated Equilibria in Mean-Field Games
Paul Muller
Romuald Elie
Mark Rowland
Mathieu Lauriere
Julien Perolat
Sarah Perrin
Matthieu Geist
Georgios Piliouras
Olivier Pietquin
K. Tuyls
85
6
0
22 Aug 2022
Efficiently Computing Nash Equilibria in Adversarial Team Markov Games
Fivos Kalogiannis
Ioannis Anagnostides
Ioannis Panageas
Emmanouil-Vasileios Vlatakis-Gkaragkounis
Vaggos Chatziafratis
S. Stavroulakis
73
13
0
03 Aug 2022
Supervised and Reinforcement Learning from Observations in Reconnaissance Blind Chess
T. Bertram
Johannes Furnkranz
Martin Müller
SSL
OnRL
88
7
0
03 Aug 2022
A Maintenance Planning Framework using Online and Offline Deep Reinforcement Learning
Zaharah Bukhsh
N. Jansen
Hajo Molegraaf
OffRL
AI4CE
116
6
0
01 Aug 2022
Mimetic Models: Ethical Implications of AI that Acts Like You
Reid McIlroy-Young
Jon M. Kleinberg
S. Sen
Solon Barocas
Ashton Anderson
75
17
0
19 Jul 2022
Fast Convergence of Optimistic Gradient Ascent in Network Zero-Sum Extensive Form Games
Georgios Piliouras
Lillian J. Ratliff
Ryann Sim
Stratis Skoulakis
MLT
69
3
0
18 Jul 2022
A Survey of Decision Making in Adversarial Games
Xiuxian Li
Min Meng
Yiguang Hong
Jie-bin Chen
AAML
97
15
0
16 Jul 2022
Algorithms to estimate Shapley value feature attributions
Hugh Chen
Ian Covert
Scott M. Lundberg
Su-In Lee
TDI
FAtt
93
235
0
15 Jul 2022
A Simple Adaptive Procedure Converging to Forgiving Correlated Equilibria
Hugh Zhang
66
4
0
13 Jul 2022
Self-Explaining Deviations for Coordination
Hengyuan Hu
Samuel Sokota
David J. Wu
A. Bakhtin
Andrei Lupu
Brandon Cui
Jakob N. Foerster
53
2
0
13 Jul 2022
Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Julien Perolat
Bart De Vylder
Daniel Hennes
Eugene Tarassov
Florian Strub
...
Rémi Munos
David Silver
Satinder Singh
Demis Hassabis
K. Tuyls
101
205
0
30 Jun 2022
Generalized Beliefs for Cooperative AI
Darius Muglich
L. Zintgraf
Christian Schroeder de Witt
Shimon Whiteson
Jakob N. Foerster
85
7
0
26 Jun 2022
A Marriage between Adversarial Team Games and 2-player Games: Enabling Abstractions, No-regret Learning, and Subgame Solving
Luca Carminati
Federico Cacciamani
Marco Ciccone
N. Gatti
56
16
0
18 Jun 2022
Near-Optimal No-Regret Learning Dynamics for General Convex Games
Gabriele Farina
Ioannis Anagnostides
Haipeng Luo
Chung‐Wei Lee
Christian Kroer
Tuomas Sandholm
59
29
0
17 Jun 2022
Principal Trade-off Analysis
Alexander Strang
David Sewell
Alexander Kim
K. Alcedo
D. Rosenbluth
43
1
0
09 Jun 2022
ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret
Stephen Marcus McAleer
Gabriele Farina
Marc Lanctot
Tuomas Sandholm
174
26
0
08 Jun 2022
Understanding and Preventing Capacity Loss in Reinforcement Learning
Clare Lyle
Mark Rowland
Will Dabney
CLL
105
115
0
20 Apr 2022
Metaethical Perspectives on 'Benchmarking' AI Ethics
Travis LaCroix
A. Luccioni
57
8
0
11 Apr 2022
Previous
1
2
3
4
5
6
7
Next