Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2002.08456
Cited By
From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization
International Conference on Machine Learning (ICML), 2020
19 February 2020
Julien Perolat
Rémi Munos
Jean-Baptiste Lespiau
Shayegan Omidshafiei
Mark Rowland
Pedro A. Ortega
Neil Burch
Thomas W. Anthony
David Balduzzi
Bart De Vylder
Georgios Piliouras
Marc Lanctot
K. Tuyls
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization"
50 / 61 papers shown
Understanding Optimal Portfolios of Strategies for Solving Two-player Zero-sum Games
Karolina Drabent
Ondřej Kubíček
Viliam Lisý
74
0
0
23 Nov 2025
DiffFP: Learning Behaviors from Scratch via Diffusion-based Fictitious Play
Akash Karthikeyan
Yash Vardhan Pant
167
0
0
17 Nov 2025
Outbidding and Outbluffing Elite Humans: Mastering Liar's Poker via Self-Play and Reinforcement Learning
Richard Dewey
Janos Botyanszki
C. Moallemi
Andrew Zheng
213
0
0
05 Nov 2025
Nash Policy Gradient: A Policy Gradient Method with Iteratively Refined Regularization for Finding Nash Equilibria
Eason Yu
Tzu Hao Liu
Yunke Wang
Clément L. Canonne
Nguyen H. Tran
Chang Xu
206
0
0
21 Oct 2025
Look-ahead Reasoning with a Learned Model in Imperfect Information Games
Ondřej Kubíček
Viliam Lisý
LRM
133
0
0
06 Oct 2025
Efficient Last-Iterate Convergence in Regret Minimization via Adaptive Reward Transformation
Hang Ren
Yulin Wu
Shuhan Qi
Jiajia Zhang
Xiaozhen Sun
Tianzi Ma
Xuan Wang
274
0
0
17 Sep 2025
Last-Iterate Convergence in Adaptive Regret Minimization for Approximate Extensive-Form Perfect Equilibrium
Hang Ren
Xiaozhen Sun
Tianzi Ma
Jiajia Zhang
Xuan Wang
203
0
0
11 Aug 2025
Faster Game Solving via Asymmetry of Step Sizes
Linjian Meng
Youzhi Zhang
Youzhi Zhang
Zhenxing Ge
Yang Gao
369
2
0
17 Mar 2025
Two-Player Zero-Sum Differential Games with One-Sided Information
Mukesh Ghimire
Z. Xu
Yi Ren
SyDa
442
0
0
17 Feb 2025
COMAL: A Convergent Meta-Algorithm for Aligning LLMs with General Preferences
Yongxu Liu
Argyris Oikonomou
Weiqiang Zheng
Yang Cai
Arman Cohan
361
4
0
30 Oct 2024
Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment
International Conference on Learning Representations (ICLR), 2024
Mingzhi Wang
Chengdong Ma
Qizhi Chen
Linjian Meng
Yang Han
Jiancong Xiao
Zhaowei Zhang
Jing Huo
Weijie Su
Wenbo Ding
762
19
0
22 Oct 2024
Last Iterate Convergence in Monotone Mean Field Games
Noboru Isobe
Kenshi Abe
Kaito Ariu
653
0
0
07 Oct 2024
Learning in Games with Progressive Hiding
Adaptive Agents and Multi-Agent Systems (AAMAS), 2024
Benjamin Heymann
Marc Lanctot
325
1
0
05 Sep 2024
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Wenbo Ding
Yu Wang
Yu Wang
SyDa
SSL
OnRL
755
31
0
02 Aug 2024
A Policy-Gradient Approach to Solving Imperfect-Information Games with Best-Iterate Convergence
International Conference on Learning Representations (ICLR), 2024
Mingyang Liu
Gabriele Farina
Asuman Ozdaglar
474
3
0
01 Aug 2024
A Meta-Game Evaluation Framework for Deep Multiagent Reinforcement Learning
Zun Li
Michael P. Wellman
275
5
0
30 Apr 2024
Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent
Hang Xu
Kai Li
Bingyun Liu
Haobo Fu
Qiang Fu
Junliang Xing
Jian Cheng
277
9
0
22 Apr 2024
RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning
Boning Li
Zhixuan Fang
Longbo Huang
164
5
0
07 Mar 2024
Neural Population Learning beyond Symmetric Zero-sum Games
Adaptive Agents and Multi-Agent Systems (AAMAS), 2024
Siqi Liu
Luke Marris
Marc Lanctot
Georgios Piliouras
Joel Z Leibo
N. Heess
MLT
303
6
0
10 Jan 2024
Nash Learning from Human Feedback
International Conference on Machine Learning (ICML), 2023
Rémi Munos
Michal Valko
Daniele Calandriello
M. G. Azar
Mark Rowland
...
Nikola Momchev
Olivier Bachem
D. Mankowitz
Doina Precup
Bilal Piot
752
206
0
01 Dec 2023
Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play
Adaptive Agents and Multi-Agent Systems (AAMAS), 2023
Daniel Bairamian
Philippe Marcotte
Joshua Romoff
Gabriel Robert
Derek Nowrouzezahrai
236
1
0
28 Nov 2023
Fictitious Cross-Play: Learning Global Nash Equilibrium in Mixed Cooperative-Competitive Games
Adaptive Agents and Multi-Agent Systems (AAMAS), 2023
Zelai Xu
Yancheng Liang
Chao Yu
Yu Wang
Yi Wu
327
12
0
05 Oct 2023
Efficient Last-iterate Convergence Algorithms in Solving Games
Lin Meng
Youzhi Zhang
Wenbin Li
Bo An
Yang Gao
Wenbin Li
Tianpei Yang
Bo An
Yang Gao
370
1
0
22 Aug 2023
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Yang Li
Kun Xiong
Yingping Zhang
Jiangcheng Zhu
Alexander Shmakov
Wei Pan
Jun Wang
Zonghong Dai
Yaodong Yang
348
3
0
09 Aug 2023
Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations
Yongyuan Liang
Yanchao Sun
Ruijie Zheng
Xiangyu Liu
Benjamin Eysenbach
Tuomas Sandholm
Furong Huang
Alexander Shmakov
OOD
261
0
0
22 Jul 2023
Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL
International Conference on Learning Representations (ICLR), 2023
Xiangyu Liu
Souradip Chakraborty
Yanchao Sun
Furong Huang
AAML
384
10
0
27 May 2023
Adaptively Perturbed Mirror Descent for Learning in Games
International Conference on Machine Learning (ICML), 2023
Kenshi Abe
Kaito Ariu
Mitsuki Sakamoto
Atsushi Iwasaki
667
9
0
26 May 2023
The Update-Equivalence Framework for Decision-Time Planning
International Conference on Learning Representations (ICLR), 2023
Samuel Sokota
Gabriele Farina
David J. Wu
Hengyuan Hu
Kevin A. Wang
J. Zico Kolter
Noam Brown
390
5
0
25 Apr 2023
ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs
International Conference on Machine Learning (ICML), 2023
Theodore H. Moskovitz
Brendan O'Donoghue
Vivek Veeriah
Sebastian Flennerhag
Satinder Singh
Tom Zahavy
319
24
0
02 Feb 2023
Abstracting Imperfect Information Away from Two-Player Zero-Sum Games
International Conference on Machine Learning (ICML), 2023
Samuel Sokota
Ryan DÓrazio
Chun Kai Ling
David J. Wu
J. Zico Kolter
Noam Brown
337
8
0
22 Jan 2023
Policy Mirror Ascent for Efficient and Independent Learning in Mean Field Games
International Conference on Machine Learning (ICML), 2022
Batuhan Yardim
Semih Cayci
Matthieu Geist
Niao He
406
33
0
29 Dec 2022
Adversarial Policies Beat Superhuman Go AIs
International Conference on Machine Learning (ICML), 2022
T. T. Wang
Adam Gleave
Tom Tseng
Kellin Pelrine
Nora Belrose
...
Michael Dennis
Yawen Duan
V. Pogrebniak
Sergey Levine
Stuart Russell
AAML
449
33
0
01 Nov 2022
Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments
AI Communications (AC), 2022
I. Gemp
Thomas W. Anthony
Yoram Bachrach
Avishkar Bhoopchand
Kalesha Bullard
...
Florian Strub
Andrea Tacchetti
Eugene Tarassov
Zhe Wang
K. Tuyls
LLMAG
AI4CE
226
4
0
22 Sep 2022
Last-Iterate Convergence with Full and Noisy Feedback in Two-Player Zero-Sum Games
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Kenshi Abe
Kaito Ariu
Mitsuki Sakamoto
Kenta Toyoshima
Atsushi Iwasaki
348
15
0
21 Aug 2022
Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games
Rongjun Qin
Fan Luo
Hong Qian
Yang Yu
324
2
0
19 Aug 2022
Provably Efficient Fictitious Play Policy Optimization for Zero-Sum Markov Games with Structured Transitions
International Conference on Machine Learning (ICML), 2022
Delin Qu
Xiaohan Wei
Jieping Ye
Zhaoran Wang
Zhuoran Yang
OffRL
209
12
0
25 Jul 2022
Fast Convergence of Optimistic Gradient Ascent in Network Zero-Sum Extensive Form Games
Algorithmic Game Theory (AGT), 2022
Georgios Piliouras
Lillian J. Ratliff
Ryann Sim
Stratis Skoulakis
MLT
233
4
0
18 Jul 2022
A Survey of Decision Making in Adversarial Games
Science China Information Sciences (Sci. China Inf. Sci.), 2022
Xiuxian Li
Min Meng
Yiguang Hong
Jie-bin Chen
AAML
355
26
0
16 Jul 2022
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Alexander Shmakov
JB Lanier
Kevin A. Wang
Pierre Baldi
Roy Fox
Tuomas Sandholm
296
23
0
13 Jul 2022
The Power of Regularization in Solving Extensive-Form Games
International Conference on Learning Representations (ICLR), 2022
Ming-Yuan Liu
Asuman Ozdaglar
Tiancheng Yu
Jianchao Tan
406
28
0
19 Jun 2022
Mutation-Driven Follow the Regularized Leader for Last-Iterate Convergence in Zero-Sum Games
Conference on Uncertainty in Artificial Intelligence (UAI), 2022
Kenshi Abe
Mitsuki Sakamoto
Atsushi Iwasaki
256
23
0
18 Jun 2022
A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games
Samuel Sokota
Ryan DÓrazio
J. Zico Kolter
Nicolas Loizou
Marc Lanctot
Alexia Jolicoeur-Martineau
Noam Brown
Christian Kroer
386
1
0
12 Jun 2022
ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret
International Conference on Learning Representations (ICLR), 2022
Alexander Shmakov
Gabriele Farina
Marc Lanctot
Tuomas Sandholm
455
30
0
08 Jun 2022
Nash, Conley, and Computation: Impossibility and Incompleteness in Game Dynamics
Jason Milionis
Christos H. Papadimitriou
Georgios Piliouras
Kelly Spendlove
258
11
0
26 Mar 2022
Scalable Deep Reinforcement Learning Algorithms for Mean Field Games
International Conference on Machine Learning (ICML), 2022
Mathieu Laurière
Sarah Perrin
Sertan Girgin
Paul Muller
Ayush Jain
...
Georgios Piliouras
Julien Pérolat
Romuald Élie
Olivier Pietquin
Matthieu Geist
287
62
0
22 Mar 2022
Student of Games: A unified learning algorithm for both perfect and imperfect information games
Science Advances (Sci Adv), 2021
Martin Schmid
Matej Moravcík
Neil Burch
Rudolf Kadlec
Josh Davidson
...
Marc Lanctot
G. Z. Holland
Elnaz Davoodi
Alden Christianson
Michael Bowling
359
31
0
06 Dec 2021
Online Learning in Periodic Zero-Sum Games
Neural Information Processing Systems (NeurIPS), 2021
Tanner Fiez
Ryann Sim
Stratis Skoulakis
Georgios Piliouras
Lillian J. Ratliff
157
18
0
05 Nov 2021
Exploration-Exploitation in Multi-Agent Competition: Convergence with Bounded Rationality
Stefanos Leonardos
Georgios Piliouras
Kelly Spendlove
299
40
0
24 Jun 2021
Online Optimization in Games via Control Theory: Connecting Regret, Passivity and Poincaré Recurrence
International Conference on Machine Learning (ICML), 2021
Yun Kuen Cheung
Georgios Piliouras
188
9
0
09 Jun 2021
Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization
International Conference on Learning Representations (ICLR), 2021
Zhen-Yu Tang
Chao Yu
Boyuan Chen
Huazhe Xu
Xiaolong Wang
Fei Fang
S. Du
Yu Wang
Yi Wu
289
63
0
08 Mar 2021
1
2
Next
Page 1 of 2