From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization

19 February 2020
Julien Perolat
Rémi Munos
Jean-Baptiste Lespiau
Shayegan Omidshafiei
Mark Rowland
Pedro A. Ortega
Neil Burch
Thomas W. Anthony
David Balduzzi
Bart De Vylder
Georgios Piliouras
Marc Lanctot
K. Tuyls

Papers citing "From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization"

50 / 54 papers shown
Asynchronous Predictive Counterfactual Regret Minimization$^+$ Algorithm in Solving Extensive-Form Games
Linjian Meng
Youzhi Zhang
Zhenxing Ge
Tianpei Yang
Yang Gao
100
0
0
17 Mar 2025
Two-Player Zero-Sum Differential Games with One-Sided Information
Mukesh Ghimire
Z. Xu
Yi Ren
SyDa
198
0
0
17 Feb 2025
COMAL: A Convergent Meta-Algorithm for Aligning LLMs with General Preferences
Yongxu Liu
Argyris Oikonomou
Weiqiang Zheng
Yang Cai
Arman Cohan
93
1
0
30 Oct 2024
Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment
Mingzhi Wang
Chengdong Ma
Qizhi Chen
Linjian Meng
Yang Han
Jiancong Xiao
Zhaowei Zhang
Jing Huo
Weijie Su
Yaodong Yang
135
9
0
22 Oct 2024
Last Iterate Convergence in Monotone Mean Field Games
Noboru Isobe
Kenshi Abe
Kaito Ariu
94
0
0
07 Oct 2024
Learning in Games with Progressive Hiding
Benjamin Heymann
Marc Lanctot
72
0
0
05 Sep 2024
A Survey on Self-play Methods in Reinforcement Learning
Chao Yu
Zelai Xu
Chengdong Ma
Chao Yu
Weijuan Tu
...
Deheng Ye
Wenbo Ding
Yaodong Yang
Yu Wang
SyDa SSL OnRL
168
9
0
02 Aug 2024
A Policy-Gradient Approach to Solving Imperfect-Information Games with Iterate Convergence
Mingyang Liu
Gabriele Farina
Asuman Ozdaglar
76
3
0
01 Aug 2024
A Meta-Game Evaluation Framework for Deep Multiagent Reinforcement Learning
Zun Li
Michael P. Wellman
75
1
0
30 Apr 2024
Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent
Hang Xu
Kai Li
Bingyun Liu
Haobo Fu
Qiang Fu
Junliang Xing
Jian Cheng
70
3
0
22 Apr 2024
RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning
Boning Li
Zhixuan Fang
Longbo Huang
42
0
0
07 Mar 2024
Neural Population Learning beyond Symmetric Zero-sum Games
Siqi Liu
Luke Marris
Marc Lanctot
Georgios Piliouras
Joel Z Leibo
N. Heess
MLT
89
3
0
10 Jan 2024
Nash Learning from Human Feedback
Rémi Munos
Michal Valko
Daniele Calandriello
M. G. Azar
Mark Rowland
...
Nikola Momchev
Olivier Bachem
D. Mankowitz
Doina Precup
Bilal Piot
130
147
0
01 Dec 2023
Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play
Daniel Bairamian
Philippe Marcotte
Joshua Romoff
Gabriel Robert
Derek Nowrouzezahrai
74
0
0
28 Nov 2023
Fictitious Cross-Play: Learning Global Nash Equilibrium in Mixed Cooperative-Competitive Games
Zelai Xu
Yancheng Liang
Chao Yu
Yu Wang
Yi Wu
93
9
0
05 Oct 2023
Efficient Last-iterate Convergence Algorithms in Solving Games
Lin Meng
Zhenxing Ge
Wenbin Li
Tianpei Yang
Bo An
Yang Gao
73
0
0
22 Aug 2023
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games
Yang Li
Kun Xiong
Yingping Zhang
Jiangcheng Zhu
Stephen Marcus McAleer
Wei Pan
Jun Wang
Zonghong Dai
Yaodong Yang
126
2
0
09 Aug 2023
Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations
Yongyuan Liang
Yanchao Sun
Ruijie Zheng
Xiangyu Liu
Benjamin Eysenbach
Tuomas Sandholm
Furong Huang
Stephen Marcus McAleer
OOD
82
0
0
22 Jul 2023
Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL
Xiangyu Liu
Souradip Chakraborty
Yanchao Sun
Furong Huang
AAML
66
5
0
27 May 2023
Adaptively Perturbed Mirror Descent for Learning in Games
Kenshi Abe
Kaito Ariu
Mitsuki Sakamoto
Atsushi Iwasaki
57
6
0
26 May 2023
The Update-Equivalence Framework for Decision-Time Planning
Samuel Sokota
Gabriele Farina
David J. Wu
Hengyuan Hu
Kevin A. Wang
J. Zico Kolter
Noam Brown
118
4
0
25 Apr 2023
ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs
Theodore H. Moskovitz
Brendan O'Donoghue
Vivek Veeriah
Sebastian Flennerhag
Satinder Singh
Tom Zahavy
96
21
0
02 Feb 2023
Abstracting Imperfect Information Away from Two-Player Zero-Sum Games
Samuel Sokota
Ryan D'Orazio
Chun Kai Ling
David J. Wu
J. Zico Kolter
Noam Brown
100
4
0
22 Jan 2023
Policy Mirror Ascent for Efficient and Independent Learning in Mean Field Games
Batuhan Yardim
Semih Cayci
Matthieu Geist
Niao He
126
29
0
29 Dec 2022
Adversarial Policies Beat Superhuman Go AIs
T. T. Wang
Adam Gleave
Tom Tseng
Kellin Pelrine
Nora Belrose
...
Michael Dennis
Yawen Duan
V. Pogrebniak
Sergey Levine
Stuart Russell
AAML
82
22
0
01 Nov 2022
Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments
I. Gemp
Thomas W. Anthony
Yoram Bachrach
Avishkar Bhoopchand
Kalesha Bullard
...
Florian Strub
Andrea Tacchetti
Eugene Tarassov
Zhe Wang
K. Tuyls
LLMAG AI4CE
88
3
0
22 Sep 2022
Last-Iterate Convergence with Full and Noisy Feedback in Two-Player Zero-Sum Games
Kenshi Abe
Kaito Ariu
Mitsuki Sakamoto
Kenta Toyoshima
Atsushi Iwasaki
84
12
0
21 Aug 2022
Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games
Rongjun Qin
Fan Luo
Hong Qian
Yang Yu
64
2
0
19 Aug 2022
Provably Efficient Fictitious Play Policy Optimization for Zero-Sum Markov Games with Structured Transitions
Shuang Qiu
Xiaohan Wei
Jieping Ye
Zhaoran Wang
Zhuoran Yang
OffRL
67
12
0
25 Jul 2022
Fast Convergence of Optimistic Gradient Ascent in Network Zero-Sum Extensive Form Games
Georgios Piliouras
Lillian J. Ratliff
Ryann Sim
Stratis Skoulakis
MLT
69
3
0
18 Jul 2022
A Survey of Decision Making in Adversarial Games
Xiuxian Li
Min Meng
Yiguang Hong
Jie-bin Chen
AAML
97
15
0
16 Jul 2022
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Stephen Marcus McAleer
JB Lanier
Kevin A. Wang
Pierre Baldi
Roy Fox
Tuomas Sandholm
75
18
0
13 Jul 2022
The Power of Regularization in Solving Extensive-Form Games
Ming-Yuan Liu
Asuman Ozdaglar
Tiancheng Yu
Kai Zhang
58
23
0
19 Jun 2022
Mutation-Driven Follow the Regularized Leader for Last-Iterate Convergence in Zero-Sum Games
Kenshi Abe
Mitsuki Sakamoto
Atsushi Iwasaki
65
18
0
18 Jun 2022
A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games
Samuel Sokota
Ryan D'Orazio
J. Zico Kolter
Nicolas Loizou
Marc Lanctot
Ioannis Mitliagkas
Noam Brown
Christian Kroer
72
1
0
12 Jun 2022
ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret
Stephen Marcus McAleer
Gabriele Farina
Marc Lanctot
Tuomas Sandholm
174
26
0
08 Jun 2022
Nash, Conley, and Computation: Impossibility and Incompleteness in Game Dynamics
Jason Milionis
Christos H. Papadimitriou
Georgios Piliouras
Kelly Spendlove
81
9
0
26 Mar 2022
Scalable Deep Reinforcement Learning Algorithms for Mean Field Games
Mathieu Laurière
Sarah Perrin
Sertan Girgin
Paul Muller
Ayush Jain
...
Georgios Piliouras
Julien Pérolat
Romuald Élie
Olivier Pietquin
Matthieu Geist
99
44
0
22 Mar 2022
Student of Games: A unified learning algorithm for both perfect and imperfect information games
Martin Schmid
Matej Moravcík
Neil Burch
Rudolf Kadlec
Josh Davidson
...
Marc Lanctot
G. Z. Holland
Elnaz Davoodi
Alden Christianson
Michael Bowling
86
22
0
06 Dec 2021
Online Learning in Periodic Zero-Sum Games
Tanner Fiez
Ryann Sim
Stratis Skoulakis
Georgios Piliouras
Lillian J. Ratliff
38
15
0
05 Nov 2021
Exploration-Exploitation in Multi-Agent Competition: Convergence with Bounded Rationality
Stefanos Leonardos
Georgios Piliouras
Kelly Spendlove
136
31
0
24 Jun 2021
Online Optimization in Games via Control Theory: Connecting Regret, Passivity and Poincaré Recurrence
Yun Kuen Cheung
Georgios Piliouras
48
8
0
09 Jun 2021
Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization
Zhen-Yu Tang
Chao Yu
Boyuan Chen
Huazhe Xu
Xiaolong Wang
Fei Fang
S. Du
Yu Wang
Yi Wu
103
55
0
08 Mar 2021
Learning in Matrix Games can be Arbitrarily Complex
Gabriel P. Andrade
Rafael Frongillo
Georgios Piliouras
57
30
0
05 Mar 2021
Provably Efficient Policy Optimization for Two-Player Zero-Sum Markov Games
Yulai Zhao
Yuandong Tian
Jason D. Lee
S. Du
OffRL
76
18
0
17 Feb 2021
Complex Momentum for Optimization in Games
Jonathan Lorraine
David Acuna
Paul Vicol
David Duvenaud
69
9
0
16 Feb 2021
Solving Min-Max Optimization with Hidden Structure via Gradient Descent Ascent
Lampros Flokas
Emmanouil-Vasileios Vlatakis-Gkaragkounis
Georgios Piliouras
MLT
122
14
0
13 Jan 2021
Evolutionary Game Theory Squared: Evolving Agents in Endogenously Evolving Zero-Sum Games
Stratis Skoulakis
Tanner Fiez
Ryann Sim
Georgios Piliouras
Lillian J. Ratliff
52
15
0
15 Dec 2020
Exploration-Exploitation in Multi-Agent Learning: Catastrophe Theory Meets Game Theory
Stefanos Leonardos
Georgios Piliouras
57
45
0
05 Dec 2020
No-regret learning and mixed Nash equilibria: They do not mix
Lampros Flokas
Emmanouil-Vasileios Vlatakis-Gkaragkounis
Thanasis Lianeas
P. Mertikopoulos
Georgios Piliouras
74
87
0
19 Oct 2020