From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization

International Conference on Machine Learning (ICML), 2020

19 February 2020

Julien Perolat

Rémi Munos

Jean-Baptiste Lespiau

Papers citing "From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization"

Understanding Optimal Portfolios of Strategies for Solving Two-player Zero-sum Games

Karolina Drabent

Ondřej Kubíček

Viliam Lisý

23 Nov 2025

DiffFP: Learning Behaviors from Scratch via Diffusion-based Fictitious Play

Akash Karthikeyan

Yash Vardhan Pant

167

17 Nov 2025

Outbidding and Outbluffing Elite Humans: Mastering Liar's Poker via Self-Play and Reinforcement Learning

213

05 Nov 2025

Nash Policy Gradient: A Policy Gradient Method with Iteratively Refined Regularization for Finding Nash Equilibria

206

21 Oct 2025

Look-ahead Reasoning with a Learned Model in Imperfect Information Games

Ondřej Kubíček

Viliam Lisý

LRM

133

06 Oct 2025

Efficient Last-Iterate Convergence in Regret Minimization via Adaptive Reward Transformation

274

17 Sep 2025

Last-Iterate Convergence in Adaptive Regret Minimization for Approximate Extensive-Form Perfect Equilibrium

203

11 Aug 2025

Faster Game Solving via Asymmetry of Step Sizes

369

17 Mar 2025

Two-Player Zero-Sum Differential Games with One-Sided Information

442

17 Feb 2025

COMAL: A Convergent Meta-Algorithm for Aligning LLMs with General Preferences

361

30 Oct 2024

Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment

International Conference on Learning Representations (ICLR), 2024

762

22 Oct 2024

Last Iterate Convergence in Monotone Mean Field Games

Noboru Isobe

Kenshi Abe

Kaito Ariu

653

07 Oct 2024

Learning in Games with Progressive Hiding

Adaptive Agents and Multi-Agent Systems (AAMAS), 2024

Benjamin Heymann

Marc Lanctot

325

05 Sep 2024

A Survey on Self-play Methods in Reinforcement Learning

...

755

02 Aug 2024

A Policy-Gradient Approach to Solving Imperfect-Information Games with Best-Iterate Convergence

International Conference on Learning Representations (ICLR), 2024

Mingyang Liu

Gabriele Farina

Asuman Ozdaglar

474

01 Aug 2024

A Meta-Game Evaluation Framework for Deep Multiagent Reinforcement Learning

Zun Li

Michael P. Wellman

275

30 Apr 2024

Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent

Hang Xu

277

22 Apr 2024

RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning

Boning Li

Zhixuan Fang

Longbo Huang

164

07 Mar 2024

Neural Population Learning beyond Symmetric Zero-sum Games

Adaptive Agents and Multi-Agent Systems (AAMAS), 2024

303

10 Jan 2024

Nash Learning from Human Feedback

International Conference on Machine Learning (ICML), 2023

Daniele Calandriello

...

Nikola Momchev

Olivier Bachem

D. Mankowitz

Doina Precup

Bilal Piot

752

206

01 Dec 2023

Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play

Adaptive Agents and Multi-Agent Systems (AAMAS), 2023

236

28 Nov 2023

Fictitious Cross-Play: Learning Global Nash Equilibrium in Mixed Cooperative-Competitive Games

Adaptive Agents and Multi-Agent Systems (AAMAS), 2023

Chao Yu

327

05 Oct 2023

Efficient Last-iterate Convergence Algorithms in Solving Games

Wenbin Li

Tianpei Yang

Bo An

Yang Gao

370

22 Aug 2023

JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games

Wei Pan

348

09 Aug 2023

Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations

Furong Huang

261

22 Jul 2023

Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL

International Conference on Learning Representations (ICLR), 2023

Furong Huang

384

27 May 2023

Adaptively Perturbed Mirror Descent for Learning in Games

International Conference on Machine Learning (ICML), 2023

667

26 May 2023

The Update-Equivalence Framework for Decision-Time Planning

International Conference on Learning Representations (ICLR), 2023

J. Zico Kolter

390

25 Apr 2023

ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs

International Conference on Machine Learning (ICML), 2023

Theodore H. Moskovitz

319

02 Feb 2023

Abstracting Imperfect Information Away from Two-Player Zero-Sum Games

International Conference on Machine Learning (ICML), 2023

J. Zico Kolter

337

22 Jan 2023

Policy Mirror Ascent for Efficient and Independent Learning in Mean Field Games

International Conference on Machine Learning (ICML), 2022

406

29 Dec 2022

Adversarial Policies Beat Superhuman Go AIs

International Conference on Machine Learning (ICML), 2022

Adam Gleave

Kellin Pelrine

...

449

01 Nov 2022

Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments

AI Communications (AC), 2022

...

226

22 Sep 2022

Last-Iterate Convergence with Full and Noisy Feedback in Two-Player Zero-Sum Games

International Conference on Artificial Intelligence and Statistics (AISTATS), 2022

348

21 Aug 2022

Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games

324

19 Aug 2022

Provably Efficient Fictitious Play Policy Optimization for Zero-Sum Markov Games with Structured Transitions

International Conference on Machine Learning (ICML), 2022

209

25 Jul 2022

Fast Convergence of Optimistic Gradient Ascent in Network Zero-Sum Extensive Form Games

Algorithmic Game Theory (AGT), 2022

233

18 Jul 2022

A Survey of Decision Making in Adversarial Games

Science China Information Sciences (Sci. China Inf. Sci.), 2022

355

16 Jul 2022

Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games

296

13 Jul 2022

The Power of Regularization in Solving Extensive-Form Games

International Conference on Learning Representations (ICLR), 2022

406

19 Jun 2022

Mutation-Driven Follow the Regularized Leader for Last-Iterate Convergence in Zero-Sum Games

Conference on Uncertainty in Artificial Intelligence (UAI), 2022

Kenshi Abe

Mitsuki Sakamoto

Atsushi Iwasaki

256

18 Jun 2022

A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games

J. Zico Kolter

Alexia Jolicoeur-Martineau

Noam Brown

Christian Kroer

386

12 Jun 2022

ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret

International Conference on Learning Representations (ICLR), 2022

455

08 Jun 2022

Nash, Conley, and Computation: Impossibility and Incompleteness in Game Dynamics

Jason Milionis

Christos H. Papadimitriou

Georgios Piliouras

Kelly Spendlove

258

26 Mar 2022

Scalable Deep Reinforcement Learning Algorithms for Mean Field Games

International Conference on Machine Learning (ICML), 2022

Mathieu Laurière

Sarah Perrin

Sertan Girgin

Paul Muller

Ayush Jain

...

Georgios Piliouras

Julien Pérolat

Romuald Élie

Olivier Pietquin

Matthieu Geist

287

22 Mar 2022

Student of Games: A unified learning algorithm for both perfect and imperfect information games

Science Advances (Sci Adv), 2021

...

359

06 Dec 2021

Online Learning in Periodic Zero-Sum Games

Neural Information Processing Systems (NeurIPS), 2021

157

05 Nov 2021

Exploration-Exploitation in Multi-Agent Competition: Convergence with Bounded Rationality

Stefanos Leonardos

Georgios Piliouras

Kelly Spendlove

299

24 Jun 2021

Online Optimization in Games via Control Theory: Connecting Regret, Passivity and Poincaré Recurrence

International Conference on Machine Learning (ICML), 2021

Yun Kuen Cheung

Georgios Piliouras

188

09 Jun 2021

Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization

International Conference on Learning Representations (ICLR), 2021

Chao Yu

289

08 Mar 2021