Title
Asynchronous Predictive Counterfactual Regret Minimization $^+$ Algorithm in Solving Extensive-Form Games Linjian Meng Youzhi Zhang Zhenxing Ge Tianpei Yang Yang Gao 100 0 0 17 Mar 2025
Two-Player Zero-Sum Differential Games with One-Sided Information Mukesh Ghimire Z. Xu Yi Ren SyDa 198 0 0 17 Feb 2025
COMAL: A Convergent Meta-Algorithm for Aligning LLMs with General Preferences Yongxu Liu Argyris Oikonomou Weiqiang Zheng Yang Cai Arman Cohan 93 1 0 30 Oct 2024
Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment Mingzhi Wang Chengdong Ma Qizhi Chen Linjian Meng Yang Han Jiancong Xiao Zhaowei Zhang Jing Huo Weijie Su Yaodong Yang 135 9 0 22 Oct 2024
Last Iterate Convergence in Monotone Mean Field Games Noboru Isobe Kenshi Abe Kaito Ariu 94 0 0 07 Oct 2024
Learning in Games with Progressive Hiding Benjamin Heymann Marc Lanctot 72 0 0 05 Sep 2024
A Survey on Self-play Methods in Reinforcement Learning Chao Yu Zelai Xu Chengdong Ma Chao Yu Weijuan Tu ... Deheng Ye Wenbo Ding Yaodong Yang Yu Wang Yu Wang SyDa SSL OnRL 168 9 0 02 Aug 2024
A Policy-Gradient Approach to Solving Imperfect-Information Games with Iterate Convergence Mingyang Liu Gabriele Farina Asuman Ozdaglar 76 3 0 01 Aug 2024
A Meta-Game Evaluation Framework for Deep Multiagent Reinforcement Learning Zun Li Michael P. Wellman 75 1 0 30 Apr 2024
Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent Hang Xu Kai Li Bingyun Liu Haobo Fu Qiang Fu Junliang Xing Jian Cheng 70 3 0 22 Apr 2024
RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning Boning Li Zhixuan Fang Longbo Huang 42 0 0 07 Mar 2024
Neural Population Learning beyond Symmetric Zero-sum Games Siqi Liu Luke Marris Marc Lanctot Georgios Piliouras Joel Z Leibo N. Heess MLT 89 3 0 10 Jan 2024
Nash Learning from Human Feedback Rémi Munos Michal Valko Daniele Calandriello M. G. Azar Mark Rowland ... Nikola Momchev Olivier Bachem D. Mankowitz Doina Precup Bilal Piot 130 147 0 01 Dec 2023
Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play Daniel Bairamian Philippe Marcotte Joshua Romoff Gabriel Robert Derek Nowrouzezahrai 74 0 0 28 Nov 2023
Fictitious Cross-Play: Learning Global Nash Equilibrium in Mixed Cooperative-Competitive Games Zelai Xu Yancheng Liang Chao Yu Yu Wang Yi Wu 93 9 0 05 Oct 2023
Efficient Last-iterate Convergence Algorithms in Solving Games Lin Meng Zhenxing Ge Wenbin Li Bo An Yang Gao Wenbin Li Tianpei Yang Bo An Yang Gao 73 0 0 22 Aug 2023
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games Yang Li Kun Xiong Yingping Zhang Jiangcheng Zhu Stephen Marcus McAleer Wei Pan Jun Wang Zonghong Dai Yaodong Yang 126 2 0 09 Aug 2023
Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations Yongyuan Liang Yanchao Sun Ruijie Zheng Xiangyu Liu Benjamin Eysenbach Tuomas Sandholm Furong Huang Stephen Marcus McAleer OOD 82 0 0 22 Jul 2023
Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL Xiangyu Liu Souradip Chakraborty Yanchao Sun Furong Huang AAML 66 5 0 27 May 2023
Adaptively Perturbed Mirror Descent for Learning in Games Kenshi Abe Kaito Ariu Mitsuki Sakamoto Atsushi Iwasaki 57 6 0 26 May 2023
The Update-Equivalence Framework for Decision-Time Planning Samuel Sokota Gabriele Farina David J. Wu Hengyuan Hu Kevin A. Wang J. Zico Kolter Noam Brown 118 4 0 25 Apr 2023
ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs Theodore H. Moskovitz Brendan O'Donoghue Vivek Veeriah Sebastian Flennerhag Satinder Singh Tom Zahavy 96 21 0 02 Feb 2023
Abstracting Imperfect Information Away from Two-Player Zero-Sum Games Samuel Sokota Ryan DÓrazio Chun Kai Ling David J. Wu J. Zico Kolter Noam Brown 100 4 0 22 Jan 2023
Policy Mirror Ascent for Efficient and Independent Learning in Mean Field Games Batuhan Yardim Semih Cayci Matthieu Geist Niao He 126 29 0 29 Dec 2022
Adversarial Policies Beat Superhuman Go AIs T. T. Wang Adam Gleave Tom Tseng Kellin Pelrine Nora Belrose ... Michael Dennis Yawen Duan V. Pogrebniak Sergey Levine Stuart Russell AAML 82 22 0 01 Nov 2022
Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments I. Gemp Thomas W. Anthony Yoram Bachrach Avishkar Bhoopchand Kalesha Bullard ... Florian Strub Andrea Tacchetti Eugene Tarassov Zhe Wang K. Tuyls LLMAG AI4CE 88 3 0 22 Sep 2022
Last-Iterate Convergence with Full and Noisy Feedback in Two-Player Zero-Sum Games Kenshi Abe Kaito Ariu Mitsuki Sakamoto Kenta Toyoshima Atsushi Iwasaki 84 12 0 21 Aug 2022
Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games Rongjun Qin Fan Luo Hong Qian Yang Yu 64 2 0 19 Aug 2022
Provably Efficient Fictitious Play Policy Optimization for Zero-Sum Markov Games with Structured Transitions Shuang Qiu Xiaohan Wei Jieping Ye Zhaoran Wang Zhuoran Yang OffRL 67 12 0 25 Jul 2022
Fast Convergence of Optimistic Gradient Ascent in Network Zero-Sum Extensive Form Games Georgios Piliouras Lillian J. Ratliff Ryann Sim Stratis Skoulakis MLT 69 3 0 18 Jul 2022
A Survey of Decision Making in Adversarial Games Xiuxian Li Min Meng Yiguang Hong Jie-bin Chen AAML 97 15 0 16 Jul 2022
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games Stephen Marcus McAleer JB Lanier Kevin A. Wang Pierre Baldi Roy Fox Tuomas Sandholm 75 18 0 13 Jul 2022
The Power of Regularization in Solving Extensive-Form Games Ming-Yuan Liu Asuman Ozdaglar Tiancheng Yu Kai Zhang 58 23 0 19 Jun 2022
Mutation-Driven Follow the Regularized Leader for Last-Iterate Convergence in Zero-Sum Games Kenshi Abe Mitsuki Sakamoto Atsushi Iwasaki 65 18 0 18 Jun 2022
A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games Samuel Sokota Ryan DÓrazio J. Zico Kolter Nicolas Loizou Marc Lanctot Ioannis Mitliagkas Noam Brown Christian Kroer 72 1 0 12 Jun 2022
ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret Stephen Marcus McAleer Gabriele Farina Marc Lanctot Tuomas Sandholm 174 26 0 08 Jun 2022
Nash, Conley, and Computation: Impossibility and Incompleteness in Game Dynamics Jason Milionis Christos H. Papadimitriou Georgios Piliouras Kelly Spendlove 81 9 0 26 Mar 2022
Scalable Deep Reinforcement Learning Algorithms for Mean Field Games Mathieu Laurière Sarah Perrin Sertan Girgin Paul Muller Ayush Jain ... Georgios Piliouras Julien Pérolat Romuald Élie Olivier Pietquin Matthieu Geist 99 44 0 22 Mar 2022
Student of Games: A unified learning algorithm for both perfect and imperfect information games Martin Schmid Matej Moravcík Neil Burch Rudolf Kadlec Josh Davidson ... Marc Lanctot G. Z. Holland Elnaz Davoodi Alden Christianson Michael Bowling 86 22 0 06 Dec 2021
Online Learning in Periodic Zero-Sum Games Tanner Fiez Ryann Sim Stratis Skoulakis Georgios Piliouras Lillian J. Ratliff 38 15 0 05 Nov 2021
Exploration-Exploitation in Multi-Agent Competition: Convergence with Bounded Rationality Stefanos Leonardos Georgios Piliouras Kelly Spendlove 136 31 0 24 Jun 2021
Online Optimization in Games via Control Theory: Connecting Regret, Passivity and Poincaré Recurrence Yun Kuen Cheung Georgios Piliouras 48 8 0 09 Jun 2021
Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization Zhen-Yu Tang Chao Yu Boyuan Chen Huazhe Xu Xiaolong Wang Fei Fang S. Du Yu Wang Yi Wu 103 55 0 08 Mar 2021
Learning in Matrix Games can be Arbitrarily Complex Gabriel P. Andrade Rafael Frongillo Georgios Piliouras 57 30 0 05 Mar 2021
Provably Efficient Policy Optimization for Two-Player Zero-Sum Markov Games Yulai Zhao Yuandong Tian Jason D. Lee S. Du OffRL 76 18 0 17 Feb 2021
Complex Momentum for Optimization in Games Jonathan Lorraine David Acuna Paul Vicol David Duvenaud 69 9 0 16 Feb 2021
Solving Min-Max Optimization with Hidden Structure via Gradient Descent Ascent Lampros Flokas Emmanouil-Vasileios Vlatakis-Gkaragkounis Georgios Piliouras MLT 122 14 0 13 Jan 2021
Evolutionary Game Theory Squared: Evolving Agents in Endogenously Evolving Zero-Sum Games Stratis Skoulakis Tanner Fiez Ryan Sim Georgios Piliouras Lillian J. Ratliff 52 15 0 15 Dec 2020
Exploration-Exploitation in Multi-Agent Learning: Catastrophe Theory Meets Game Theory Stefanos Leonardos Georgios Piliouras 57 45 0 05 Dec 2020
No-regret learning and mixed Nash equilibria: They do not mix Lampros Flokas Emmanouil-Vasileios Vlatakis-Gkaragkounis Thanasis Lianeas P. Mertikopoulos Georgios Piliouras 76 87 0 19 Oct 2020