Fast Policy Extragradient Methods for Competitive Games with Entropy Regularization

31 May 2021

Papers citing "Fast Policy Extragradient Methods for Competitive Games with Entropy Regularization"

14 / 14 papers shown

Title
Faster WIND: Accelerating Iterative Best-of- $N$ Distillation for LLM Alignment Tong Yang Jincheng Mei H. Dai Zixin Wen Shicong Cen Dale Schuurmans Yuejie Chi Bo Dai 36 4 0 20 Feb 2025
Two-Player Zero-Sum Differential Games with One-Sided Information Mukesh Ghimire Z. Xu Yi Ren SyDa 93 0 0 17 Feb 2025
Incentivize without Bonus: Provably Efficient Model-based Online Multi-agent RL for Markov Games Tong Yang Bo Dai Lin Xiao Yuejie Chi OffRL 56 2 0 13 Feb 2025
Multi-Player Zero-Sum Markov Games with Networked Separable Interactions Chanwoo Park K. Zhang Asuman Ozdaglar 21 8 0 13 Jul 2023
Can We Find Nash Equilibria at a Linear Rate in Markov Games? Zhuoqing Song Jason D. Lee Zhuoran Yang 14 8 0 03 Mar 2023
Differentiable Arbitrating in Zero-sum Markov Games Jing Wang Meichen Song Feng Gao Boyi Liu Zhaoran Wang Yi Wu 22 2 0 20 Feb 2023
Faster Last-iterate Convergence of Policy Optimization in Zero-Sum Markov Games Shicong Cen Yuejie Chi S. Du Lin Xiao 41 35 0 03 Oct 2022
Minimax-Optimal Multi-Agent RL in Markov Games With a Generative Model Gen Li Yuejie Chi Yuting Wei Yuxin Chen 12 18 0 22 Aug 2022
Last-Iterate Convergence with Full and Noisy Feedback in Two-Player Zero-Sum Games Kenshi Abe Kaito Ariu Mitsuki Sakamoto Kenta Toyoshima Atsushi Iwasaki 20 11 0 21 Aug 2022
Independent Natural Policy Gradient Methods for Potential Games: Finite-time Global Convergence with Entropy Regularization Shicong Cen Fan Chen Yuejie Chi 16 15 0 12 Apr 2022
Convergence Rates of Two-Time-Scale Gradient Descent-Ascent Dynamics for Solving Nonconvex Min-Max Problems Thinh T. Doan 11 15 0 17 Dec 2021
Independent Learning in Stochastic Games Asuman Ozdaglar M. O. Sayin K. Zhang 10 23 0 23 Nov 2021
Policy Mirror Descent for Reinforcement Learning: Linear Convergence, New Sampling Complexity, and Generalized Problem Classes Guanghui Lan 87 135 0 30 Jan 2021
Independent Policy Gradient Methods for Competitive Reinforcement Learning C. Daskalakis Dylan J. Foster Noah Golowich 51 158 0 11 Jan 2021