Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2110.14555
Cited By
V-Learning -- A Simple, Efficient, Decentralized Algorithm for Multiagent RL
27 October 2021
Chi Jin
Qinghua Liu
Yuanhao Wang
Tiancheng Yu
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"V-Learning -- A Simple, Efficient, Decentralized Algorithm for Multiagent RL"
40 / 40 papers shown
Title
Improving LLM General Preference Alignment via Optimistic Online Mirror Descent
Yuheng Zhang
Dian Yu
Tao Ge
Linfeng Song
Zhichen Zeng
Haitao Mi
Nan Jiang
Dong Yu
58
1
0
24 Feb 2025
Incentivize without Bonus: Provably Efficient Model-based Online Multi-agent RL for Markov Games
Tong Yang
Bo Dai
Lin Xiao
Yuejie Chi
OffRL
56
2
0
13 Feb 2025
Mean-Field Sampling for Cooperative Multi-Agent Reinforcement Learning
Emile Anand
Ishani Karmarkar
Guannan Qu
83
1
0
01 Dec 2024
The Bandit Whisperer: Communication Learning for Restless Bandits
Yunfan Zhao
Tonghan Wang
Dheeraj M. Nagaraj
Aparna Taneja
Milind Tambe
49
5
0
11 Aug 2024
Learning to Steer Markovian Agents under Model Uncertainty
Jiawei Huang
Vinzenz Thoma
Zebang Shen
H. Nax
Niao He
40
2
0
14 Jul 2024
Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning
Yuheng Zhang
Dian Yu
Baolin Peng
Linfeng Song
Ye Tian
Mingyue Huo
Nan Jiang
Haitao Mi
Dong Yu
35
15
0
30 Jun 2024
Decentralized Online Learning in General-Sum Stackelberg Games
Yaolong Yu
Haipeng Chen
27
0
0
06 May 2024
Independent RL for Cooperative-Competitive Agents: A Mean-Field Perspective
Muhammad Aneeq uz Zaman
Alec Koppel
Mathieu Laurière
Tamer Basar
39
3
0
17 Mar 2024
Refined Sample Complexity for Markov Games with Independent Linear Function Approximation
Yan Dai
Qiwen Cui
S. S. Du
41
1
0
11 Feb 2024
Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing
Haobin Jiang
Ziluo Ding
Zongqing Lu
24
2
0
03 Feb 2024
Multi-Player Zero-Sum Markov Games with Networked Separable Interactions
Chanwoo Park
K. Zhang
Asuman Ozdaglar
30
8
0
13 Jul 2023
Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning
Volodymyr Tkachuk
Seyed Alireza Bakhtiari
Johannes Kirschner
Matej Jusup
Ilija Bogunovic
Csaba Szepesvári
24
4
0
08 Feb 2023
Offline Learning in Markov Games with General Function Approximation
Yuheng Zhang
Yunru Bai
Nan Jiang
OffRL
15
8
0
06 Feb 2023
A Reduction-based Framework for Sequential Decision Making with Delayed Feedback
Yunchang Yang
Hangshi Zhong
Tianhao Wu
B. Liu
Liwei Wang
S. Du
OffRL
27
8
0
03 Feb 2023
What is the Solution for State-Adversarial Multi-Agent Reinforcement Learning?
Songyang Han
Sanbao Su
Sihong He
Shuo Han
Haizhao Yang
Shaofeng Zou
Fei Miao
AAML
20
22
0
06 Dec 2022
Provably Efficient Model-free RL in Leader-Follower MDP with Linear Function Approximation
A. Ghosh
15
1
0
28 Nov 2022
Faster Last-iterate Convergence of Policy Optimization in Zero-Sum Markov Games
Shicong Cen
Yuejie Chi
S. Du
Lin Xiao
51
35
0
03 Oct 2022
Minimax-Optimal Multi-Agent RL in Markov Games With a Generative Model
Gen Li
Yuejie Chi
Yuting Wei
Yuxin Chen
32
18
0
22 Aug 2022
Efficiently Computing Nash Equilibria in Adversarial Team Markov Games
Fivos Kalogiannis
Ioannis Anagnostides
Ioannis Panageas
Emmanouil-Vasileios Vlatakis-Gkaragkounis
Vaggos Chatziafratis
S. Stavroulakis
31
13
0
03 Aug 2022
Regret Minimization and Convergence to Equilibria in General-sum Markov Games
Liad Erez
Tal Lancewicki
Uri Sherman
Tomer Koren
Yishay Mansour
40
25
0
28 Jul 2022
A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games
Zihan Ding
DiJia Su
Qinghua Liu
Chi Jin
33
3
0
18 Jul 2022
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Stephen Marcus McAleer
JB Lanier
Kevin A. Wang
Pierre Baldi
Roy Fox
T. Sandholm
27
18
0
13 Jul 2022
Policy Optimization for Markov Games: Unified Framework and Faster Convergence
Runyu Zhang
Qinghua Liu
Haiquan Wang
Caiming Xiong
Na Li
Yu Bai
19
26
0
06 Jun 2022
Learning in Congestion Games with Bandit Feedback
Qiwen Cui
Zhihan Xiong
Maryam Fazel
S. Du
24
12
0
04 Jun 2022
Sample-Efficient Reinforcement Learning of Partially Observable Markov Games
Qinghua Liu
Csaba Szepesvári
Chi Jin
34
20
0
02 Jun 2022
Independent Natural Policy Gradient Methods for Potential Games: Finite-time Global Convergence with Entropy Regularization
Shicong Cen
Fan Chen
Yuejie Chi
31
15
0
12 Apr 2022
The Complexity of Markov Equilibrium in Stochastic Games
C. Daskalakis
Noah Golowich
K. Zhang
36
57
0
08 Apr 2022
Near-Optimal Learning of Extensive-Form Games with Imperfect Information
Yunru Bai
Chi Jin
Song Mei
Tiancheng Yu
21
26
0
03 Feb 2022
When is Offline Two-Player Zero-Sum Markov Game Solvable?
Qiwen Cui
S. Du
OffRL
30
29
0
10 Jan 2022
Can Reinforcement Learning Find Stackelberg-Nash Equilibria in General-Sum Markov Games with Myopic Followers?
Han Zhong
Zhuoran Yang
Zhaoran Wang
Michael I. Jordan
29
30
0
27 Dec 2021
Finite-Sample Analysis of Decentralized Q-Learning for Stochastic Games
Zuguang Gao
Qianqian Ma
Tamer Bacsar
J. Birge
OffRL
22
7
0
15 Dec 2021
Independent Learning in Stochastic Games
Asuman Ozdaglar
M. O. Sayin
K. Zhang
16
22
0
23 Nov 2021
Decentralized Cooperative Reinforcement Learning with Hierarchical Information Structure
Hsu Kao
Chen-Yu Wei
V. Subramanian
13
12
0
01 Nov 2021
Independent Natural Policy Gradient Always Converges in Markov Potential Games
Roy Fox
Stephen Marcus McAleer
W. Overman
Ioannis Panageas
24
49
0
20 Oct 2021
On Improving Model-Free Algorithms for Decentralized Multi-Agent Reinforcement Learning
Weichao Mao
Lin F. Yang
K. Zhang
Tamer Bacsar
31
57
0
12 Oct 2021
When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently?
Ziang Song
Song Mei
Yu Bai
74
67
0
08 Oct 2021
On the Complexity of Computing Markov Perfect Equilibrium in General-Sum Stochastic Games
Xiaotie Deng
Ningyuan Li
D. Mguni
Jun Wang
Yaodong Yang
21
46
0
04 Sep 2021
Gradient play in stochastic games: stationary points, convergence, and sample complexity
Runyu Zhang
Zhaolin Ren
Na Li
26
43
0
01 Jun 2021
Independent Policy Gradient Methods for Competitive Reinforcement Learning
C. Daskalakis
Dylan J. Foster
Noah Golowich
62
159
0
11 Jan 2021
Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity
K. Zhang
Sham Kakade
Tamer Bacsar
Lin F. Yang
47
119
0
15 Jul 2020
1