ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2007.07461
  4. Cited By
Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal
  Sample Complexity
v1v2v3 (latest)

Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity

Neural Information Processing Systems (NeurIPS), 2020
15 July 2020
Jianchao Tan
Sham Kakade
Tamer Bacsar
Lin F. Yang
ArXiv (abs)PDFHTML

Papers citing "Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity"

50 / 90 papers shown
Non-convex entropic mean-field optimization via Best Response flow
Non-convex entropic mean-field optimization via Best Response flow
Razvan-Andrei Lascu
Mateusz B. Majka
356
2
0
28 May 2025
Multi-Robot Collaboration through Reinforcement Learning and Abstract Simulation
Multi-Robot Collaboration through Reinforcement Learning and Abstract SimulationIEEE International Conference on Robotics and Automation (ICRA), 2025
Adam Labiosa
Josiah P. Hanna
244
1
0
07 Mar 2025
Preference-Based Multi-Agent Reinforcement Learning: Data Coverage and Algorithmic Techniques
Preference-Based Multi-Agent Reinforcement Learning: Data Coverage and Algorithmic Techniques
Natalia Zhang
X. Wang
Qiwen Cui
Runlong Zhou
Sham Kakade
Simon S. Du
OffRL
542
1
0
10 Jan 2025
Deep Reinforcement Learning for Job Scheduling and Resource Management in Cloud Computing: An Algorithm-Level Review
Deep Reinforcement Learning for Job Scheduling and Resource Management in Cloud Computing: An Algorithm-Level Review
Yan Gu
Zhaoze Liu
Shuhong Dai
Cong Liu
Ying Wang
Shen Wang
Georgios Theodoropoulos
Long Cheng
415
28
0
03 Jan 2025
Preference-based opponent shaping in differentiable games
Preference-based opponent shaping in differentiable games
Xinyu Qiao
Yudong Hu
Congying Han
Weiyan Wu
Tiande Guo
219
1
0
04 Dec 2024
Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from
  Shifted-Dynamics Data
Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics DataInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Chengrui Qu
Laixi Shi
Kishan Panaganti
Pengcheng You
Adam Wierman
OffRLOnRL
309
6
0
06 Nov 2024
Transformers as Game Players: Provable In-context Game-playing
  Capabilities of Pre-trained Models
Transformers as Game Players: Provable In-context Game-playing Capabilities of Pre-trained ModelsNeural Information Processing Systems (NeurIPS), 2024
Chengshuai Shi
Kun Yang
Jing Yang
Cong Shen
267
1
0
13 Oct 2024
Roping in Uncertainty: Robustness and Regularization in Markov Games
Roping in Uncertainty: Robustness and Regularization in Markov Games
Jeremy McMahan
Giovanni Artiglio
Qiaomin Xie
239
5
0
13 Jun 2024
Risk Sensitivity in Markov Games and Multi-Agent Reinforcement Learning:
  A Systematic Review
Risk Sensitivity in Markov Games and Multi-Agent Reinforcement Learning: A Systematic Review
Hafez Ghaemi
Shirin Jamshidi
Mohammad Mashreghi
M. N. Ahmadabadi
Hamed Kebriaei
313
2
0
10 Jun 2024
FuRL: Visual-Language Models as Fuzzy Rewards for Reinforcement Learning
FuRL: Visual-Language Models as Fuzzy Rewards for Reinforcement Learning
Yuwei Fu
Haichao Zhang
Di Wu
Wei Xu
Benoit Boulet
VLM
411
31
0
02 Jun 2024
Efficient Multi-agent Reinforcement Learning by Planning
Efficient Multi-agent Reinforcement Learning by Planning
Qihan Liu
Jianing Ye
Xiaoteng Ma
Jun Yang
Bin Liang
Chongjie Zhang
263
17
0
20 May 2024
Taming Equilibrium Bias in Risk-Sensitive Multi-Agent Reinforcement
  Learning
Taming Equilibrium Bias in Risk-Sensitive Multi-Agent Reinforcement Learning
Yingjie Fei
Ruitu Xu
242
0
0
04 May 2024
RL in Markov Games with Independent Function Approximation: Improved
  Sample Complexity Bound under the Local Access Model
RL in Markov Games with Independent Function Approximation: Improved Sample Complexity Bound under the Local Access ModelInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Junyi Fan
Yuxuan Han
Jialin Zeng
Jian-Feng Cai
Yang Wang
Yang Xiang
Jiheng Zhang
501
1
0
18 Mar 2024
Provable Policy Gradient Methods for Average-Reward Markov Potential
  Games
Provable Policy Gradient Methods for Average-Reward Markov Potential GamesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Min Cheng
Ruida Zhou
P. R. Kumar
Chao Tian
334
8
0
09 Mar 2024
Mirror Descent-Ascent for mean-field min-max problems
Mirror Descent-Ascent for mean-field min-max problems
Razvan-Andrei Lascu
Mateusz B. Majka
Lukasz Szpruch
388
1
0
12 Feb 2024
Principled Penalty-based Methods for Bilevel Reinforcement Learning and
  RLHF
Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHFInternational Conference on Machine Learning (ICML), 2024
Han Shen
Zhuoran Yang
Tianyi Chen
OffRL
432
33
0
10 Feb 2024
Risk-Sensitive Multi-Agent Reinforcement Learning in Network Aggregative
  Markov Games
Risk-Sensitive Multi-Agent Reinforcement Learning in Network Aggregative Markov Games
Hafez Ghaemi
Hamed Kebriaei
Alireza Ramezani Moghaddam
Majid Nili Ahamadabadi
294
3
0
08 Feb 2024
Optimistic Policy Gradient in Multi-Player Markov Games with a Single
  Controller: Convergence Beyond the Minty Property
Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence Beyond the Minty Property
Ioannis Anagnostides
Ioannis Panageas
Gabriele Farina
Tuomas Sandholm
394
3
0
19 Dec 2023
Uncertainty-aware transfer across tasks using hybrid model-based
  successor feature reinforcement learning
Uncertainty-aware transfer across tasks using hybrid model-based successor feature reinforcement learning
Parvin Malekzadeh
Ming Hou
Konstantinos N. Plataniotis
357
3
0
16 Oct 2023
Sample-Efficient Multi-Agent RL: An Optimization Perspective
Sample-Efficient Multi-Agent RL: An Optimization PerspectiveInternational Conference on Learning Representations (ICLR), 2023
Nuoya Xiong
Zhihan Liu
Zhaoran Wang
Zhuoran Yang
322
2
0
10 Oct 2023
VDFD: Multi-Agent Value Decomposition Framework with Disentangled World Model
VDFD: Multi-Agent Value Decomposition Framework with Disentangled World Model
Zhizun Wang
David Meger
DRL
364
4
0
08 Sep 2023
Local and adaptive mirror descents in extensive-form games
Local and adaptive mirror descents in extensive-form gamesNeural Information Processing Systems (NeurIPS), 2023
Côme Fiegel
Pierre Ménard
Tadashi Kozuno
Rémi Munos
Vianney Perchet
Michal Valko
274
3
0
01 Sep 2023
Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov
  Games
Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov GamesInternational Conference on Machine Learning (ICML), 2023
Songtao Feng
Ming Yin
Yu Wang
J. Yang
Yitao Liang
190
1
0
17 Aug 2023
Efficient Adversarial Attacks on Online Multi-agent Reinforcement
  Learning
Efficient Adversarial Attacks on Online Multi-agent Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Guanlin Liu
Lifeng Lai
AAML
237
18
0
15 Jul 2023
Multi-Player Zero-Sum Markov Games with Networked Separable Interactions
Multi-Player Zero-Sum Markov Games with Networked Separable InteractionsNeural Information Processing Systems (NeurIPS), 2023
Chanwoo Park
Jianchao Tan
Asuman Ozdaglar
398
14
0
13 Jul 2023
Sharper Model-free Reinforcement Learning for Average-reward Markov
  Decision Processes
Sharper Model-free Reinforcement Learning for Average-reward Markov Decision ProcessesAnnual Conference Computational Learning Theory (COLT), 2023
Zihan Zhang
Qiaomin Xie
OffRL
288
29
0
28 Jun 2023
Co-Learning Empirical Games and World Models
Co-Learning Empirical Games and World Models
Max O. Smith
Michael P. Wellman
338
4
0
23 May 2023
Multi-agent Policy Reciprocity with Theoretical Guarantee
Multi-agent Policy Reciprocity with Theoretical Guarantee
Haozhi Wang
Yinchuan Li
Qing Wang
Yunfeng Shao
Jianye Hao
232
1
0
12 Apr 2023
Neural Operators of Backstepping Controller and Observer Gain Functions
  for Reaction-Diffusion PDEs
Neural Operators of Backstepping Controller and Observer Gain Functions for Reaction-Diffusion PDEs
Miroslav Krstic
Luke Bhan
Yuanyuan Shi
315
47
0
18 Mar 2023
A New Policy Iteration Algorithm For Reinforcement Learning in Zero-Sum
  Markov Games
A New Policy Iteration Algorithm For Reinforcement Learning in Zero-Sum Markov Games
Anna Winnicki
R. Srikant
457
2
0
17 Mar 2023
Learning Strategic Value and Cooperation in Multi-Player Stochastic
  Games through Side Payments
Learning Strategic Value and Cooperation in Multi-Player Stochastic Games through Side Payments
Alan Kuhnle
J. Richley
Darleen Perez-Lavin
395
1
0
09 Mar 2023
Can We Find Nash Equilibria at a Linear Rate in Markov Games?
Can We Find Nash Equilibria at a Linear Rate in Markov Games?International Conference on Learning Representations (ICLR), 2023
Zhuoqing Song
Jason D. Lee
Zhuoran Yang
448
10
0
03 Mar 2023
Finite-sample Guarantees for Nash Q-learning with Linear Function
  Approximation
Finite-sample Guarantees for Nash Q-learning with Linear Function ApproximationConference on Uncertainty in Artificial Intelligence (UAI), 2023
Pedro Cisneros-Velarde
Oluwasanmi Koyejo
349
3
0
01 Mar 2023
Model-Based Decentralized Policy Optimization
Model-Based Decentralized Policy Optimization
Hao Luo
Jiechuan Jiang
Zongqing Lu
299
3
0
16 Feb 2023
Breaking the Curse of Multiagency: Provably Efficient Decentralized
  Multi-Agent RL with Function Approximation
Breaking the Curse of Multiagency: Provably Efficient Decentralized Multi-Agent RL with Function ApproximationAnnual Conference Computational Learning Theory (COLT), 2023
Yuanhao Wang
Qinghua Liu
Yunru Bai
Chi Jin
359
39
0
13 Feb 2023
Breaking the Curse of Multiagents in a Large State Space: RL in Markov
  Games with Independent Linear Function Approximation
Breaking the Curse of Multiagents in a Large State Space: RL in Markov Games with Independent Linear Function ApproximationAnnual Conference Computational Learning Theory (COLT), 2023
Qiwen Cui
Jianchao Tan
S. Du
438
30
0
07 Feb 2023
Population-size-Aware Policy Optimization for Mean-Field Games
Population-size-Aware Policy Optimization for Mean-Field GamesInternational Conference on Learning Representations (ICLR), 2023
Pengdeng Li
Xinrun Wang
Shuxin Li
Hau Chan
Bo An
247
3
0
07 Feb 2023
A Reduction-based Framework for Sequential Decision Making with Delayed
  Feedback
A Reduction-based Framework for Sequential Decision Making with Delayed FeedbackNeural Information Processing Systems (NeurIPS), 2023
Yunchang Yang
Hangshi Zhong
Tianhao Wu
B. Liu
Liwei Wang
S. Du
OffRL
604
10
0
03 Feb 2023
Adapting to game trees in zero-sum imperfect information games
Adapting to game trees in zero-sum imperfect information gamesInternational Conference on Machine Learning (ICML), 2022
Côme Fiegel
Pierre Ménard
Tadashi Kozuno
Rémi Munos
Vianney Perchet
Michal Valko
520
13
0
23 Dec 2022
Smoothing Policy Iteration for Zero-sum Markov Games
Smoothing Policy Iteration for Zero-sum Markov Games
Yangang Ren
Yao Lyu
Wenxuan Wang
Sheng Li
Zeyang Li
Jingliang Duan
184
1
0
03 Dec 2022
Offline congestion games: How feedback type affects data coverage
  requirement
Offline congestion games: How feedback type affects data coverage requirementInternational Conference on Learning Representations (ICLR), 2022
Haozhe Jiang
Qiwen Cui
Zhihan Xiong
Maryam Fazel
S. Du
OffRL
201
1
0
24 Oct 2022
Symmetric (Optimistic) Natural Policy Gradient for Multi-agent Learning
  with Parameter Convergence
Symmetric (Optimistic) Natural Policy Gradient for Multi-agent Learning with Parameter ConvergenceInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
S. Pattathil
Jianchao Tan
Asuman Ozdaglar
370
15
0
23 Oct 2022
Faster Last-iterate Convergence of Policy Optimization in Zero-Sum
  Markov Games
Faster Last-iterate Convergence of Policy Optimization in Zero-Sum Markov GamesInternational Conference on Learning Representations (ICLR), 2022
Shicong Cen
Yuejie Chi
S. Du
Lin Xiao
530
46
0
03 Oct 2022
$O(T^{-1})$ Convergence of Optimistic-Follow-the-Regularized-Leader in
  Two-Player Zero-Sum Markov Games
O(T−1)O(T^{-1})O(T−1) Convergence of Optimistic-Follow-the-Regularized-Leader in Two-Player Zero-Sum Markov Games
Yuepeng Yang
Cong Ma
267
17
0
26 Sep 2022
Minimax-Optimal Multi-Agent RL in Markov Games With a Generative Model
Minimax-Optimal Multi-Agent RL in Markov Games With a Generative ModelNeural Information Processing Systems (NeurIPS), 2022
Gen Li
Yuejie Chi
Yuting Wei
Yuxin Chen
431
20
0
22 Aug 2022
Regret Minimization and Convergence to Equilibria in General-sum Markov Games
Regret Minimization and Convergence to Equilibria in General-sum Markov GamesInternational Conference on Machine Learning (ICML), 2022
Liad Erez
Tal Lancewicki
Uri Sherman
Tomer Koren
Yishay Mansour
496
35
0
28 Jul 2022
Scalable Model-based Policy Optimization for Decentralized Networked
  Systems
Scalable Model-based Policy Optimization for Decentralized Networked SystemsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2022
Yali Du
Chengdong Ma
Yuchen Liu
Runji Lin
Hao Dong
Jun Wang
Yaodong Yang
327
13
0
13 Jul 2022
Approximate Nash Equilibrium Learning for n-Player Markov Games in
  Dynamic Pricing
Approximate Nash Equilibrium Learning for n-Player Markov Games in Dynamic PricingPortuguese Conference on Artificial Intelligence (EPIA), 2022
Larkin Liu
339
1
0
13 Jul 2022
Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning
Interaction Pattern Disentangling for Multi-Agent Reinforcement LearningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Shunyu Liu
Mingli Song
Yihe Zhou
Na Yu
Kaixuan Chen
Zunlei Feng
Weilong Dai
484
17
0
08 Jul 2022
A Survey on Model-based Reinforcement Learning
A Survey on Model-based Reinforcement LearningScience China Information Sciences (Sci. China Inf. Sci.), 2022
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRLLRM
468
165
0
19 Jun 2022
12
Next
Page 1 of 2