ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2108.08612
  4. Cited By
Settling the Variance of Multi-Agent Policy Gradients

Settling the Variance of Multi-Agent Policy Gradients

19 August 2021
J. Kuba
Muning Wen
Yaodong Yang
Linghui Meng
Shangding Gu
Haifeng Zhang
D. Mguni
Jun Wang
ArXivPDFHTML

Papers citing "Settling the Variance of Multi-Agent Policy Gradients"

27 / 27 papers shown
Title
MARFT: Multi-Agent Reinforcement Fine-Tuning
MARFT: Multi-Agent Reinforcement Fine-Tuning
Junwei Liao
Muning Wen
J. Wang
W. Zhang
OffRL
31
0
0
21 Apr 2025
Unicorn: A Universal and Collaborative Reinforcement Learning Approach Towards Generalizable Network-Wide Traffic Signal Control
Yifeng Zhang
Yilin Liu
Ping Gong
Peizhuo Li
Mingfeng Fan
Guillaume Sartoretti
43
0
0
14 Mar 2025
SrSv: Integrating Sequential Rollouts with Sequential Value Estimation for Multi-agent Reinforcement Learning
Xu Wan
Chao Yang
Cheng Yang
Jie Song
Mingyang Sun
61
0
0
03 Mar 2025
Cooperative and Asynchronous Transformer-based Mission Planning for Heterogeneous Teams of Mobile Robots
Cooperative and Asynchronous Transformer-based Mission Planning for Heterogeneous Teams of Mobile Robots
Milad Farjadnasab
Shahin Sirouspour
33
0
0
08 Oct 2024
Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in
  Autonomous Driving
Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous Driving
Zhi Zheng
Shangding Gu
35
2
0
28 May 2024
Reinforcing Language Agents via Policy Optimization with Action
  Decomposition
Reinforcing Language Agents via Policy Optimization with Action Decomposition
Muning Wen
Ziyu Wan
Weinan Zhang
Jun Wang
Ying Wen
38
7
0
23 May 2024
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent
  Baseline
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline
Wenjia Meng
Qian Zheng
Long Yang
Yilong Yin
Gang Pan
OffRL
34
0
0
04 May 2024
Centralized vs. Decentralized Multi-Agent Reinforcement Learning for
  Enhanced Control of Electric Vehicle Charging Networks
Centralized vs. Decentralized Multi-Agent Reinforcement Learning for Enhanced Control of Electric Vehicle Charging Networks
Amin Shojaeighadikolaei
Zsolt Talata
Morteza Hashemi
33
2
0
18 Apr 2024
Multi-agent transformer-accelerated RL for satisfaction of STL
  specifications
Multi-agent transformer-accelerated RL for satisfaction of STL specifications
Albin Larsson Forsberg
Alexandros Nikou
Aneta Vulgarakis Feljan
Jana Tumova
32
1
0
23 Mar 2024
Offline Multi-Agent Reinforcement Learning with Coupled Value
  Factorization
Offline Multi-Agent Reinforcement Learning with Coupled Value Factorization
Xiangsen Wang
Xianyuan Zhan
OffRL
19
5
0
15 Jun 2023
Heterogeneous-Agent Reinforcement Learning
Heterogeneous-Agent Reinforcement Learning
Yifan Zhong
J. Kuba
Xidong Feng
Siyi Hu
Jiaming Ji
Yaodong Yang
18
36
0
19 Apr 2023
On Transforming Reinforcement Learning by Transformer: The Development
  Trajectory
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya-Qin Zhang
Yixin Chen
Dacheng Tao
OffRL
23
24
0
29 Dec 2022
ACE: Cooperative Multi-agent Q-learning with Bidirectional
  Action-Dependency
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency
Chuming Li
Jie Liu
Yinmin Zhang
Yuhong Wei
Yazhe Niu
Yaodong Yang
Y. Liu
Wanli Ouyang
43
23
0
29 Nov 2022
Towards a Standardised Performance Evaluation Protocol for Cooperative
  MARL
Towards a Standardised Performance Evaluation Protocol for Cooperative MARL
R. Gorsane
Omayma Mahjoub
Ruan de Kock
Roland Dubb
Siddarth S. Singh
Arnu Pretorius
OffRL
39
49
0
21 Sep 2022
Taming Multi-Agent Reinforcement Learning with Estimator Variance
  Reduction
Taming Multi-Agent Reinforcement Learning with Estimator Variance Reduction
Taher Jafferjee
Juliusz Ziomek
Tianpei Yang
Zipeng Dai
Jianhong Wang
Matthew E. Taylor
Kun Shao
J. Wang
D. Mguni
29
0
0
02 Sep 2022
Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to
  Cooperative MARL
Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
J. Kuba
Xidong Feng
Shiyao Ding
Hao Dong
Jun Wang
Yaodong Yang
18
16
0
02 Aug 2022
Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning
Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning
Shunyu Liu
Jie Song
Yihe Zhou
Na Yu
Kaixuan Chen
Zunlei Feng
Mingli Song
20
7
0
08 Jul 2022
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement
  Learning
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Yuanpei Chen
Tianhao Wu
Shengjie Wang
Xidong Feng
Jiechuan Jiang
...
Yiran Geng
Hao Dong
Zongqing Lu
Song-Chun Zhu
Yaodong Yang
OffRL
33
108
0
17 Jun 2022
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Muning Wen
J. Kuba
Runji Lin
Weinan Zhang
Ying Wen
J. Wang
Yaodong Yang
26
178
0
30 May 2022
A Review of Safe Reinforcement Learning: Methods, Theory and
  Applications
A Review of Safe Reinforcement Learning: Methods, Theory and Applications
Shangding Gu
Longyu Yang
Yali Du
Guang Chen
Florian Walter
Jun Wang
Alois C. Knoll
OffRL
AI4TS
115
237
0
20 May 2022
Understanding Value Decomposition Algorithms in Deep Cooperative
  Multi-Agent Reinforcement Learning
Understanding Value Decomposition Algorithms in Deep Cooperative Multi-Agent Reinforcement Learning
Zehao Dou
J. Kuba
Yaodong Yang
FAtt
14
5
0
10 Feb 2022
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence
  Model Tackles All SMAC Tasks
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks
Linghui Meng
Muning Wen
Yaodong Yang
Chenyang Le
Xiyun Li
Weinan Zhang
Ying Wen
Haifeng Zhang
Jun Wang
Bo Xu
OffRL
26
38
0
06 Dec 2021
LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent
  Learning
LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning
D. Mguni
Taher Jafferjee
Jianhong Wang
Oliver Slumbers
Nicolas Perez Nieves
Feifei Tong
Yang Li
Jiangcheng Zhu
Yaodong Yang
Jun Wang
31
18
0
05 Dec 2021
Multi-Agent Constrained Policy Optimisation
Multi-Agent Constrained Policy Optimisation
Shangding Gu
J. Kuba
Munning Wen
Ruiqing Chen
Ziyan Wang
Zheng Tian
Jun Wang
Alois Knoll
Yaodong Yang
95
49
0
06 Oct 2021
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
J. Kuba
Ruiqing Chen
Munning Wen
Ying Wen
Fanglei Sun
Jun Wang
Yaodong Yang
18
229
0
23 Sep 2021
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for
  Autonomous Driving
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving
Ming Zhou
Jun-Jie Luo
Julian Villela
Yaodong Yang
David Rusu
...
H. Ammar
Hongbo Zhang
Wulong Liu
Jianye Hao
Jun Wang
136
193
0
19 Oct 2020
Bi-level Actor-Critic for Multi-agent Coordination
Bi-level Actor-Critic for Multi-agent Coordination
Haifeng Zhang
Weizhe Chen
Zeren Huang
Minne Li
Yaodong Yang
Weinan Zhang
Jun Wang
96
91
0
08 Sep 2019
1