Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning

23 September 2021
J. Kuba
Ruiqing Chen
Muning Wen
Ying Wen
Fanglei Sun
Jun Wang
Yaodong Yang

Papers citing "Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning"

50 / 116 papers shown
Title
A Review of Cooperation in Multi-agent Learning
Yali Du
Joel Z. Leibo
Usman Islam
Richard Willis
P. Sunehag
38
31
0
08 Dec 2023
Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach
Bin Zhang
Hangyu Mao
Jingqing Ruan
Ying Wen
Yang Li
...
Dapeng Li
Ziyue Li
Rui Zhao
Lijuan Li
Guoliang Fan
LM&Ro
LLMAG
19
34
0
23 Nov 2023
JaxMARL: Multi-Agent RL Environments in JAX
Alex Rutherford
Benjamin Ellis
Matteo Gallici
Jonathan Cook
Andrei Lupu
...
Bruno Lacerda
Nick Hawes
Tim Rocktäschel
Chris Xiaoxuan Lu
Jakob N. Foerster
28
20
0
16 Nov 2023
AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation
Daiki E. Matsunaga
Jongmin Lee
Jaeseok Yoon
Stefanos Leonardos
Pieter Abbeel
Kee-Eung Kim
OODD
OffRL
22
3
0
03 Nov 2023
Optimistic Multi-Agent Policy Gradient
Wenshuai Zhao
Yi Zhao
Zhiyuan Li
Juho Kannala
J. Pajarinen
18
0
0
03 Nov 2023
Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark
Jiaming Ji
Borong Zhang
Jiayi Zhou
Xuehai Pan
Weidong Huang
Ruiyang Sun
Yiran Geng
Yifan Zhong
Juntao Dai
Yaodong Yang
OffRL
28
62
0
19 Oct 2023
Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization
Simin Li
Ruixiao Xu
Jingqiao Xiu
Yuwei Zheng
Pu Feng
Yaodong Yang
Xianglong Liu
23
3
0
15 Oct 2023
FP3O: Enabling Proximal Policy Optimization in Multi-Agent Cooperation with Parameter-Sharing Versatility
Lang Feng
Dong Xing
Junru Zhang
Gang Pan
21
1
0
08 Oct 2023
COMPOSER: Scalable and Robust Modular Policies for Snake Robots
Yuyou Zhang
Yaru Niu
Xingyu Liu
Ding Zhao
19
2
0
02 Oct 2023
Multi-Robot Cooperative Socially-Aware Navigation Using Multi-Agent Reinforcement Learning
Weizheng Wang
Le Mao
Ruiqi Wang
Byung-Cheol Min
43
14
0
26 Sep 2023
Boosting Studies of Multi-Agent Reinforcement Learning on Google Research Football Environment: the Past, Present, and Future
Yan Song
He Jiang
Haifeng Zhang
Zheng Tian
Weinan Zhang
Jun Wang
OffRL
21
8
0
22 Sep 2023
Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning
Jianzhun Shao
Yun Qu
Chen Chen
Hongchang Zhang
Xiangyang Ji
OffRL
13
19
0
22 Sep 2023
Policy Diversity for Cooperative Agents
M. Tan
Andong Tian
Ludovic Denoyer
21
2
0
28 Aug 2023
Heterogeneous Multi-Agent Reinforcement Learning via Mirror Descent Policy Optimization
Mohammad Mehdi Nasiri
M. Rezghi
35
0
0
13 Aug 2023
Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization
Xiangsen Wang
Haoran Xu
Yinan Zheng
Xianyuan Zhan
OffRL
33
23
0
21 Jul 2023
Transformers in Reinforcement Learning: A Survey
Pranav Agarwal
A. Rahman
P. St-Charles
Simon J. D. Prince
Samira Ebrahimi Kahou
OffRL
24
18
0
12 Jul 2023
IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL
Pascal Leroy
P. G. Morato
J. Pisane
A. Kolios
D. Ernst
OffRL
35
9
0
20 Jun 2023
MA2CL:Masked Attentive Contrastive Learning for Multi-Agent Reinforcement Learning
Haolin Song
Ming Feng
Wen-gang Zhou
Houqiang Li
OffRL
17
5
0
03 Jun 2023
Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?
Yihe Zhou
Shunyu Liu
Yunpeng Qing
Kaixuan Chen
Tongya Zheng
Yanhao Huang
Jie Song
28
17
0
27 May 2023
Stackelberg Decision Transformer for Asynchronous Action Coordination in Multi-Agent Systems
Bin Zhang
Hangyu Mao
Lijuan Li
Zhiwei Xu
Dapeng Li
Rui Zhao
Guoliang Fan
OffRL
31
5
0
13 May 2023
Cooperative Multi-Agent Reinforcement Learning: Asynchronous Communication and Linear Function Approximation
Yifei Min
Jiafan He
Tianhao Wang
Quanquan Gu
38
7
0
10 May 2023
Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning
Yulai Zhao
Zhuoran Yang
Zhaoran Wang
Jason D. Lee
35
3
0
08 May 2023
From Explicit Communication to Tacit Cooperation:A Novel Paradigm for Cooperative MARL
Dapeng Li
Zhiwei Xu
Bin Zhang
Guoliang Fan
46
7
0
28 Apr 2023
Inducing Stackelberg Equilibrium through Spatio-Temporal Sequential Decision-Making in Multi-Agent Reinforcement Learning
Bin Zhang
Lijuan Li
Zhiwei Xu
Dapeng Li
Guoliang Fan
12
9
0
20 Apr 2023
Multi-agent Policy Reciprocity with Theoretical Guarantee
Haozhi Wang
Yinchuan Li
Qing Wang
Yunfeng Shao
Jianye Hao
17
0
0
12 Apr 2023
NeuronsMAE: A Novel Multi-Agent Reinforcement Learning Environment for Cooperative and Competitive Multi-Robot Tasks
Guangzhen Hu
Haoran Li
Shasha Liu
Mingjun Ma
Yuanheng Zhu
Dongbin Zhao
OffRL
34
6
0
22 Mar 2023
Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning
Xutong Zhao
Yangchen Pan
Chenjun Xiao
Sarath Chandar
Janarthanan Rajendran
19
5
0
16 Mar 2023
GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement Learning
Xiaoyang Yu
Youfang Lin
Xiangsen Wang
Sheng Han
Kai Lv
19
0
0
02 Mar 2023
Efficient Communication via Self-supervised Information Aggregation for Online and Offline Multi-agent Reinforcement Learning
Cong Guan
F. Chen
Lei Yuan
Zongzhang Zhang
Yang Yu
OffRL
37
4
0
19 Feb 2023
Order Matters: Agent-by-agent Policy Optimization
Xihuai Wang
Zheng Tian
Ziyu Wan
Ying Wen
J. Wang
Weinan Zhang
25
26
0
13 Feb 2023
Improving Zero-Shot Coordination Performance Based on Policy Similarity
Lebin Yu
Yunbo Qiu
Quanming Yao
Xudong Zhang
Jian Wang
16
1
0
10 Feb 2023
Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence
Simin Li
Jun Guo
Jingqiao Xiu
Pu Feng
Xin Yu
Aishan Liu
Wenjun Wu
Xianglong Liu
AAML
42
13
0
07 Feb 2023
Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning
Haoxuan Pan
Deheng Ye
Xiaoming Duan
Qiang Fu
Wei Yang
Jianping He
Mingfei Sun
OffRL
23
2
0
20 Jan 2023
Actor-Director-Critic: A Novel Deep Reinforcement Learning Framework
Zongwei Liu
Yonghong Song
Yuanlin Zhang
OffRL
25
2
0
10 Jan 2023
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya-Qin Zhang
Yixin Chen
Dacheng Tao
OffRL
25
25
0
29 Dec 2022
On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective
Ying Wen
Ziyu Wan
M. Zhou
Shufang Hou
Zhe Cao
Chenyang Le
Jingxiao Chen
Zheng Tian
Weinan Zhang
J. Wang
AI4CE
18
10
0
24 Dec 2022
Curriculum Learning for Relative Overgeneralization
Lin Shi
Bei Peng
25
1
0
06 Dec 2022
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency
Chuming Li
Jie Liu
Yinmin Zhang
Yuhong Wei
Yazhe Niu
Yaodong Yang
Y. Liu
Wanli Ouyang
46
23
0
29 Nov 2022
Value-based CTDE Methods in Symmetric Two-team Markov Game: from Cooperation to Team Competition
Pascal Leroy
J. Pisane
D. Ernst
17
3
0
21 Nov 2022
Decentralized Policy Optimization
Kefan Su
Zongqing Lu
11
8
0
06 Nov 2022
MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library
Siyi Hu
Yifan Zhong
Minquan Gao
Weixun Wang
Hao Dong
Xiaodan Liang
Zhihui Li
Xiaojun Chang
Yaodong Yang
15
14
0
11 Oct 2022
Stateful active facilitator: Coordination and Environmental Heterogeneity in Cooperative Multi-Agent Reinforcement Learning
Dianbo Liu
Vedant Shah
Oussama Boussif
Cristian Meo
Anirudh Goyal
Tianmin Shu
Michael C. Mozer
N. Heess
Yoshua Bengio
24
7
0
04 Oct 2022
More Centralized Training, Still Decentralized Execution: Multi-Agent Conditional Policy Factorization
Jiangxing Wang
Deheng Ye
Zongqing Lu
OffRL
39
18
0
26 Sep 2022
Towards a Standardised Performance Evaluation Protocol for Cooperative MARL
R. Gorsane
Omayma Mahjoub
Ruan de Kock
Roland Dubb
Siddarth S. Singh
Arnu Pretorius
OffRL
39
49
0
21 Sep 2022
Maximum Correntropy Value Decomposition for Multi-agent Deep Reinforcemen Learning
Kai Liu
Tianxian Zhang
L. Kong
28
0
0
07 Aug 2022
Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
J. Kuba
Xidong Feng
Shiyao Ding
Hao Dong
Jun Wang
Yaodong Yang
18
16
0
02 Aug 2022
Towards Global Optimality in Cooperative MARL with the Transformation And Distillation Framework
Jianing Ye
Chenghao Li
Jianhao Wang
Chongjie Zhang
37
2
0
12 Jul 2022
Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning
Shunyu Liu
Jie Song
Yihe Zhou
Na Yu
Kaixuan Chen
Zunlei Feng
Mingli Song
26
7
0
08 Jul 2022
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Yuanpei Chen
Tianhao Wu
Shengjie Wang
Xidong Feng
Jiechuan Jiang
...
Yiran Geng
Hao Dong
Zongqing Lu
Song-Chun Zhu
Yaodong Yang
OffRL
38
108
0
17 Jun 2022
Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning
Wei Fu
Chao Yu
Zelai Xu
Jiaqi Yang
Yi Wu
32
32
0
15 Jun 2022