Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2109.11251
Cited By
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
23 September 2021
J. Kuba
Ruiqing Chen
Munning Wen
Ying Wen
Fanglei Sun
Jun Wang
Yaodong Yang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning"
50 / 116 papers shown
Title
A Review of Cooperation in Multi-agent Learning
Yali Du
Joel Z. Leibo
Usman Islam
Richard Willis
P. Sunehag
38
31
0
08 Dec 2023
Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach
Bin Zhang
Hangyu Mao
Jingqing Ruan
Ying Wen
Yang Li
...
Dapeng Li
Ziyue Li
Rui Zhao
Lijuan Li
Guoliang Fan
LM&Ro
LLMAG
19
34
0
23 Nov 2023
JaxMARL: Multi-Agent RL Environments in JAX
Alex Rutherford
Benjamin Ellis
Matteo Gallici
Jonathan Cook
Andrei Lupu
...
Bruno Lacerda
Nick Hawes
Tim Rocktaschel
Chris Xiaoxuan Lu
Jakob N. Foerster
28
20
0
16 Nov 2023
AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation
Daiki E. Matsunaga
Jongmin Lee
Jaeseok Yoon
Stefanos Leonardos
Pieter Abbeel
Kee-Eung Kim
OODD
OffRL
22
3
0
03 Nov 2023
Optimistic Multi-Agent Policy Gradient
Wenshuai Zhao
Yi Zhao
Zhiyuan Li
Juho Kannala
J. Pajarinen
18
0
0
03 Nov 2023
Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark
Jiaming Ji
Borong Zhang
Jiayi Zhou
Xuehai Pan
Weidong Huang
Ruiyang Sun
Yiran Geng
Yifan Zhong
Juntao Dai
Yaodong Yang
OffRL
28
62
0
19 Oct 2023
Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization
Simin Li
Ruixiao Xu
Jingqiao Xiu
Yuwei Zheng
Pu Feng
Yaodong Yang
Xianglong Liu
23
3
0
15 Oct 2023
FP3O: Enabling Proximal Policy Optimization in Multi-Agent Cooperation with Parameter-Sharing Versatility
Lang Feng
Dong Xing
Junru Zhang
Gang Pan
21
1
0
08 Oct 2023
COMPOSER: Scalable and Robust Modular Policies for Snake Robots
Yuyou Zhang
Yaru Niu
Xingyu Liu
Ding Zhao
19
2
0
02 Oct 2023
Multi-Robot Cooperative Socially-Aware Navigation Using Multi-Agent Reinforcement Learning
Weizheng Wang
Le Mao
Ruiqi Wang
Byung-Cheol Min
43
14
0
26 Sep 2023
Boosting Studies of Multi-Agent Reinforcement Learning on Google Research Football Environment: the Past, Present, and Future
Yan Song
He Jiang
Haifeng Zhang
Zheng Tian
Weinan Zhang
Jun Wang
OffRL
21
8
0
22 Sep 2023
Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning
Jianzhun Shao
Yun Qu
Chen Chen
Hongchang Zhang
Xiangyang Ji
OffRL
13
19
0
22 Sep 2023
Policy Diversity for Cooperative Agents
M. Tan
Andong Tian
Ludovic Denoyer
21
2
0
28 Aug 2023
Heterogeneous Multi-Agent Reinforcement Learning via Mirror Descent Policy Optimization
Mohammad Mehdi Nasiri
M. Rezghi
35
0
0
13 Aug 2023
Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization
Xiangsen Wang
Haoran Xu
Yinan Zheng
Xianyuan Zhan
OffRL
33
23
0
21 Jul 2023
Transformers in Reinforcement Learning: A Survey
Pranav Agarwal
A. Rahman
P. St-Charles
Simon J. D. Prince
Samira Ebrahimi Kahou
OffRL
24
18
0
12 Jul 2023
IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL
Pascal Leroy
P. G. Morato
J. Pisane
A. Kolios
D. Ernst
OffRL
35
9
0
20 Jun 2023
MA2CL:Masked Attentive Contrastive Learning for Multi-Agent Reinforcement Learning
Haolin Song
Ming Feng
Wen-gang Zhou
Houqiang Li
OffRL
17
5
0
03 Jun 2023
Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?
Yihe Zhou
Shunyu Liu
Yunpeng Qing
Kaixuan Chen
Tongya Zheng
Yanhao Huang
Jie Song
28
17
0
27 May 2023
Stackelberg Decision Transformer for Asynchronous Action Coordination in Multi-Agent Systems
Bin Zhang
Hangyu Mao
Lijuan Li
Zhiwei Xu
Dapeng Li
Rui Zhao
Guoliang Fan
OffRL
31
5
0
13 May 2023
Cooperative Multi-Agent Reinforcement Learning: Asynchronous Communication and Linear Function Approximation
Yifei Min
Jiafan He
Tianhao Wang
Quanquan Gu
38
7
0
10 May 2023
Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning
Yulai Zhao
Zhuoran Yang
Zhaoran Wang
Jason D. Lee
35
3
0
08 May 2023
From Explicit Communication to Tacit Cooperation:A Novel Paradigm for Cooperative MARL
Dapeng Li
Zhiwei Xu
Bin Zhang
Guoliang Fan
46
7
0
28 Apr 2023
Inducing Stackelberg Equilibrium through Spatio-Temporal Sequential Decision-Making in Multi-Agent Reinforcement Learning
Bin Zhang
Lijuan Li
Zhiwei Xu
Dapeng Li
Guoliang Fan
12
9
0
20 Apr 2023
Multi-agent Policy Reciprocity with Theoretical Guarantee
Haozhi Wang
Yinchuan Li
Qing Wang
Yunfeng Shao
Jianye Hao
17
0
0
12 Apr 2023
NeuronsMAE: A Novel Multi-Agent Reinforcement Learning Environment for Cooperative and Competitive Multi-Robot Tasks
Guangzhen Hu
Haoran Li
Shasha Liu
Mingjun Ma
Yuanheng Zhu
Dongbin Zhao
OffRL
34
6
0
22 Mar 2023
Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning
Xutong Zhao
Yangchen Pan
Chenjun Xiao
Sarath Chandar
Janarthanan Rajendran
19
5
0
16 Mar 2023
GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement Learning
Xiaoyang Yu
Youfang Lin
Xiangsen Wang
Sheng Han
Kai Lv
19
0
0
02 Mar 2023
Efficient Communication via Self-supervised Information Aggregation for Online and Offline Multi-agent Reinforcement Learning
Cong Guan
F. Chen
Lei Yuan
Zongzhang Zhang
Yang Yu
OffRL
37
4
0
19 Feb 2023
Order Matters: Agent-by-agent Policy Optimization
Xihuai Wang
Zheng Tian
Ziyu Wan
Ying Wen
J. Wang
Weinan Zhang
25
26
0
13 Feb 2023
Improving Zero-Shot Coordination Performance Based on Policy Similarity
Lebin Yu
Yunbo Qiu
Quanming Yao
Xudong Zhang
Jian Wang
16
1
0
10 Feb 2023
Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence
Simin Li
Jun Guo
Jingqiao Xiu
Pu Feng
Xin Yu
Aishan Liu
Wenjun Wu
Xianglong Liu
AAML
42
13
0
07 Feb 2023
Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning
Haoxuan Pan
Deheng Ye
Xiaoming Duan
Qiang Fu
Wei Yang
Jianping He
Mingfei Sun
OffRL
23
2
0
20 Jan 2023
Actor-Director-Critic: A Novel Deep Reinforcement Learning Framework
Zongwei Liu
Yonghong Song
Yuanlin Zhang
OffRL
25
2
0
10 Jan 2023
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya-Qin Zhang
Yixin Chen
Dacheng Tao
OffRL
25
25
0
29 Dec 2022
On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective
Ying Wen
Ziyu Wan
M. Zhou
Shufang Hou
Zhe Cao
Chenyang Le
Jingxiao Chen
Zheng Tian
Weinan Zhang
J. Wang
AI4CE
18
10
0
24 Dec 2022
Curriculum Learning for Relative Overgeneralization
Lin Shi
Bei Peng
25
1
0
06 Dec 2022
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency
Chuming Li
Jie Liu
Yinmin Zhang
Yuhong Wei
Yazhe Niu
Yaodong Yang
Y. Liu
Wanli Ouyang
46
23
0
29 Nov 2022
Value-based CTDE Methods in Symmetric Two-team Markov Game: from Cooperation to Team Competition
Pascal Leroy
J. Pisane
D. Ernst
17
3
0
21 Nov 2022
Decentralized Policy Optimization
Kefan Su
Zongqing Lu
11
8
0
06 Nov 2022
MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library
Siyi Hu
Yifan Zhong
Minquan Gao
Weixun Wang
Hao Dong
Xiaodan Liang
Zhihui Li
Xiaojun Chang
Yaodong Yang
15
14
0
11 Oct 2022
Stateful active facilitator: Coordination and Environmental Heterogeneity in Cooperative Multi-Agent Reinforcement Learning
Dianbo Liu
Vedant Shah
Oussama Boussif
Cristian Meo
Anirudh Goyal
Tianmin Shu
Michael C. Mozer
N. Heess
Yoshua Bengio
24
7
0
04 Oct 2022
More Centralized Training, Still Decentralized Execution: Multi-Agent Conditional Policy Factorization
Jiangxing Wang
Deheng Ye
Zongqing Lu
OffRL
39
18
0
26 Sep 2022
Towards a Standardised Performance Evaluation Protocol for Cooperative MARL
R. Gorsane
Omayma Mahjoub
Ruan de Kock
Roland Dubb
Siddarth S. Singh
Arnu Pretorius
OffRL
39
49
0
21 Sep 2022
Maximum Correntropy Value Decomposition for Multi-agent Deep Reinforcemen Learning
Kai Liu
Tianxian Zhang
L. Kong
28
0
0
07 Aug 2022
Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
J. Kuba
Xidong Feng
Shiyao Ding
Hao Dong
Jun Wang
Yaodong Yang
18
16
0
02 Aug 2022
Towards Global Optimality in Cooperative MARL with the Transformation And Distillation Framework
Jianing Ye
Chenghao Li
Jianhao Wang
Chongjie Zhang
37
2
0
12 Jul 2022
Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning
Shunyu Liu
Jie Song
Yihe Zhou
Na Yu
Kaixuan Chen
Zunlei Feng
Mingli Song
26
7
0
08 Jul 2022
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Yuanpei Chen
Tianhao Wu
Shengjie Wang
Xidong Feng
Jiechuan Jiang
...
Yiran Geng
Hao Dong
Zongqing Lu
Song-Chun Zhu
Yaodong Yang
OffRL
38
108
0
17 Jun 2022
Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning
Wei Fu
Chao Yu
Zelai Xu
Jiaqi Yang
Yi Wu
32
32
0
15 Jun 2022
Previous
1
2
3
Next