Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2109.11251
Cited By
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
23 September 2021
J. Kuba
Ruiqing Chen
Munning Wen
Ying Wen
Fanglei Sun
Jun Wang
Yaodong Yang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning"
16 / 116 papers shown
Title
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Muning Wen
J. Kuba
Runji Lin
Weinan Zhang
Ying Wen
J. Wang
Yaodong Yang
26
178
0
30 May 2022
Off-Beat Multi-Agent Reinforcement Learning
Wei Qiu
Weixun Wang
R. Wang
Bo An
Yujing Hu
S. Obraztsova
Zinovi Rabinovich
Jianye Hao
Yingfeng Chen
Changjie Fan
OffRL
24
2
0
27 May 2022
A Review of Safe Reinforcement Learning: Methods, Theory and Applications
Shangding Gu
Longyu Yang
Yali Du
Guang Chen
Florian Walter
Jun Wang
Alois C. Knoll
OffRL
AI4TS
115
237
0
20 May 2022
FedKL: Tackling Data Heterogeneity in Federated Reinforcement Learning by Penalizing KL Divergence
Zhijie Xie
Shenghui Song
FedML
17
45
0
18 Apr 2022
CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning
Jian Zhao
Xu Hu
Mingyu Yang
Wen-gang Zhou
Jiangcheng Zhu
Houqiang Li
OffRL
17
16
0
16 Mar 2022
Communication-Efficient Actor-Critic Methods for Homogeneous Markov Games
Dingyang Chen
Yile Li
Qi Zhang
OffRL
13
10
0
18 Feb 2022
Understanding Value Decomposition Algorithms in Deep Cooperative Multi-Agent Reinforcement Learning
Zehao Dou
J. Kuba
Yaodong Yang
FAtt
14
5
0
10 Feb 2022
Trust Region Bounds for Decentralized PPO Under Non-stationarity
Mingfei Sun
Sam Devlin
Jacob Beck
Katja Hofmann
Shimon Whiteson
18
10
0
31 Jan 2022
Mirror Learning: A Unifying Framework of Policy Optimisation
J. Kuba
Christian Schroeder de Witt
Jakob N. Foerster
18
24
0
07 Jan 2022
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks
Linghui Meng
Muning Wen
Yaodong Yang
Chenyang Le
Xiyun Li
Weinan Zhang
Ying Wen
Haifeng Zhang
Jun Wang
Bo Xu
OffRL
26
38
0
06 Dec 2021
DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention
D. Mguni
Usman Islam
Taher Jafferjee
Xiuling Zhang
Joel Jennings
Aivar Sootla
Changmin Yu
Ziyan Wang
Jun Wang
Yaodong Yang
OffRL
26
7
0
27 Oct 2021
Independent Natural Policy Gradient Always Converges in Markov Potential Games
Roy Fox
Stephen Marcus McAleer
W. Overman
Ioannis Panageas
24
49
0
20 Oct 2021
Multi-Agent Constrained Policy Optimisation
Shangding Gu
J. Kuba
Munning Wen
Ruiqing Chen
Ziyan Wang
Zheng Tian
Jun Wang
Alois Knoll
Yaodong Yang
98
49
0
06 Oct 2021
Offline Decentralized Multi-Agent Reinforcement Learning
Jiechuan Jiang
Zongqing Lu
OffRL
20
37
0
04 Aug 2021
Dealing with Non-Stationarity in MARL via Trust-Region Decomposition
Wenhao Li
Xiangfeng Wang
Bo Jin
Junjie Sheng
H. Zha
31
7
0
21 Feb 2021
Bi-level Actor-Critic for Multi-agent Coordination
Haifeng Zhang
Weizhe Chen
Zeren Huang
Minne Li
Yaodong Yang
Weinan Zhang
Jun Wang
98
91
0
08 Sep 2019
Previous
1
2
3