2103.01955
The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games
2 March 2021
Chao Yu
Akash Velu
Eugene Vinitsky
Jiaxuan Gao
Yu Wang
Alexandre M. Bayen
Yi Wu
OffRL
Papers citing
"The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games"
50 / 168 papers shown
Single Node Injection Label Specificity Attack on Graph Neural Networks via Reinforcement Learning
Dayuan Chen
Jian Zhang
Yuqian Lv
Jinhuan Wang
Hongjie Ni
Shanqing Yu
Zhen Wang
Qi Xuan
AAML
23
3
0
04 May 2023
MABL: Bi-Level Latent-Variable World Model for Sample-Efficient Multi-Agent Reinforcement Learning
Aravind Venugopal
Stephanie Milani
Fei Fang
Balaraman Ravindran
OffRL
18
0
0
12 Apr 2023
The challenge of redundancy on multi-agent value factorisation
Siddarth S. Singh
Benjamin Rosman
36
1
0
28 Mar 2023
Concept Learning for Interpretable Multi-Agent Reinforcement Learning
Renos Zabounidis
Joseph Campbell
Simon Stepputtis
Dana Hughes
Katia P. Sycara
31
15
0
23 Feb 2023
Curiosity-driven Exploration in Sparse-reward Multi-agent Reinforcement Learning
Jiong Li
Pratik Gajane
37
4
0
21 Feb 2023
Graph Attention Multi-Agent Fleet Autonomy for Advanced Air Mobility
Malintha Fernando
Ransalu Senanayake
Heeyoul Choi
Martin Swany
37
4
0
14 Feb 2023
MANSA: Learning Fast and Slow in Multi-Agent Systems
D. Mguni
Hao Chen
Taher Jafferjee
Jianhong Wang
Long Fei
Xidong Feng
Stephen Marcus McAleer
Feifei Tong
Jun Wang
Yaodong Yang
30
1
0
12 Feb 2023
Learning Complex Teamwork Tasks Using a Given Sub-task Decomposition
Elliot Fosong
Arrasy Rahman
Ignacio Carlucho
Stefano V. Albrecht
30
5
0
09 Feb 2023
Dealing With Non-stationarity in Decentralized Cooperative Multi-Agent Deep Reinforcement Learning via Multi-Timescale Learning
Hadi Nekoei
Akilesh Badrinaaraayanan
Amit Sinha
Mohammad Amini
Janarthanan Rajendran
Aditya Mahajan
Sarath Chandar
28
13
0
06 Feb 2023
Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased
Chao Yu
Jiaxuan Gao
Weiling Liu
Bo Xu
Hao Tang
Jiaqi Yang
Yu Wang
Yi Wu
31
39
0
03 Feb 2023
Best Possible Q-Learning
Jiechuan Jiang
Zongqing Lu
OffRL
20
5
0
02 Feb 2023
Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration
Chao Yu
Xinyi Yang
Jiaxuan Gao
Jiayu Chen
Yunfei Li
...
Yunfei Xiang
Rui Huang
Huazhong Yang
Yi Wu
Yu Wang
33
35
0
09 Jan 2023
A Survey on Transformers in Reinforcement Learning
Wenzhe Li
Hao Luo
Zichuan Lin
Chongjie Zhang
Zongqing Lu
Deheng Ye
OffRL
MU
AI4CE
37
55
0
08 Jan 2023
Self-Motivated Multi-Agent Exploration
Shaowei Zhang
Jiahan Cao
Lei Yuan
Yang Yu
De-Chuan Zhan
44
5
0
05 Jan 2023
Scalable Communication for Multi-Agent Reinforcement Learning via Transformer-Based Email Mechanism
Xudong Guo
Daming Shi
Wenhui Fan
22
5
0
05 Jan 2023
SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Benjamin Ellis
Jonathan Cook
S. Moalla
Mikayel Samvelyan
Mingfei Sun
Anuj Mahajan
Jakob N. Foerster
Shimon Whiteson
19
83
0
14 Dec 2022
Effects of Spectral Normalization in Multi-agent Reinforcement Learning
K. Mehta
Anuj Mahajan
Kiran Ravish
21
7
0
10 Dec 2022
Curriculum Learning for Relative Overgeneralization
Lin Shi
Bei Peng
25
1
0
06 Dec 2022
What is the Solution for State-Adversarial Multi-Agent Reinforcement Learning?
Songyang Han
Sanbao Su
Sihong He
Shuo Han
Haizhao Yang
Shaofeng Zou
Fei Miao
AAML
25
22
0
06 Dec 2022
E-MAPP: Efficient Multi-Agent Reinforcement Learning with Parallel Program Guidance
C. Chang
Ni Mu
Jiajun Wu
Ling Pan
Huazhe Xu
50
7
0
05 Dec 2022
Satellite Navigation and Coordination with Limited Information Sharing
Sydney I. Dolan
Siddharth Nayak
H. Balakrishnan
22
5
0
07 Nov 2022
Machine Learning-Aided Operations and Communications of Unmanned Aerial Vehicles: A Contemporary Survey
Harrison Kurunathan
Hailong Huang
Kai Li
Wei Ni
E. Hossain
18
70
0
07 Nov 2022
Decentralized Policy Optimization
Kefan Su
Zongqing Lu
11
8
0
06 Nov 2022
Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation
Siddharth Nayak
Kenneth M. F. Choi
Wenqi Ding
Sydney I. Dolan
Karthik Gopalakrishnan
H. Balakrishnan
17
29
0
03 Nov 2022
PTDE: Personalized Training with Distilled Execution for Multi-Agent Reinforcement Learning
Yiqun Chen
Hangyu Mao
Jiaxin Mao
Shiguang Wu
Tianle Zhang
Bin Zhang
Bin Wang
Hong Chang
OffRL
36
7
0
17 Oct 2022
Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations
N. Vadori
Leo Ardon
Sumitra Ganesh
Thomas Spooner
Selim Amrouni
Jared Vann
Mengda Xu
Zeyu Zheng
T. Balch
Manuela Veloso
18
16
0
13 Oct 2022
Human-AI Coordination via Human-Regularized Search and Learning
Hengyuan Hu
David J. Wu
Adam Lerer
Jakob N. Foerster
Noam Brown
11
7
0
11 Oct 2022
Stateful active facilitator: Coordination and Environmental Heterogeneity in Cooperative Multi-Agent Reinforcement Learning
Dianbo Liu
Vedant Shah
Oussama Boussif
Cristian Meo
Anirudh Goyal
Tianmin Shu
Michael C. Mozer
N. Heess
Yoshua Bengio
24
7
0
04 Oct 2022
MSRL: Distributed Reinforcement Learning with Dataflow Fragments
Huanzhou Zhu
Bo Zhao
Gang Chen
Weifeng Chen
Yijie Chen
Liang Shi
Yaodong Yang
Peter R. Pietzuch
Lei Chen
OffRL
MoE
16
6
0
03 Oct 2022
Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning
Filippos Christianos
Georgios Papoudakis
Stefano V. Albrecht
29
4
0
28 Sep 2022
Scalable Task-Driven Robotic Swarm Control via Collision Avoidance and Learning Mean-Field Control
Kai Cui
Mengguang Li
Christian Fabian
Heinz Koeppl
AI4CE
37
5
0
15 Sep 2022
Decentralized Coordination in Partially Observable Queueing Networks
Jiekai Jia
Anam Tahir
Heinz Koeppl
39
1
0
29 Aug 2022
AI for Global Climate Cooperation: Modeling Global Climate Negotiations, Agreements, and Long-Term Cooperation in RICE-N
Tianyu Zhang
Andrew Robert Williams
Soham R. Phade
Sunil Srinivasa
Yang Zhang
Prateek Gupta
Yoshua Bengio
Stephan Zheng
AI4CE
19
21
0
15 Aug 2022
A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games
Zihan Ding
DiJia Su
Qinghua Liu
Chi Jin
33
3
0
18 Jul 2022
Scalable Model-based Policy Optimization for Decentralized Networked Systems
Yali Du
Chengdong Ma
Yuchen Liu
Runji Lin
Hao Dong
Jun Wang
Yaodong Yang
31
8
0
13 Jul 2022
VMAS: A Vectorized Multi-Agent Simulator for Collective Robot Learning
Matteo Bettini
Ryan Kortvelesy
J. Blumenkamp
Amanda Prorok
18
36
0
07 Jul 2022
NVIF: Neighboring Variational Information Flow for Large-Scale Cooperative Multi-Agent Scenarios
Jiajun Chai
Yuanheng Zhu
Dongbin Zhao
26
0
0
03 Jul 2022
Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
Miguel Suau
Jinke He
Mustafa Mert Çelikok
M. Spaan
F. Oliehoek
16
1
0
01 Jul 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Yunfei Li
Tian Gao
Jiaqi Yang
Huazhe Xu
Yi Wu
OffRL
19
22
0
24 Jun 2022
Nocturne: a scalable driving benchmark for bringing multi-agent learning one step closer to the real world
Eugene Vinitsky
Nathan Lichtlé
Xiaomeng Yang
Brandon Amos
Jakob N. Foerster
OffRL
38
51
0
20 Jun 2022
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Yuanpei Chen
Tianhao Wu
Shengjie Wang
Xidong Feng
Jiechuan Jiang
...
Yiran Geng
Hao Dong
Zongqing Lu
Song-Chun Zhu
Yaodong Yang
OffRL
38
108
0
17 Jun 2022
Universally Expressive Communication in Multi-Agent Reinforcement Learning
Matthew Morris
Thomas D. Barrett
Arnu Pretorius
24
4
0
14 Jun 2022
Policy Optimization for Markov Games: Unified Framework and Faster Convergence
Runyu Zhang
Qinghua Liu
Haiquan Wang
Caiming Xiong
Na Li
Yu Bai
21
26
0
06 Jun 2022
Learning Generalized Wireless MAC Communication Protocols via Abstraction
Luciano Miuccio
Salvatore Riolo
S. Samarakoon
D. Panno
M. Bennis
17
17
0
06 Jun 2022
Learning Generalizable Risk-Sensitive Policies to Coordinate in Decentralized Multi-Agent General-Sum Games
Ziyi Liu
Xian Guo
Yongchun Fang
18
0
0
31 May 2022
MAVIPER: Learning Decision Tree Policies for Interpretable Multi-Agent Reinforcement Learning
Stephanie Milani
Zhicheng Zhang
Nicholay Topin
Z. Shi
Charles A. Kamhoua
Evangelos E. Papalexakis
Fei Fang
OffRL
78
13
0
25 May 2022
Penalized Proximal Policy Optimization for Safe Reinforcement Learning
Linrui Zhang
Li Shen
Long Yang
Shi-Yong Chen
Bo Yuan
Xueqian Wang
Dacheng Tao
13
62
0
24 May 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure
Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi-Ju Chang
OffRL
30
8
0
20 May 2022
Learning Progress Driven Multi-Agent Curriculum
Wenshuai Zhao
Zhiyuan Li
Joni Pajarinen
32
0
0
20 May 2022
RoMFAC: A robust mean-field actor-critic reinforcement learning against adversarial perturbations on states
Ziyuan Zhou
Guanjun Liu
AAML
35
23
0
15 May 2022