Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2103.01955
Cited By
The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games
2 March 2021
Chao Yu
Akash Velu
Eugene Vinitsky
Jiaxuan Gao
Yu Wang
Alexandre M. Bayen
Yi Wu
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games"
50 / 168 papers shown
Title
Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps
Ningyuan Yang
Jiaxuan Gao
Feng Gao
Yi Wu
Chao Yu
31
0
0
15 May 2025
Optimizing Electric Bus Charging Scheduling with Uncertainties Using Hierarchical Deep Reinforcement Learning
Jiaju Qi
Lei Lei
Thorsteinn Jonsson
Dusit Niyato
26
0
0
15 May 2025
Constant-Memory Strategies in Stochastic Games: Best Responses and Equilibria
Fengming Zhu
Fangzhen Lin
29
0
0
11 May 2025
Bi-level Mean Field: Dynamic Grouping for Large-Scale MARL
Yuxuan Zheng
Yihe Zhou
Feiyang Xu
Mingli Song
Shunyu Liu
OffRL
31
0
0
10 May 2025
JaxRobotarium: Training and Deploying Multi-Robot Policies in 10 Minutes
Shalin Jain
Jiazhen Liu
Siva Kailas
Harish Ravichandar
37
0
0
10 May 2025
JAEGER: Dual-Level Humanoid Whole-Body Controller
Ziluo Ding
Haobin Jiang
Yuxuan Wang
Zhenguo Sun
Yu Zhang
Xiaojie Niu
M. Yang
Weishuai Zeng
Xinrun Xu
Zongqing Lu
31
0
0
10 May 2025
Bi-LSTM based Multi-Agent DRL with Computation-aware Pruning for Agent Twins Migration in Vehicular Embodied AI Networks
Yuxiang Wei
Zhuoqi Zeng
Yue Zhong
Jiawen Kang
R. W. Liu
M. S. Hossain
28
0
0
09 May 2025
Multi-agent Embodied AI: Advances and Future Directions
Zhaohan Feng
Ruiqi Xue
Lei Yuan
Yang Yu
Ning Ding
M. Liu
Bingzhao Gao
Jian Sun
Gang Wang
AI4CE
57
1
0
08 May 2025
PPO-ACT: Proximal Policy Optimization with Adversarial Curriculum Transfer for Spatial Public Goods Games
Zhaoqilin Yang
Chanchan Li
Xin Wang
Youliang Tian
21
0
0
07 May 2025
Exploring Equity of Climate Policies using Multi-Agent Multi-Objective Reinforcement Learning
Palok Biswas
Zuzanna Osika
Isidoro Tamassia
Adit Whorra
J. Z. Salazar
Jan Kwakkel
F. Oliehoek
P. Murukannaiah
23
0
0
02 May 2025
Multi-Agent Reinforcement Learning for Resources Allocation Optimization: A Survey
Mohamad Abdul Hady
Siyi Hu
Mahardhika Pratama
Jimmy Cao
Ryszard Kowalczyk
24
0
0
29 Apr 2025
Improving Human-AI Coordination through Adversarial Training and Generative Models
Paresh Chaudhary
Yancheng Liang
Daphne Chen
S. Du
Natasha Jaques
64
0
0
21 Apr 2025
MARFT: Multi-Agent Reinforcement Fine-Tuning
Junwei Liao
Muning Wen
Jun Wang
Wenbo Zhang
OffRL
31
0
0
21 Apr 2025
Attention-Augmented Inverse Reinforcement Learning with Graph Convolutions for Multi-Agent Task Allocation
Huilin Yin
Zhikun Yang
Linchuan Zhang
Daniel Watzenig
31
0
0
07 Apr 2025
An Organizationally-Oriented Approach to Enhancing Explainability and Control in Multi-Agent Reinforcement Learning
Julien Soulé
Jean-Paul Jamont
Michel Occello
Louis-Marie Traonouez
Paul Théron
40
0
0
30 Mar 2025
Towards Better Sample Efficiency in Multi-Agent Reinforcement Learning via Exploration
Amir Baghi
Jens Sjölund
Joakim Bergdahl
Linus Gisslén
Alessandro Sestini
58
0
0
17 Mar 2025
Why Do Multi-Agent LLM Systems Fail?
Mert Cemri
Melissa Z. Pan
Shuyi Yang
Lakshya A Agrawal
Bhavya Chopra
...
Dan Klein
Kannan Ramchandran
Matei A. Zaharia
Joseph E. Gonzalez
Ion Stoica
LLMAG
Presented at
ResearchTrend Connect | LLMAG
on
23 Apr 2025
129
8
0
17 Mar 2025
Generative Multi-Agent Q-Learning for Policy Optimization: Decentralized Wireless Networks
Talha Bozkus
U. Mitra
OffRL
40
0
0
07 Mar 2025
M3HF: Multi-agent Reinforcement Learning from Multi-phase Human Feedback of Mixed Quality
Ziyan Wang
Zhicheng Zhang
Fei Fang
Yali Du
41
0
0
03 Mar 2025
Safe Multi-Agent Navigation guided by Goal-Conditioned Safe Reinforcement Learning
Meng Feng
Viraj Parimi
B. Williams
69
1
0
25 Feb 2025
Reflection of Episodes: Learning to Play Game from Expert and Self Experiences
Xiaojie Xu
Zongyuan Li
Chang Lu
Runnan Qi
Yanan Ni
...
Yongchun Fang
Kuihua Huang
Xian Guo
Zhanghua Wu
Zhenya Li
53
0
0
19 Feb 2025
Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards
Xinyi Yang
Liang Zeng
Heng Dong
Chao Yu
X. Wu
H. Yang
Yu Wang
Milind Tambe
Tonghan Wang
76
2
0
18 Feb 2025
Low-Rank Agent-Specific Adaptation (LoRASA) for Multi-Agent Policy Learning
Beining Zhang
Aditya Kapoor
Mingfei Sun
54
0
0
08 Feb 2025
Deep Meta Coordination Graphs for Multi-agent Reinforcement Learning
Nikunj Gupta
James Zachary Hare
R. Kannan
Viktor Prasanna
GNN
76
0
0
06 Feb 2025
VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play
Zelai Xu
Chao Yu
Ruize Zhang
Huining Yuan
Xiangmin Yi
...
Wenhao Tang
Yu-Xiang Wang
Wenbo Ding
Xiusi Chen
Yu Wang
139
0
0
04 Feb 2025
State Combinatorial Generalization In Decision Making With Conditional Diffusion Models
Xintong Duan
Yutong He
Fahim Tajwar
Wen-Tse Chen
Ruslan Salakhutdinov
Jeff Schneider
OffRL
AI4CE
99
0
0
22 Jan 2025
CORD: Generalizable Cooperation via Role Diversity
Kanefumi Matsuyama
Kefan Su
Jiangxing Wang
Deheng Ye
Zongqing Lu
40
0
0
04 Jan 2025
MADiff: Offline Multi-agent Learning with Diffusion Models
Zhengbang Zhu
Minghuan Liu
Liyuan Mao
Bingyi Kang
Minkai Xu
Yong Yu
Stefano Ermon
Weinan Zhang
DiffM
OffRL
85
34
0
03 Jan 2025
Symmetries-enhanced Multi-Agent Reinforcement Learning
N. Bousias
Stefanos Pertigkiozoglou
Kostas Daniilidis
George Pappas
AI4CE
76
0
0
02 Jan 2025
Learning Policies for Dynamic Coalition Formation in Multi-Robot Task Allocation
Lucas C. D. Bezerra
Ataíde M. G. dos Santos
Shinkyu Park
37
0
0
29 Dec 2024
Harnessing Language for Coordination: A Framework and Benchmark for LLM-Driven Multi-Agent Control
Timothée Anne
Noah Syrkis
Meriem Elhosni
Florian Turati
Franck Legendre
Alain Jaquier
Sebastian Risi
LLMAG
92
2
0
16 Dec 2024
Deploying Ten Thousand Robots: Scalable Imitation Learning for Lifelong Multi-Agent Path Finding
He Jiang
Yutong Wang
Rishi Veerapaneni
Tanishq Duhan
Guillaume Sartoretti
Jiaoyang Li
37
0
0
28 Oct 2024
MARLIN: Multi-Agent Reinforcement Learning Guided by Language-Based Inter-Robot Negotiation
Toby Godfrey
William Hunt
Mohammad D. Soorati
61
1
0
18 Oct 2024
On Diffusion Models for Multi-Agent Partial Observability: Shared Attractors, Error Bounds, and Composite Flow
Tonghan Wang
Heng Dong
Yanchen Jiang
David C. Parkes
Milind Tambe
DiffM
47
2
0
17 Oct 2024
Learning to Balance Altruism and Self-interest Based on Empathy in Mixed-Motive Games
Fanqi Kong
Yizhe Huang
Song-Chun Zhu
Siyuan Qi
Xue Feng
26
2
0
10 Oct 2024
Cooperative and Asynchronous Transformer-based Mission Planning for Heterogeneous Teams of Mobile Robots
Milad Farjadnasab
Shahin Sirouspour
38
0
0
08 Oct 2024
Enabling Multi-Robot Collaboration from Single-Human Guidance
Zhengran Ji
Lingyu Zhang
Paul Sajda
Boyuan Chen
34
1
0
30 Sep 2024
Dashing for the Golden Snitch: Multi-Drone Time-Optimal Motion Planning with Multi-Agent Reinforcement Learning
X. Wang
Jin Zhou
Yuanli Feng
Jiahao Mei
Jiming Chen
Shuo Li
31
1
0
25 Sep 2024
OLiVia-Nav: An Online Lifelong Vision Language Approach for Mobile Robot Social Navigation
Siddarth Narasimhan
Aaron Hao Tan
Daniel Choi
G. Nejat
LM&Ro
38
3
0
20 Sep 2024
Applying Action Masking and Curriculum Learning Techniques to Improve Data Efficiency and Overall Performance in Operational Technology Cyber Security using Reinforcement Learning
Alec Wilson
William Holmes
Ryan Menzies
Kez Smithson Whitehead
33
0
0
13 Sep 2024
Cooperative Path Planning with Asynchronous Multiagent Reinforcement Learning
Jiaming Yin
Weixiong Rao
Yu Xiao
Keshuang Tang
18
0
0
01 Sep 2024
Strategy Game-Playing with Size-Constrained State Abstraction
Linjie Xu
Diego Perez-Liebana
Alexander Dockhorn
35
0
0
12 Aug 2024
The Bandit Whisperer: Communication Learning for Restless Bandits
Yunfan Zhao
Tonghan Wang
Dheeraj M. Nagaraj
Aparna Taneja
Milind Tambe
49
5
0
11 Aug 2024
InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply Chains
Yinzhu Quan
Zefang Liu
LLMAG
43
2
0
16 Jul 2024
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
62
14
0
05 Jul 2024
The Overcooked Generalisation Challenge
Constantin Ruhdorfer
Matteo Bortoletto
Anna Penzkofer
Andreas Bulling
48
4
0
25 Jun 2024
Concentrate Attention: Towards Domain-Generalizable Prompt Optimization for Language Models
Chengzhengxu Li
Xiaoming Liu
Zhaohan Zhang
Yichen Wang
Chen Liu
Y. Lan
Chao Shen
57
2
0
15 Jun 2024
Dispelling the Mirage of Progress in Offline MARL through Standardised Baselines and Evaluation
Claude Formanek
C. Tilbury
Louise Beyers
Jonathan P. Shock
Arnu Pretorius
OffRL
39
1
0
13 Jun 2024
Carbon Market Simulation with Adaptive Mechanism Design
Han Wang
Wenhao Li
Hongyuan Zha
Baoxiang Wang
27
3
0
12 Jun 2024
Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement Learning
Xinran Li
Zifan Liu
Shibo Chen
Jun Zhang
31
2
0
28 May 2024
1
2
3
4
Next