2007.12322
Cited By
Off-Policy Multi-Agent Decomposed Policy Gradients
24 July 2020
Yihan Wang
Beining Han
Tonghan Wang
Heng Dong
Chongjie Zhang
Papers citing "Off-Policy Multi-Agent Decomposed Policy Gradients"
34 / 34 papers shown
Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning
Shuai Han
Mehdi Dastani
Shihan Wang
24
0
0
13 May 2025
Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards
Xinyi Yang
Liang Zeng
Heng Dong
C. Yu
X. Wu
H. Yang
Yu Wang
Milind Tambe
Tonghan Wang
73
2
0
18 Feb 2025
B3C: A Minimalist Approach to Offline Multi-Agent Reinforcement Learning
Woojun Kim
Katia P. Sycara
OffRL
89
0
0
30 Jan 2025
On Diffusion Models for Multi-Agent Partial Observability: Shared Attractors, Error Bounds, and Composite Flow
Tonghan Wang
Heng Dong
Yanchen Jiang
David C. Parkes
Milind Tambe
DiffM
44
2
0
17 Oct 2024
The Bandit Whisperer: Communication Learning for Restless Bandits
Yunfan Zhao
Tonghan Wang
Dheeraj M. Nagaraj
Aparna Taneja
Milind Tambe
49
5
0
11 Aug 2024
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
28
8
0
15 Dec 2023
Policy Diversity for Cooperative Agents
M. Tan
Andong Tian
Ludovic Denoyer
16
2
0
28 Aug 2023
FoX: Formation-aware exploration in multi-agent reinforcement learning
Yonghyeon Jo
Sunwoo Lee
Junghyuk Yum
Seungyul Han
27
5
0
22 Aug 2023
RGMComm: Return Gap Minimization via Discrete Communications in Multi-Agent Reinforcement Learning
Jingdi Chen
Tian-Shing Lan
Carlee Joe-Wong
15
15
0
07 Aug 2023
A Variational Approach to Mutual Information-Based Coordination for Multi-Agent Reinforcement Learning
Woojun Kim
Whiyoung Jung
Myungsik Cho
Young-Jin Sung
24
7
0
01 Mar 2023
MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization
Yongsheng Mei
Hanhan Zhou
Tian-Shing Lan
Guru Venkataramani
Peng Wei
39
38
0
21 Feb 2023
ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning
Yongsheng Mei
Hanhan Zhou
Tian-Shing Lan
24
11
0
11 Feb 2023
Best Possible Q-Learning
Jiechuan Jiang
Zongqing Lu
OffRL
20
5
0
02 Feb 2023
A Bayesian Framework for Digital Twin-Based Control, Monitoring, and Data Collection in Wireless Systems
Clement Ruah
Osvaldo Simeone
Bashir M. Al-Hashimi
24
28
0
02 Dec 2022
Non-Linear Coordination Graphs
Yipeng Kang
Tonghan Wang
Xiao-Ren Wu
Qianlan Yang
Chongjie Zhang
29
9
0
26 Oct 2022
Solving Continuous Control via Q-learning
Tim Seyde
Peter Werner
Wilko Schwarting
Igor Gilitschenski
Martin Riedmiller
Daniela Rus
Markus Wulfmeier
OffRL
LRM
27
22
0
22 Oct 2022
MA2QL: A Minimalist Approach to Fully Decentralized Multi-Agent Reinforcement Learning
Kefan Su
Siyuan Zhou
Jiechuan Jiang
Chuang Gan
Xiangjun Wang
Zongqing Lu
OffRL
28
6
0
17 Sep 2022
Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning
Wei Fu
Chao Yu
Zelai Xu
Jiaqi Yang
Yi Wu
32
32
0
15 Jun 2022
Learning-Based Data Storage [Vision] (Technical Report)
Xiang Lian
Xiaofei Zhang
28
0
0
12 Jun 2022
Multi-Agent Policy Transfer via Task Relationship Modeling
Rongjun Qin
F. Chen
Tonghan Wang
Lei Yuan
Xiaoran Wu
Zongzhang Zhang
Chongjie Zhang
Yang Yu
30
19
0
09 Mar 2022
Sound Adversarial Audio-Visual Navigation
Yinfeng Yu
Wenbing Huang
Fuchun Sun
Changan Chen
Yikai Wang
Xiaohong Liu
AAML
19
29
0
22 Feb 2022
Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization
Jian Zhao
Yue Zhang
Xu Hu
Weixun Wang
Wen-gang Zhou
Jianye Hao
Jiangcheng Zhu
Houqiang Li
14
4
0
09 Feb 2022
Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration
Lu Zheng
Jiarui Chen
Jianhao Wang
Jiamin He
Yujing Hu
Yingfeng Chen
Changjie Fan
Yang Gao
Chongjie Zhang
16
82
0
22 Nov 2021
Divergence-Regularized Multi-Agent Actor-Critic
Kefan Su
Zongqing Lu
46
25
0
01 Oct 2021
Settling the Variance of Multi-Agent Policy Gradients
J. Kuba
Muning Wen
Yaodong Yang
Linghui Meng
Shangding Gu
Haifeng Zhang
D. Mguni
Jun Wang
16
58
0
19 Aug 2021
RODE: Learning Roles to Decompose Multi-Agent Tasks
Tonghan Wang
Tarun Gupta
Anuj Mahajan
Bei Peng
Shimon Whiteson
Chongjie Zhang
OffRL
14
202
0
04 Oct 2020
QPLEX: Duplex Dueling Multi-Agent Q-Learning
Jianhao Wang
Zhizhou Ren
Terry Liu
Yang Yu
Chongjie Zhang
OffRL
14
433
0
03 Aug 2020
The Emergence of Individuality
Jiechuan Jiang
Zongqing Lu
15
34
0
10 Jun 2020
Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization
Jianhao Wang
Zhizhou Ren
Beining Han
Jianing Ye
Chongjie Zhang
OffRL
18
32
0
31 May 2020
ROMA: Multi-Agent Reinforcement Learning with Emergent Roles
Tonghan Wang
Heng Dong
V. Lesser
Chongjie Zhang
55
210
0
18 Mar 2020
MAVEN: Multi-Agent Variational Exploration
Anuj Mahajan
Tabish Rashid
Mikayel Samvelyan
Shimon Whiteson
DRL
133
355
0
16 Oct 2019
A Review of Cooperative Multi-Agent Deep Reinforcement Learning
Afshin Oroojlooyjadid
Davood Hajinezhad
40
408
0
11 Aug 2019
Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning
Jakob N. Foerster
Nantas Nardelli
Gregory Farquhar
Triantafyllos Afouras
Philip H. S. Torr
Pushmeet Kohli
Shimon Whiteson
OffRL
109
595
0
28 Feb 2017
Off-Policy Actor-Critic
T. Degris
Martha White
R. Sutton
OffRL
CML
158
220
0
22 May 2012