Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2103.11883
Cited By
v1
v2 (latest)
Regularized Softmax Deep Multi-Agent
Q
Q
Q
-Learning
Neural Information Processing Systems (NeurIPS), 2021
22 March 2021
L. Pan
Tabish Rashid
Bei Peng
Longbo Huang
Shimon Whiteson
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Regularized Softmax Deep Multi-Agent $Q$-Learning"
12 / 12 papers shown
Title
Large-scale automatic carbon ion treatment planning for head and neck cancers via parallel multi-agent reinforcement learning
Jueye Zhang
Chao Yang
Youfang Lai
Kai-Wen Li
Wenting Yan
...
Jingjing Zhou
Gen Yang
Chen Lin
Tian Li
Yibao Zhang
OffRL
56
0
0
04 Nov 2025
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration
Yun Qu
Boyuan Wang
Yuhang Jiang
Jianzhun Shao
Yixiu Mao
Cheems Wang
Chang Liu
Xiangyang Ji
274
9
0
03 Oct 2024
Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse Training
Neural Information Processing Systems (NeurIPS), 2024
Pihe Hu
Shaolong Li
Zhuoran Li
L. Pan
Longbo Huang
142
1
0
28 Sep 2024
Language-Conditioned Offline RL for Multi-Robot Navigation
IEEE International Conference on Robotics and Automation (ICRA), 2024
Steven D. Morad
Ajay Shankar
J. Blumenkamp
Amanda Prorok
LM&Ro
OffRL
195
10
0
29 Jul 2024
Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
Jianzhun Shao
Yun Qu
Chen Chen
Hongchang Zhang
Xiangyang Ji
OffRL
158
36
0
22 Sep 2023
Formal Modelling for Multi-Robot Systems Under Uncertainty
Current Robotics Reports (CRR), 2023
Charlie Street
Masoumeh Mansouri
Bruno Lacerda
176
4
0
26 May 2023
Towards a Standardised Performance Evaluation Protocol for Cooperative MARL
Neural Information Processing Systems (NeurIPS), 2022
R. Gorsane
Omayma Mahjoub
Ruan de Kock
Roland Dubb
Siddarth S. Singh
Arnu Pretorius
OffRL
179
60
0
21 Sep 2022
MIXRTs: Toward Interpretable Multi-Agent Reinforcement Learning via Mixing Recurrent Soft Decision Trees
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Zichuan Liu
Zichuan Liu
Zhi Wang
Yuanyang Zhu
Chunlin Chen
468
13
0
15 Sep 2022
Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Shunyu Liu
Mingli Song
Yihe Zhou
Na Yu
Kaixuan Chen
Zunlei Feng
Weilong Dai
258
16
0
08 Jul 2022
Off-Beat Multi-Agent Reinforcement Learning
Adaptive Agents and Multi-Agent Systems (AAMAS), 2022
Wei Qiu
Weixun Wang
Rongpin Wang
Bo An
Yujing Hu
S. Obraztsova
Zinovi Rabinovich
Jianye Hao
Yingfeng Chen
Changjie Fan
OffRL
125
2
0
27 May 2022
Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification
International Conference on Machine Learning (ICML), 2021
L. Pan
Longbo Huang
Tengyu Ma
Huazhe Xu
OffRL
OnRL
303
69
0
22 Nov 2021
Divergence-Regularized Multi-Agent Actor-Critic
Kefan Su
Zongqing Lu
282
28
0
01 Oct 2021
1