ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.11883
  4. Cited By
Regularized Softmax Deep Multi-Agent $Q$-Learning
v1v2 (latest)

Regularized Softmax Deep Multi-Agent QQQ-Learning

Neural Information Processing Systems (NeurIPS), 2021
22 March 2021
L. Pan
Tabish Rashid
Bei Peng
Longbo Huang
Shimon Whiteson
ArXiv (abs)PDFHTML

Papers citing "Regularized Softmax Deep Multi-Agent $Q$-Learning"

12 / 12 papers shown
Title
Large-scale automatic carbon ion treatment planning for head and neck cancers via parallel multi-agent reinforcement learning
Large-scale automatic carbon ion treatment planning for head and neck cancers via parallel multi-agent reinforcement learning
Jueye Zhang
Chao Yang
Youfang Lai
Kai-Wen Li
Wenting Yan
...
Jingjing Zhou
Gen Yang
Chen Lin
Tian Li
Yibao Zhang
OffRL
56
0
0
04 Nov 2025
Choices are More Important than Efforts: LLM Enables Efficient
  Multi-Agent Exploration
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration
Yun Qu
Boyuan Wang
Yuhang Jiang
Jianzhun Shao
Yixiu Mao
Cheems Wang
Chang Liu
Xiangyang Ji
274
9
0
03 Oct 2024
Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse
  Training
Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse TrainingNeural Information Processing Systems (NeurIPS), 2024
Pihe Hu
Shaolong Li
Zhuoran Li
L. Pan
Longbo Huang
142
1
0
28 Sep 2024
Language-Conditioned Offline RL for Multi-Robot Navigation
Language-Conditioned Offline RL for Multi-Robot NavigationIEEE International Conference on Robotics and Automation (ICRA), 2024
Steven D. Morad
Ajay Shankar
J. Blumenkamp
Amanda Prorok
LM&RoOffRL
195
10
0
29 Jul 2024
Counterfactual Conservative Q Learning for Offline Multi-agent
  Reinforcement Learning
Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Jianzhun Shao
Yun Qu
Chen Chen
Hongchang Zhang
Xiangyang Ji
OffRL
158
36
0
22 Sep 2023
Formal Modelling for Multi-Robot Systems Under Uncertainty
Formal Modelling for Multi-Robot Systems Under UncertaintyCurrent Robotics Reports (CRR), 2023
Charlie Street
Masoumeh Mansouri
Bruno Lacerda
176
4
0
26 May 2023
Towards a Standardised Performance Evaluation Protocol for Cooperative
  MARL
Towards a Standardised Performance Evaluation Protocol for Cooperative MARLNeural Information Processing Systems (NeurIPS), 2022
R. Gorsane
Omayma Mahjoub
Ruan de Kock
Roland Dubb
Siddarth S. Singh
Arnu Pretorius
OffRL
179
60
0
21 Sep 2022
MIXRTs: Toward Interpretable Multi-Agent Reinforcement Learning via Mixing Recurrent Soft Decision Trees
MIXRTs: Toward Interpretable Multi-Agent Reinforcement Learning via Mixing Recurrent Soft Decision TreesIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Zichuan Liu
Zichuan Liu
Zhi Wang
Yuanyang Zhu
Chunlin Chen
468
13
0
15 Sep 2022
Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning
Interaction Pattern Disentangling for Multi-Agent Reinforcement LearningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Shunyu Liu
Mingli Song
Yihe Zhou
Na Yu
Kaixuan Chen
Zunlei Feng
Weilong Dai
258
16
0
08 Jul 2022
Off-Beat Multi-Agent Reinforcement Learning
Off-Beat Multi-Agent Reinforcement LearningAdaptive Agents and Multi-Agent Systems (AAMAS), 2022
Wei Qiu
Weixun Wang
Rongpin Wang
Bo An
Yujing Hu
S. Obraztsova
Zinovi Rabinovich
Jianye Hao
Yingfeng Chen
Changjie Fan
OffRL
125
2
0
27 May 2022
Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement
  Learning with Actor Rectification
Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor RectificationInternational Conference on Machine Learning (ICML), 2021
L. Pan
Longbo Huang
Tengyu Ma
Huazhe Xu
OffRLOnRL
303
69
0
22 Nov 2021
Divergence-Regularized Multi-Agent Actor-Critic
Divergence-Regularized Multi-Agent Actor-Critic
Kefan Su
Zongqing Lu
282
28
0
01 Oct 2021
1