Guided Deep Reinforcement Learning for Swarm Systems

18 September 2017
Maximilian Hüttenrauch
Adrian Šošić
Gerhard Neumann

Papers citing "Guided Deep Reinforcement Learning for Swarm Systems"

50 / 62 papers shown
Conditional Diffusion Model for Multi-Agent Dynamic Task Decomposition
Yanda Zhu
Yuanyang Zhu
D. Dong
Caihua Chen
Chunlin Chen
DiffM
265
0
0
17 Nov 2025
Video Game Level Design as a Multi-Agent Reinforcement Learning Problem
Sam Earle
Zehua Jiang
Eugene Vinitsky
Julian Togelius
103
0
0
06 Oct 2025
Preference-Guided Learning for Sparse-Reward Multi-Agent Reinforcement Learning
Viet The Bui
Tien Mai
Hong Thanh Nguyen
OffRL
178
0
0
26 Sep 2025
VariAntNet: Learning Decentralized Control of Multi-Agent Systems
Yigal Koifman
Erez Koifman
Eran Iceland
Ariel Barel
A. Bruckstein
49
0
0
02 Sep 2025
Multi-level Advantage Credit Assignment for Cooperative Multi-Agent Reinforcement Learning
International Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Xutong Zhao
Yaqi Xie
119
0
0
09 Aug 2025
Concept Learning for Cooperative Multi-Agent Reinforcement Learning
Zhonghan Ge
Yuanyang Zhu
Chunlin Chen
162
0
0
27 Jul 2025
Learning Individual Intrinsic Reward in Multi-Agent Reinforcement Learning via Incorporating Generalized Human Expertise
Xuefei Wu
Xiao Yin
Yuanyang Zhu
Chunlin Chen
156
3
0
25 Jul 2025
Rethinking Generalizability and Discriminability of Self-Supervised Learning from Evolutionary Game Theory Perspective
International Journal of Computer Vision (IJCV), 2024
Jiangmeng Li
Zehua Zang
Qirui Ji
Chuxiong Sun
Jingyao Wang
Junge Zhang
Changwen Zheng
Gang Hua
Hui Xiong
SSL
292
3
0
30 Nov 2024
Intrinsic Action Tendency Consistency for Cooperative Multi-Agent Reinforcement Learning
Junkai Zhang
Yifan Zhang
Xi Sheryl Zhang
Yifan Zang
Jian Cheng
292
8
0
26 Jun 2024
MARL-LNS: Cooperative Multi-agent Reinforcement Learning via Large Neighborhoods Search
Weizhe Chen
Sven Koenig
B. Dilkina
169
0
0
03 Apr 2024
Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination
Liangzhou Wang
Kaiwen Zhu
Fengming Zhu
Xinghu Yao
Shujie Zhang
Deheng Ye
Haobo Fu
Qiang Fu
Wei Yang
177
4
0
05 Mar 2024
SMAUG: A Sliding Multidimensional Task Window-Based MARL Framework for Adaptive Real-Time Subtask Recognition
Wenjing Zhang
Wei Zhang
183
3
0
04 Mar 2024
Multi-Task Multi-Agent Shared Layers are Universal Cognition of Multi-Agent Coordination
Jiawei Wang
Jian Zhao
Zhengtao Cao
Ruili Feng
Rongjun Qin
Yang Yu
204
1
0
25 Dec 2023
Noise Distribution Decomposition based Multi-Agent Distributional Reinforcement Learning
IEEE Transactions on Mobile Computing (IEEE TMC), 2023
Wei Geng
Baidi Xiao
Rongpeng Li
Ning Wei
Dong Wang
Zhifeng Zhao
274
2
0
12 Dec 2023
Multi-Agent Cooperation via Unsupervised Learning of Joint Intentions
Shanqi Liu
Weiwei Liu
Wenzhou Chen
Guanzhong Tian
Y. Liu
187
0
0
05 Jul 2023
TVDO: Tchebycheff Value-Decomposition Optimization for Multi-Agent Reinforcement Learning
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Xiao Hu
P. Guo
Chuanwei Zhou
Tong Zhang
Zhen Cui
159
0
0
24 Jun 2023
Boosting Value Decomposition via Unit-Wise Attentive State Representation for Cooperative Multi-Agent Reinforcement Learning
Qingpeng Zhao
Yuanyang Zhu
Zichuan Liu
Zhi Wang
Chunlin Chen
OffRL
169
2
0
12 May 2023
Heterogeneous-Agent Reinforcement Learning
Yifan Zhong
J. Kuba
Xidong Feng
Siyi Hu
Jiaming Ji
Yaodong Yang
216
103
0
19 Apr 2023
Multi-agent Policy Reciprocity with Theoretical Guarantee
Haozhi Wang
Yinchuan Li
Qing Wang
Yunfeng Shao
Jianye Hao
201
1
0
12 Apr 2023
Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning
Conference on Uncertainty in Artificial Intelligence (UAI), 2023
Xutong Zhao
Yangchen Pan
Chenjun Xiao
Sarath Chandar
Janarthanan Rajendran
277
9
0
16 Mar 2023
MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization
Adaptive Agents and Multi-Agent Systems (AAMAS), 2023
Yongsheng Mei
Hanhan Zhou
Tian-Shing Lan
Guru Venkataramani
Peng Wei
399
46
0
21 Feb 2023
Adaptive Value Decomposition with Greedy Marginal Contribution Computation for Cooperative Multi-Agent Reinforcement Learning
Adaptive Agents and Multi-Agent Systems (AAMAS), 2023
Shanqi Liu
Yujing Hu
Runze Wu
Dongxian Xing
Yu Xiong
Changjie Fan
Kun Kuang
Y. Liu
103
1
0
14 Feb 2023
ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning
Yongsheng Mei
Hanhan Zhou
Tian-Shing Lan
259
12
0
11 Feb 2023
Hierarchical Strategies for Cooperative Multi-Agent Reinforcement Learning
M. Ibrahim
Ammar Fayad
138
1
0
14 Dec 2022
What is the Solution for State-Adversarial Multi-Agent Reinforcement Learning?
Songyang Han
Sanbao Su
Sihong He
Shuo Han
Haizhao Yang
Shaofeng Zou
Fei Miao
AAML
556
33
0
06 Dec 2022
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency
AAAI Conference on Artificial Intelligence (AAAI), 2022
Chuming Li
Jie Liu
Yinmin Zhang
Yuhong Wei
Yazhe Niu
Yaodong Yang
Y. Liu
Wanli Ouyang
215
33
0
29 Nov 2022
Credit-cognisant reinforcement learning for multi-agent cooperation
F. Bredell
H. A. Engelbrecht
J. C. Schoeman
95
0
0
18 Nov 2022
RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
Wei Qiu
Xiao Ma
Bo An
S. Obraztsova
Shuicheng Yan
Zhongwen Xu
211
2
0
18 Oct 2022
Collisionless Pattern Discovery in Robot Swarms Using Deep Reinforcement Learning
Nelson Sharma
A. Ghosh
R. Misra
S. Mukhopadhyay
Gokarna Sharma
84
1
0
20 Sep 2022
Off-Beat Multi-Agent Reinforcement Learning
Adaptive Agents and Multi-Agent Systems (AAMAS), 2022
Wei Qiu
Weixun Wang
Rongpin Wang
Bo An
Yujing Hu
S. Obraztsova
Zinovi Rabinovich
Jianye Hao
Yingfeng Chen
Changjie Fan
OffRL
183
2
0
27 May 2022
LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Mingyu Yang
Jian Zhao
Xu Hu
Wen-gang Zhou
Jiangcheng Zhu
Houqiang Li
272
49
0
05 May 2022
CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning
IEEE Transactions on Games (IEEE Trans. Games), 2022
Jian Zhao
Xu Hu
Mingyu Yang
Wen-gang Zhou
Jiangcheng Zhu
Houqiang Li
OffRL
126
26
0
16 Mar 2022
Breaking the Curse of Dimensionality in Multiagent State Space: A Unified Agent Permutation Framework
Xiaotian Hao
Hangyu Mao
Weixun Wang
Yaodong Yang
Dong Li
Yan Zheng
Zhen Wang
Jianye Hao
LRM
200
10
0
10 Mar 2022
Autonomous Drone Swarm Navigation and Multi-target Tracking in 3D Environments with Dynamic Obstacles
IEEE Access, 2022
Suleman Qamar
Dr. Saddam Hussain Khan
Muhammad Arif Arshad
Maryam Qamar
Asifullah Khan
145
25
0
13 Feb 2022
Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization
Jian Zhao
Yue Zhang
Xu Hu
Weixun Wang
Wen-gang Zhou
Jianye Hao
Jiangcheng Zhu
Houqiang Li
173
5
0
09 Feb 2022
LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning
International Conference on Learning Representations (ICLR), 2021
D. Mguni
Taher Jafferjee
Jianhong Wang
Oliver Slumbers
Nicolas Perez Nieves
Feifei Tong
Yang Li
Jiangcheng Zhu
Yaodong Yang
Jun Wang
292
19
0
05 Dec 2021
Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration
Neural Information Processing Systems (NeurIPS), 2021
Lu Zheng
Jiarui Chen
Jianhao Wang
Jiamin He
Yujing Hu
Yingfeng Chen
Changjie Fan
Yang Gao
Chongjie Zhang
141
111
0
22 Nov 2021
Regularize! Don't Mix: Multi-Agent Reinforcement Learning without Explicit Centralized Structures
Chapman Siu
Jason M. Traish
R. Xu
OffRL
110
0
0
19 Sep 2021
Learning to Swarm with Knowledge-Based Neural Ordinary Differential Equations
IEEE International Conference on Robotics and Automation (ICRA), 2021
Tom Z. Jiahao
Lishuo Pan
M. A. Hsieh
231
12
0
10 Sep 2021
Mis-spoke or mis-lead: Achieving Robustness in Multi-Agent Communicative Reinforcement Learning
Adaptive Agents and Multi-Agent Systems (AAMAS), 2021
Wanqi Xue
Wei Qiu
Bo An
Zinovi Rabinovich
S. Obraztsova
C. Yeo
AAML
296
41
0
09 Aug 2021
Mean-Field Multi-Agent Reinforcement Learning: A Decentralized Network Approach
Mathematics of Operations Research (MOR), 2021
Haotian Gu
Xin Guo
Xiaoli Wei
Renyuan Xu
OOD
284
43
0
05 Aug 2021
Policy Regularization via Noisy Advantage Values for Cooperative Multi-agent Actor-Critic methods
Jian Hu
Siyue Hu
Shih-Wei Liao
656
21
0
27 Jun 2021
Celebrating Diversity in Shared Multi-Agent Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2021
Chenghao Li
Tonghan Wang
Chengjie Wu
Qianchuan Zhao
Jun Yang
Chongjie Zhang
191
180
0
04 Jun 2021
Reinforcement Learning using Guided Observability
Stephan Weigand
Pascal Klink
Jan Peters
Joni Pajarinen
OffRL
120
5
0
22 Apr 2021
RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement Learning Agents
Neural Information Processing Systems (NeurIPS), 2021
Wei Qiu
Xinrun Wang
Runsheng Yu
Xu He
Rongpin Wang
Bo An
S. Obraztsova
Zinovi Rabinovich
162
58
0
16 Feb 2021
Rethinking the Implementation Tricks and Monotonicity Constraint in Cooperative Multi-Agent Reinforcement Learning
Jian Hu
Siyang Jiang
Seth Austin Harding
Haibin Wu
Shihua Liao
685
104
0
06 Feb 2021
Multi-agent navigation based on deep reinforcement learning and traditional pathfinding algorithm
Hong Qiu
AI4CE
107
8
0
05 Dec 2020
Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge?
Christian Schroeder de Witt
Tarun Gupta
Denys Makoviichuk
Viktor Makoviychuk
Juil Sock
Mingfei Sun
Shimon Whiteson
257
472
0
18 Nov 2020
FireCommander: An Interactive, Probabilistic Multi-agent Environment for Heterogeneous Robot Teams
Esmaeil Seraj
Xiyang Wu
Matthew C. Gombolay
AI4CE
266
11
0
31 Oct 2020
BGC: Multi-Agent Group Belief with Graph Clustering
Tianze Zhou
Fubiao Zhang
Pan Tang
Chenfei Wang
290
2
0
20 Aug 2020