A Learning Framework For Cooperative Collision Avoidance of UAV Swarms Leveraging Domain Knowledge

15 July 2025

Shuangyao Huang

Haibo Zhang

Zhiyi Huang

ArXiv (abs)PDF HTML

Main:7 Pages

6 Figures

Bibliography:1 Pages

2 Tables

Appendix:1 Pages

Abstract

This paper presents a multi-agent reinforcement learning (MARL) framework for cooperative collision avoidance of UAV swarms leveraging domain knowledge-driven reward. The reward is derived from knowledge in the domain of image processing, approximating contours on a two-dimensional field. By modeling obstacles as maxima on the field, collisions are inherently avoided as contours never go through peaks or intersect. Additionally, counters are smooth and energy-efficient. Our framework enables training with large swarm sizes as the agent interaction is minimized and the need for complex credit assignment schemes or observation sharing mechanisms in state-of-the-art MARL approaches are eliminated. Moreover, UAVs obtain the ability to adapt to complex environments where contours may be non-viable or non-existent through intensive training. Extensive experiments are conducted to evaluate the performances of our framework against state-of-the-art MARL algorithms.

View on arXiv

Comments on this paper