A Learning Framework For Cooperative Collision Avoidance of UAV Swarms Leveraging Domain Knowledge

Shuangyao Huang
Haibo Zhang
Zhiyi Huang
Main: 7 Pages
6 Figures
Bibliography: 1 Page
2 Tables
Appendix: 1 Page
Abstract

This paper presents a multi-agent reinforcement learning (MARL) framework for cooperative collision avoidance of UAV swarms that leverages a reward driven by domain knowledge. The reward is derived from knowledge in the domain of image processing and approximates contours on a two-dimensional field. Because obstacles are modeled as maxima on the field, collisions are inherently avoided, as contours never pass through peaks or intersect. Additionally, contours are smooth and energy-efficient. Our framework enables training with large swarm sizes because agent interaction is minimized and the need for the complex credit assignment schemes or observation sharing mechanisms of state-of-the-art MARL approaches is eliminated. Moreover, through intensive training, UAVs acquire the ability to adapt to complex environments where contours may be non-viable or non-existent. Extensive experiments are conducted to evaluate the performance of our framework against state-of-the-art MARL algorithms.
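
The contour idea can be illustrated with a minimal sketch. It is not the paper's reward: the Gaussian obstacle field, the names `field` and `contour_reward`, the `sigma` parameter, and the per-UAV target `level` are all assumptions made here for illustration. Each obstacle is a peak of a scalar field, and a UAV is penalized by how far the field value at its position deviates from its assigned contour level.

```python
import math

def field(x, y, obstacles, sigma=1.0):
    """Scalar field with one Gaussian peak (a local maximum) per obstacle.
    The Gaussian shape is an illustrative assumption, not taken from the paper."""
    return sum(
        math.exp(-((x - ox) ** 2 + (y - oy) ** 2) / (2.0 * sigma ** 2))
        for ox, oy in obstacles
    )

def contour_reward(pos, level, obstacles, sigma=1.0):
    """Hypothetical reward: zero on the assigned contour, negative off it."""
    return -abs(field(pos[0], pos[1], obstacles, sigma) - level)

# One obstacle at the origin; the UAV is assigned the contour at level 0.5.
obstacles = [(0.0, 0.0)]
level = 0.5

# For a single Gaussian peak with sigma = 1, the level-0.5 contour is a
# circle of radius sqrt(-2 * ln(0.5)) around the obstacle.
radius = math.sqrt(-2.0 * math.log(level))

on_contour = contour_reward((radius, 0.0), level, obstacles)
at_obstacle = contour_reward((0.0, 0.0), level, obstacles)
```

In this sketch a position on the assigned contour scores higher than the obstacle location itself, and UAVs given distinct levels would follow distinct level sets of the same field, which cannot cross.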
