We propose a decentralized, learning-based framework for dynamic coalition formation in Multi-Robot Task Allocation (MRTA). Our approach extends Multi-Agent Proximal Policy Optimization (MAPPO) by integrating spatial action maps, robot motion planning, intention sharing, and task allocation revision to enable effective and adaptive coalition formation. Extensive simulation studies confirm the effectiveness of our model, enabling each robot to rely solely on local information to learn timely revisions of task selections and form coalitions with other robots to complete collaborative tasks. Additionally, our model significantly outperforms existing methods, including a market-based baseline. Furthermore, we evaluate the scalability and generalizability of the proposed framework, highlighting its ability to handle large robot populations and adapt to scenarios featuring diverse task sets.
View on arXiv@article{bezerra2025_2412.20397, title={ Learning Policies for Dynamic Coalition Formation in Multi-Robot Task Allocation }, author={ Lucas C. D. Bezerra and Ataíde M. G. dos Santos and Shinkyu Park }, journal={arXiv preprint arXiv:2412.20397}, year={ 2025 } }