Pareto-guided Pipeline for Distilling Featherweight AI Agents in Mobile MOBA Games

7 February 2026

Xionghui Yang

Bozhou Chen

Yunlong Lu

Yongyi Wang

Lingfeng Li

Lanxiao Huang

Lin Liu

Wenjun Wang

Meng Meng

Xia Lin

Wenxin Li

ArXiv (abs)PDF HTML Github

Main:8 Pages

9 Figures

Bibliography:1 Pages

14 Tables

Appendix:8 Pages

Abstract

Recent advances in game AI have demonstrated the feasibility of training agents that surpass top-tier human professionals in complex environments such as Honor of Kings (HoK), a leading mobile multiplayer online battle arena (MOBA) game. However, deploying such powerful agents on mobile devices remains a major challenge. On one hand, the intricate multi-modal state representation and hierarchical action space of HoK demand large, sophisticated policy networks that are inherently difficult to compress into lightweight forms. On the other hand, production deployment requires high-frequency inference under strict energy and latency constraints on mobile platform. To the best of our knowledge, bridging large-scale game AI and practical on-device deployment has not been systematically studied. In this work, we propose a Pareto optimality guided pipeline and design a high-efficiency student architecture search space tailored for mobile execution, enabling systematic exploration of the trade-off between performance and efficiency. Experimental results demonstrate that the distilled model achieves remarkable efficiency, including an $12.4\times$ faster inference speed (under 0.5ms per frame) and a $15.6\times$ improvement in energy efficiency (under 0.5mAh per game), while retaining a 40.32% win rate against the original teacher model.

View on arXiv

Comments on this paper