
Large Language Model Partitioning for Low-Latency Inference at the Edge. International Symposium on Modeling and Optimization in Mobile, Ad-Hoc and Wireless Networks (WiOpt), 2025.

A Hybrid Swarm Intelligence Approach for Optimizing Multimodal Large Language Models Deployment in Edge-Cloud-based Federated Learning Environments. Computer Communications (Comput. Commun.), 2025.