362
v1v2v3 (latest)

FedP2^2EFT: Federated Learning to Personalize PEFT for Multilingual LLMs

Main:7 Pages
16 Figures
Bibliography:3 Pages
19 Tables
Appendix:21 Pages
Abstract

Federated learning (FL) has enabled the training of multilingual large language models (LLMs) on diverse and decentralized multilingual data, especially on low-resource languages. To improve client-specific performance, personalization via the use of parameter-efficient fine-tuning (PEFT) modules such as LoRA is common. This involves a personalization strategy (PS), such as the design of the PEFT adapter structures (e.g., in which layers to add LoRAs and what ranks) and choice of hyperparameters (e.g., learning rates) for fine-tuning. Instead of manual PS configuration, we propose FedP2^2EFT, a federated learning-to-personalize method for multilingual LLMs in cross-device FL settings. Unlike most existing PEFT structure selection methods, which are prone to overfitting low-data regimes, FedP2^2EFT collaboratively learns the optimal personalized PEFT structure for each client via Bayesian sparse rank selection. Evaluations on both simulated and real-world multilingual FL benchmarks demonstrate that FedP2^2EFT largely outperforms existing personalized fine-tuning methods, while complementing other existing FL methods.

View on arXiv
Comments on this paper