441
v1v2 (latest)

FedSVD: Adaptive Orthogonalization for Private Federated Learning with LoRA

Main:10 Pages
5 Figures
Bibliography:4 Pages
11 Tables
Appendix:11 Pages
Abstract

Low-Rank Adaptation (LoRA), which introduces a product of two trainable low-rank matrices into frozen pre-trained weights, is widely used for efficient fine-tuning of language models in federated learning (FL). However, when combined with differentially private stochastic gradient descent (DP-SGD), LoRA faces substantial noise amplification: DP-SGD perturbs per-sample gradients, and the matrix multiplication of the LoRA update (BABA) intensifies this effect. Freezing one matrix (e.g., AA) reduces the noise but restricts model expressiveness, often resulting in suboptimal adaptation. To address this, we propose FedSVD\texttt{FedSVD}, a simple yet effective method that introduces a global reparameterization based on singular value decomposition (SVD). In our approach, each client optimizes only the BB matrix and transmits it to the server. The server aggregates the BB matrices, computes the product BABA using the previous AA, and refactorizes the result via SVD. This yields a new adaptive AA composed of the orthonormal right singular vectors of BABA, and an updated BB containing the remaining SVD components. This reparameterization avoids quadratic noise amplification, while allowing AA to better capture the principal directions of the aggregate updates. Moreover, the orthonormal structure of AA bounds the gradient norms of BB and preserves more signal under DP-SGD, as confirmed by our theoretical analysis. As a result, FedSVD\texttt{FedSVD} consistently improves stability and performance across a variety of privacy settings and benchmarks, outperforming relevant baselines under both private and non-private regimes.

View on arXiv
Comments on this paper