406
v1v2v3v4 (latest)

An Operator Splitting View of Federated Learning

Main:10 Pages
13 Figures
Bibliography:3 Pages
3 Tables
Appendix:10 Pages
Abstract

Over the past few years, the federated learning (FL\texttt{FL}) community has witnessed a proliferation of new FL\texttt{FL} algorithms. However, our understating of the theory of FL\texttt{FL} is still fragmented, and a thorough, formal comparison of these algorithms remains elusive. Motivated by this gap, we show that many of the existing FL\texttt{FL} algorithms can be understood from an operator splitting point of view. This unification allows us to compare different algorithms with ease, to refine previous convergence results and to uncover new algorithmic variants. In particular, our analysis reveals the vital role played by the step size in FL\texttt{FL} algorithms. The unification also leads to a streamlined and economic way to accelerate FL\texttt{FL} algorithms, without incurring any communication overhead. We perform numerical experiments on both convex and nonconvex models to validate our findings.

View on arXiv
Comments on this paper