40
8

DADAO: Decoupled Accelerated Decentralized Asynchronous Optimization

Abstract

This work introduces DADAO: the first decentralized, accelerated, asynchronous, primal, first-order algorithm to minimize a sum of LL-smooth and μ\mu-strongly convex functions distributed over a given network of size nn. Our key insight is based on modeling the local gradient updates and gossip communication procedures with separate independent Poisson Point Processes. This allows us to decouple the computation and communication steps, which can be run in parallel, while making the whole approach completely asynchronous. This leads to communication acceleration compared to synchronous approaches. Our new method employs primal gradients and does not use a multi-consensus inner loop nor other ad-hoc mechanisms such as Error Feedback, Gradient Tracking, or a Proximal operator. By relating the inverse of the smallest positive eigenvalue of the Laplacian matrix χ1\chi_1 and the maximal resistance χ2χ1\chi_2\leq \chi_1 of the graph to a sufficient minimal communication rate between the nodes of the network, we show that our algorithm requires O(nLμlog(1ϵ))\mathcal{O}(n\sqrt{\frac{L}{\mu}}\log(\frac{1}{\epsilon})) local gradients and only O(nχ1χ2Lμlog(1ϵ))\mathcal{O}(n\sqrt{\chi_1\chi_2}\sqrt{\frac{L}{\mu}}\log(\frac{1}{\epsilon})) communications to reach a precision ϵ\epsilon, up to logarithmic terms. Thus, we simultaneously obtain an accelerated rate for both computations and communications, leading to an improvement over state-of-the-art works, our simulations further validating the strength of our relatively unconstrained method.

View on arXiv
Comments on this paper