Correlation Aware Sparsified Mean Estimation Using Random Projection

Abstract

We study the problem of communication-efficient distributed vector mean estimation, a commonly used subroutine in distributed optimization and Federated Learning (FL). Rand-$k$ sparsification is a commonly used technique to reduce communication cost, where each client sends $k < d$ of its $d$ coordinates to the server. However, Rand-$k$ is agnostic to any correlations that might exist between clients in practical scenarios. The recently proposed Rand-$k$-Spatial estimator leverages the cross-client correlation information at the server to improve Rand-$k$'s performance. Yet, the performance of Rand-$k$-Spatial is suboptimal. We propose the Rand-Proj-Spatial estimator with a more flexible encoding-decoding procedure, which generalizes the encoding of Rand-$k$ by projecting the client vectors to a random $k$-dimensional subspace. We utilize the Subsampled Randomized Hadamard Transform (SRHT) as the projection matrix and show that Rand-Proj-Spatial with SRHT outperforms Rand-$k$-Spatial, using the correlation information more efficiently. Furthermore, we propose an approach to incorporate varying degrees of correlation and suggest a practical variant of Rand-Proj-Spatial for when the correlation information is not available to the server. Experiments on real-world distributed optimization tasks showcase the superior performance of Rand-Proj-Spatial compared to Rand-$k$-Spatial and other more sophisticated sparsification techniques.
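
To make the baseline concrete, the following is a minimal sketch of plain Rand-$k$ mean estimation as described above: each client keeps $k$ of its $d$ coordinates, chosen uniformly at random, and the server rescales by $d/k$ and averages. The function names (`rand_k_encode`, `rand_k_decode`, `estimate_mean`) and the $d/k$ rescaling used for unbiasedness are illustrative assumptions, not code from the paper, and the sketch does not implement Rand-$k$-Spatial or Rand-Proj-Spatial.

```python
import random


def rand_k_encode(x, k, rng):
    """Client side (hypothetical helper): keep k of the d coordinates,
    chosen uniformly at random without replacement."""
    idx = rng.sample(range(len(x)), k)
    return idx, [x[j] for j in idx]


def rand_k_decode(messages, d, k):
    """Server side (hypothetical helper): rescale each received coordinate
    by d/k, so the estimate is unbiased, then average over the n clients."""
    n = len(messages)
    est = [0.0] * d
    for idx, vals in messages:
        for j, v in zip(idx, vals):
            est[j] += (d / k) * v / n
    return est


def estimate_mean(vectors, k, seed=0):
    """Run one round of Rand-k over a list of equal-length client vectors."""
    rng = random.Random(seed)
    d = len(vectors[0])
    messages = [rand_k_encode(x, k, rng) for x in vectors]
    return rand_k_decode(messages, d, k)


if __name__ == "__main__":
    clients = [[1.0, 2.0, 3.0, 4.0], [2.0, 2.0, 2.0, 2.0]]
    print(estimate_mean(clients, k=2))
```

With $k = d$ every coordinate is sent and the estimate coincides with the exact mean; for $k < d$ each client contributes only $k$ nonzero entries, which is the source of the communication saving and of the extra variance that correlation-aware decoders aim to reduce.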
