26
6
v1v2 (latest)

Communication-Efficient Distributed Estimator for Generalized Linear Models with a Diverging Number of Covariates

Abstract

Distributed statistical inference has recently attracted immense attention. The asymptotic efficiency of the maximum likelihood estimator (MLE), the one-step MLE, and the aggregated estimating equation estimator are established for generalized linear models under the "large nn, diverging pnp_n" framework, where the dimension of the covariates pnp_n grows to infinity at a polynomial rate o(nα)o(n^\alpha) for some 0<α<10<\alpha<1. Then a novel method is proposed to obtain an asymptotically efficient estimator for large-scale distributed data by two rounds of communication. In this novel method, the assumption on the number of servers is more relaxed and thus practical for real-world applications. Simulations and a case study demonstrate the satisfactory finite-sample performance of the proposed estimators.

View on arXiv
Comments on this paper