109

Asynchronous COMID: the theoretic basis for sparse gradient tricks on Parameter Server

Abstract

Asynchronous FTRL and L2L2 norm done at server are two widely used tricks to improve training efficiency, but their convergences are not well-proved. In this paper, we propose asynchronous COMID algorithm and prove its convergence. Then, we establish the equivalence between asynchronous COMID and the above two tricks. Thus, the convergences of above two tricks are also proved. Experimental results show asynchronous COMID reduces the burden of the network without any harm on the convergence speed and final output.

View on arXiv
Comments on this paper