arXiv:1702.06269
Memory and Communication Efficient Distributed Stochastic Optimization with Minibatch-Prox
Annual Conference Computational Learning Theory (COLT), 2017
21 February 2017
Jialei Wang
Weiran Wang
Nathan Srebro
Papers citing
"Memory and Communication Efficient Distributed Stochastic Optimization with Minibatch-Prox"
33 papers
Memory-Constrained Algorithms for Convex Optimization via Recursive Cutting-Planes
Neural Information Processing Systems (NeurIPS), 2023
Moise Blanchard
Junhui Zhang
Patrick Jaillet
16 Jun 2023
Sharper Analysis for Minibatch Stochastic Proximal Point Methods: Stability, Smoothness, and Deviation
Journal of Machine Learning Research (JMLR), 2023
Xiao-Tong Yuan
P. Li
09 Jan 2023
Uniform Stability for First-Order Empirical Risk Minimization
Annual Conference Computational Learning Theory (COLT), 2022
Amit Attia
Tomer Koren
17 Jul 2022
On Convergence of FedProx: Local Dissimilarity Invariant Bounds, Non-smoothness and Beyond
Neural Information Processing Systems (NeurIPS), 2022
Xiao-Tong Yuan
P. Li
10 Jun 2022
Minibatch and Momentum Model-based Methods for Stochastic Weakly Convex Optimization
Neural Information Processing Systems (NeurIPS), 2021
Qi Deng
Wenzhi Gao
06 Jun 2021
Algorithmic Instabilities of Accelerated Gradient Descent
Neural Information Processing Systems (NeurIPS), 2021
Amit Attia
Tomer Koren
03 Feb 2021
The Min-Max Complexity of Distributed Stochastic Convex Optimization with Intermittent Communication
Annual Conference Computational Learning Theory (COLT), 2021
Blake E. Woodworth
Brian Bullins
Ohad Shamir
Nathan Srebro
02 Feb 2021
Inverse Multiobjective Optimization Through Online Learning
Chaosheng Dong
Yijia Wang
Bo Zeng
12 Oct 2020
Adaptive Periodic Averaging: A Practical Approach to Reducing Communication in Distributed Learning
Peng Jiang
G. Agrawal
13 Jul 2020
Stochastic Proximal Gradient Algorithm with Minibatches. Application to Large Scale Learning Models
A. Pătraşcu
C. Paduraru
Paul Irofti
30 Mar 2020
Is Local SGD Better than Minibatch SGD?
International Conference on Machine Learning (ICML), 2020
Blake E. Woodworth
Kumar Kshitij Patel
Sebastian U. Stich
Zhen Dai
Brian Bullins
H. B. McMahan
Ohad Shamir
Nathan Srebro
18 Feb 2020
Parallel Restarted SPIDER -- Communication Efficient Distributed Nonconvex Optimization with Optimal Computation Complexity
Pranay Sharma
Swatantra Kafle
Prashant Khanduri
Saikiran Bulusu
K. Rajawat
P. Varshney
12 Dec 2019
Least Squares Approximation for a Distributed System
Journal of Computational and Graphical Statistics (JCGS), 2019
Xuening Zhu
Feng Li
Hansheng Wang
14 Aug 2019
On Convergence of Distributed Approximate Newton Methods: Globalization, Sharper Bounds and Beyond
Journal of Machine Learning Research (JMLR), 2019
Xiao-Tong Yuan
Ping Li
06 Aug 2019
Communication-Efficient Accurate Statistical Estimation
Jianqing Fan
Yongyi Guo
Kaizheng Wang
12 Jun 2019
Convergence of Distributed Stochastic Variance Reduced Methods without Sampling Extra Data
IEEE Transactions on Signal Processing (IEEE Trans. Signal Process.), 2019
Shicong Cen
Huishuai Zhang
Yuejie Chi
Wei-neng Chen
Tie-Yan Liu
29 May 2019
A Distributed Hierarchical SGD Algorithm with Sparse Global Reduction
Fan Zhou
Guojing Cong
12 Mar 2019
Stochastic Approximation of Smooth and Strongly Convex Functions: Beyond the O(1/T) Convergence Rate
Lijun Zhang
Zhi Zhou
27 Jan 2019
ASVRG: Accelerated Proximal SVRG
Fanhua Shang
L. Jiao
Kaiwen Zhou
James Cheng
Yan Ren
Yufei Jin
07 Oct 2018
Generalized Inverse Optimization through Online Learning
Chaosheng Dong
Yiran Chen
Bo Zeng
03 Oct 2018
Don't Use Large Mini-Batches, Use Local SGD
Tao Lin
Sebastian U. Stich
Kumar Kshitij Patel
Martin Jaggi
22 Aug 2018
COLA: Decentralized Linear Learning
Lie He
An Bian
Martin Jaggi
13 Aug 2018
Robust Implicit Backpropagation
Francois Fagan
G. Iyengar
07 Aug 2018
The Effect of Network Width on the Performance of Large-batch Training
Lingjiao Chen
Hongyi Wang
Jinman Zhao
Dimitris Papailiopoulos
Paraschos Koutris
11 Jun 2018
Graph Oracle Models, Lower Bounds, and Gaps for Parallel Stochastic Optimization
Blake E. Woodworth
Jialei Wang
Adam D. Smith
H. B. McMahan
Nathan Srebro
25 May 2018
Double Quantization for Communication-Efficient Distributed Optimization
Yue Yu
Jiaxiang Wu
Longbo Huang
25 May 2018
Distributed Stochastic Optimization via Adaptive SGD
Ashok Cutkosky
R. Busa-Fekete
16 Feb 2018
Distributed Stochastic Multi-Task Learning with Graph Regularization
Weiran Wang
Jialei Wang
Mladen Kolar
Nathan Srebro
11 Feb 2018
Gradient Sparsification for Communication-Efficient Distributed Optimization
Neural Information Processing Systems (NeurIPS), 2017
Jianqiao Wangni
Jialei Wang
Ji Liu
Tong Zhang
26 Oct 2017
Stochastic Nonconvex Optimization with Large Minibatches
Weiran Wang
Nathan Srebro
25 Sep 2017
On the convergence properties of a K-step averaging stochastic gradient descent algorithm for nonconvex optimization
Fan Zhou
Guojing Cong
03 Aug 2017
Improved Optimization of Finite Sums with Minibatch Stochastic Variance Reduced Proximal Iterations
Jialei Wang
Tong Zhang
21 Jun 2017
Gradient Diversity: a Key Ingredient for Scalable Distributed Learning
Dong Yin
A. Pananjady
Max Lam
Dimitris Papailiopoulos
Kannan Ramchandran
Peter L. Bartlett
18 Jun 2017