ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.07839
  4. Cited By
Is Local SGD Better than Minibatch SGD?
v1v2 (latest)

Is Local SGD Better than Minibatch SGD?

International Conference on Machine Learning (ICML), 2020
18 February 2020
Blake E. Woodworth
Kumar Kshitij Patel
Sebastian U. Stich
Zhen Dai
Brian Bullins
H. B. McMahan
Ohad Shamir
Nathan Srebro
    FedML
ArXiv (abs)PDFHTML

Papers citing "Is Local SGD Better than Minibatch SGD?"

50 / 156 papers shown
On the Convergence and Stability of Distributed Sub-model Training
On the Convergence and Stability of Distributed Sub-model Training
Yuyang Deng
Fuli Qiao
M. Mahdavi
FedML
225
0
0
08 Nov 2025
Personalized Collaborative Learning with Affinity-Based Variance Reduction
Personalized Collaborative Learning with Affinity-Based Variance Reduction
Chenyu Zhang
Navid Azizan
130
0
0
17 Oct 2025
FedMuon: Federated Learning with Bias-corrected LMO-based Optimization
FedMuon: Federated Learning with Bias-corrected LMO-based Optimization
Yuki Takezawa
Anastasia Koloskova
Xiaowen Jiang
Sebastian U. Stich
163
0
0
30 Sep 2025
MAUI: Reconstructing Private Client Data in Federated Transfer Learning
MAUI: Reconstructing Private Client Data in Federated Transfer Learning
Ahaan Dabholkar
Atul Sharma
Z. Berkay Celik
S. Bagchi
150
0
0
14 Sep 2025
Understanding Outer Optimizers in Local SGD: Learning Rates, Momentum, and Acceleration
Understanding Outer Optimizers in Local SGD: Learning Rates, Momentum, and Acceleration
Ahmed Khaled
Satyen Kale
Arthur Douillard
Chi Jin
Rob Fergus
Manzil Zaheer
213
2
0
12 Sep 2025
Constant Stepsize Local GD for Logistic Regression: Acceleration by Instability
Constant Stepsize Local GD for Logistic Regression: Acceleration by Instability
M. Crawshaw
Blake Woodworth
Mingrui Liu
238
1
0
16 Jun 2025
Sharp Gaussian approximations for Decentralized Federated Learning
Sharp Gaussian approximations for Decentralized Federated Learning
Soham Bonnerjee
Sayar Karmakar
Wei Biao Wu
FedML
360
0
0
12 May 2025
Heterogeneity Matters even More in Distributed Learning: Study from Generalization Perspective
Heterogeneity Matters even More in Distributed Learning: Study from Generalization Perspective
Masoud Kavian
Romain Chor
Milad Sefidgaran
Abdellatif Zaidi
FedML
420
2
0
03 Mar 2025
Addressing Label Shift in Distributed Learning via Entropy Regularization
Addressing Label Shift in Distributed Learning via Entropy RegularizationInternational Conference on Learning Representations (ICLR), 2025
Zhiyuan Wu
Changkyu Choi
Xiangcheng Cao
Volkan Cevher
Ali Ramezani-Kebrya
386
0
0
04 Feb 2025
Revisiting LocalSGD and SCAFFOLD: Improved Rates and Missing Analysis
Revisiting LocalSGD and SCAFFOLD: Improved Rates and Missing AnalysisInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Ruichen Luo
Sebastian U Stich
Samuel Horváth
Martin Takáč
538
2
0
08 Jan 2025
Characterizing the Accuracy-Communication-Privacy Trade-off in Distributed Stochastic Convex OptimizationInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Sudeep Salgia
Nikola Pavlovic
Yuejie Chi
Qing Zhao
299
0
0
06 Jan 2025
A Unified Analysis of Federated Learning with Arbitrary Client Participation
A Unified Analysis of Federated Learning with Arbitrary Client ParticipationNeural Information Processing Systems (NeurIPS), 2022
Maroun Touma
Mingyue Ji
FedML
676
80
0
31 Dec 2024
MARINA-P: Superior Performance in Non-smooth Federated Optimization with Adaptive Stepsizes
Igor Sokolov
Peter Richtárik
324
1
0
22 Dec 2024
Task Arithmetic Through The Lens Of One-Shot Federated Learning
Task Arithmetic Through The Lens Of One-Shot Federated Learning
Zhixu Tao
I. Mason
Sanjeev R. Kulkarni
Xavier Boix
MoMeFedML
539
9
0
27 Nov 2024
An End-to-End Real-World Camera Imaging PipelineACM Multimedia (MM), 2024
Gang He
Zijia Ma
Kepeng Xu
Gang He
Yunsong Li
Wenxin Yu
Taichu Han
Cheng Yang
442
11
0
16 Nov 2024
SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data
  Parallelism for LLM Training
SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM TrainingNeural Information Processing Systems (NeurIPS), 2024
Jinda Jia
Cong Xie
Hanlin Lu
Daoce Wang
Hao Feng
...
Baixi Sun
Yanghua Peng
Zhi-Li Zhang
Xin Liu
Dingwen Tao
MQ
305
12
0
20 Oct 2024
Randomized Asymmetric Chain of LoRA: The First Meaningful Theoretical
  Framework for Low-Rank Adaptation
Randomized Asymmetric Chain of LoRA: The First Meaningful Theoretical Framework for Low-Rank Adaptation
Grigory Malinovsky
Umberto Michieli
Hasan Hammoud
Taha Ceritli
Hayder Elesedy
Mete Ozay
Peter Richtárik
AI4CE
293
13
0
10 Oct 2024
Communication-Efficient Federated Group Distributionally Robust
  Optimization
Communication-Efficient Federated Group Distributionally Robust OptimizationNeural Information Processing Systems (NeurIPS), 2024
Zhishuai Guo
Tianbao Yang
FedML
372
1
0
08 Oct 2024
MindFlayer SGD: Efficient Parallel SGD in the Presence of Heterogeneous and Random Worker Compute Times
MindFlayer SGD: Efficient Parallel SGD in the Presence of Heterogeneous and Random Worker Compute TimesConference on Uncertainty in Artificial Intelligence (UAI), 2024
Artavazd Maranjyan
Omar Shaikh Omar
Peter Richtárik
293
4
0
05 Oct 2024
Does Worst-Performing Agent Lead the Pack? Analyzing Agent Dynamics in
  Unified Distributed SGD
Does Worst-Performing Agent Lead the Pack? Analyzing Agent Dynamics in Unified Distributed SGDNeural Information Processing Systems (NeurIPS), 2024
Jie Hu
Yi-Ting Ma
Do Young Eun
FedML
347
2
0
26 Sep 2024
Federated Frank-Wolfe Algorithm
Federated Frank-Wolfe Algorithm
Ali Dadras
Sourasekhar Banerjee
Karthik Prakhya
A. Yurtsever
FedML
237
5
0
19 Aug 2024
A New Theoretical Perspective on Data Heterogeneity in Federated
  Optimization
A New Theoretical Perspective on Data Heterogeneity in Federated Optimization
Jiayi Wang
Maroun Touma
Rong-Rong Chen
Mingyue Ji
FedML
245
3
0
22 Jul 2024
Accelerating Distributed Optimization: A Primal-Dual Perspective on
  Local Steps
Accelerating Distributed Optimization: A Primal-Dual Perspective on Local Steps
Junchi Yang
Murat Yildirim
Qiu Feng
459
1
0
02 Jul 2024
Communication-Efficient Adaptive Batch Size Strategies for Distributed
  Local Gradient Methods
Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods
Tim Tsz-Kit Lau
Weijian Li
Chenwei Xu
Han Liu
Mladen Kolar
336
3
0
20 Jun 2024
Distributed Stochastic Gradient Descent with Staleness: A Stochastic Delay Differential Equation Based Framework
Distributed Stochastic Gradient Descent with Staleness: A Stochastic Delay Differential Equation Based Framework
Siyuan Yu
Wei Chen
H. V. Poor
386
0
0
17 Jun 2024
The Limits and Potentials of Local SGD for Distributed Heterogeneous
  Learning with Intermittent Communication
The Limits and Potentials of Local SGD for Distributed Heterogeneous Learning with Intermittent Communication
Kumar Kshitij Patel
Margalit Glasgow
Ali Zindari
Lingxiao Wang
Sebastian U. Stich
Ziheng Cheng
Nirmit Joshi
Nathan Srebro
285
11
0
19 May 2024
Improved Generalization Bounds for Communication Efficient Federated
  Learning
Improved Generalization Bounds for Communication Efficient Federated Learning
Peyman Gholami
H. Seferoglu
FedMLAI4CE
408
6
0
17 Apr 2024
Communication-Efficient Large-Scale Distributed Deep Learning: A
  Comprehensive Survey
Communication-Efficient Large-Scale Distributed Deep Learning: A Comprehensive Survey
Feng Liang
Zhen Zhang
Haifeng Lu
Victor C. M. Leung
Yanyi Guo
Xiping Hu
GNN
363
26
0
09 Apr 2024
The Effectiveness of Local Updates for Decentralized Learning under Data
  Heterogeneity
The Effectiveness of Local Updates for Decentralized Learning under Data HeterogeneityIEEE Transactions on Signal Processing (IEEE TSP), 2024
Tongle Wu
Ying Sun
203
6
0
23 Mar 2024
On the Convergence of Federated Learning Algorithms without Data
  Similarity
On the Convergence of Federated Learning Algorithms without Data Similarity
Ali Beikmohammadi
Sarit Khirirat
Sindri Magnússon
FedML
332
6
0
29 Feb 2024
Decomposable Submodular Maximization in Federated Setting
Decomposable Submodular Maximization in Federated Setting
Akbar Rafiey
FedML
419
4
0
31 Jan 2024
High Confidence Level Inference is Almost Free using Parallel Stochastic
  Optimization
High Confidence Level Inference is Almost Free using Parallel Stochastic Optimization
Wanrong Zhu
Zhipeng Lou
Ziyang Wei
Wei Biao Wu
247
4
0
17 Jan 2024
Differentially Private Low-Rank Adaptation of Large Language Model Using
  Federated Learning
Differentially Private Low-Rank Adaptation of Large Language Model Using Federated LearningACM Transactions on Management Information Systems (ACM TMIS), 2023
Xiao-Yang Liu
Rongyi Zhu
Daochen Zha
Jiechao Gao
Shan Zhong
Matt White
Yijia Zhao
294
59
0
29 Dec 2023
A New Random Reshuffling Method for Nonsmooth Nonconvex Finite-sum Optimization
A New Random Reshuffling Method for Nonsmooth Nonconvex Finite-sum Optimization
Junwen Qiu
Xiao Li
Andre Milzarek
630
5
0
02 Dec 2023
Federated Online and Bandit Convex Optimization
Federated Online and Bandit Convex OptimizationInternational Conference on Machine Learning (ICML), 2023
Kumar Kshitij Patel
Lingxiao Wang
Aadirupa Saha
Nathan Srebro
FedML
295
9
0
29 Nov 2023
Asynchronous SGD on Graphs: a Unified Framework for Asynchronous
  Decentralized and Federated Optimization
Asynchronous SGD on Graphs: a Unified Framework for Asynchronous Decentralized and Federated OptimizationInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Mathieu Even
Anastasia Koloskova
Laurent Massoulié
FedML
359
17
0
01 Nov 2023
High-probability Convergence Bounds for Nonlinear Stochastic Gradient
  Descent Under Heavy-tailed Noise
High-probability Convergence Bounds for Nonlinear Stochastic Gradient Descent Under Heavy-tailed Noise
Aleksandar Armacki
Pranay Sharma
Gauri Joshi
Dragana Bajović
D. Jakovetić
S. Kar
730
10
0
28 Oct 2023
A Quadratic Synchronization Rule for Distributed Deep Learning
A Quadratic Synchronization Rule for Distributed Deep LearningInternational Conference on Learning Representations (ICLR), 2023
Xinran Gu
Kaifeng Lyu
Sanjeev Arora
Jingzhao Zhang
Longbo Huang
305
4
0
22 Oct 2023
Utilizing Free Clients in Federated Learning for Focused Model
  Enhancement
Utilizing Free Clients in Federated Learning for Focused Model Enhancement
Aditya Narayan Ravi
Ilan Shomorony
FedML
281
0
0
06 Oct 2023
Minibatch and Local SGD: Algorithmic Stability and Linear Speedup in Generalization
Minibatch and Local SGD: Algorithmic Stability and Linear Speedup in Generalization
Yunwen Lei
Tao Sun
Mingrui Liu
473
4
0
02 Oct 2023
FedLALR: Client-Specific Adaptive Learning Rates Achieve Linear Speedup
  for Non-IID Data
FedLALR: Client-Specific Adaptive Learning Rates Achieve Linear Speedup for Non-IID Data
Hao Sun
Li Shen
Shi-Yong Chen
Jingwei Sun
Jing Li
Guangzhong Sun
Dacheng Tao
FedML
215
2
0
18 Sep 2023
FedGKD: Unleashing the Power of Collaboration in Federated Graph Neural
  Networks
FedGKD: Unleashing the Power of Collaboration in Federated Graph Neural Networks
Qiying Pan
Ruofan Wu
Tengfei Liu
Tianyi Zhang
Yifei Zhu
Weiqiang Wang
FedML
308
10
0
18 Sep 2023
FedSOL: Stabilized Orthogonal Learning with Proximal Restrictions in
  Federated Learning
FedSOL: Stabilized Orthogonal Learning with Proximal Restrictions in Federated LearningComputer Vision and Pattern Recognition (CVPR), 2023
Gihun Lee
Minchan Jeong
Sangmook Kim
Jaehoon Oh
Se-Young Yun
FedML
375
16
0
24 Aug 2023
Noise Balance and Stationary Distribution of Stochastic Gradient Descent
Noise Balance and Stationary Distribution of Stochastic Gradient DescentPhysical Review E (PRE), 2023
Liu Ziyin
Hongchao Li
Masakuni Ueda
246
9
0
13 Aug 2023
Efficient Federated Learning via Local Adaptive Amended Optimizer with
  Linear Speedup
Efficient Federated Learning via Local Adaptive Amended Optimizer with Linear SpeedupIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yan Sun
Li Shen
Hao Sun
Liang Ding
Dacheng Tao
FedML
178
31
0
30 Jul 2023
On the Computation-Communication Trade-Off with A Flexible Gradient
  Tracking Approach
On the Computation-Communication Trade-Off with A Flexible Gradient Tracking ApproachIEEE Conference on Decision and Control (CDC), 2023
Yan Huang
Jinming Xu
241
7
0
12 Jun 2023
Communication-Efficient Gradient Descent-Accent Methods for Distributed
  Variational Inequalities: Unified Analysis and Local Updates
Communication-Efficient Gradient Descent-Accent Methods for Distributed Variational Inequalities: Unified Analysis and Local UpdatesInternational Conference on Learning Representations (ICLR), 2023
Siqi Zhang
S. Choudhury
Sebastian U. Stich
Nicolas Loizou
FedML
510
9
0
08 Jun 2023
Improving Accelerated Federated Learning with Compression and Importance
  Sampling
Improving Accelerated Federated Learning with Compression and Importance Sampling
Michal Grudzieñ
Grigory Malinovsky
Peter Richtárik
FedML
287
12
0
05 Jun 2023
Federated Composite Saddle Point Optimization
Federated Composite Saddle Point Optimization
Site Bai
Brian Bullins
FedML
178
0
0
25 May 2023
Dynamic Regularized Sharpness Aware Minimization in Federated Learning:
  Approaching Global Consistency and Smooth Landscape
Dynamic Regularized Sharpness Aware Minimization in Federated Learning: Approaching Global Consistency and Smooth LandscapeInternational Conference on Machine Learning (ICML), 2023
Yan Sun
Li Shen
Shi-Yong Chen
Liang Ding
Dacheng Tao
FedML
329
64
0
19 May 2023
1234
Next
Page 1 of 4