ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.04838
  4. Cited By
Optimization Methods for Large-Scale Machine Learning
v1v2v3 (latest)

Optimization Methods for Large-Scale Machine Learning

15 June 2016
Léon Bottou
Frank E. Curtis
J. Nocedal
ArXiv (abs)PDFHTML

Papers citing "Optimization Methods for Large-Scale Machine Learning"

50 / 1,490 papers shown
Cooperative SGD with Dynamic Mixing Matrices
Cooperative SGD with Dynamic Mixing Matrices
Soumya Sarkar
Shweta Jain
171
0
0
20 Aug 2025
Domain-Generalization to Improve Learning in Meta-Learning Algorithms
Domain-Generalization to Improve Learning in Meta-Learning Algorithms
Usman Anjum
Chris Stockman
Cat Luong
J. Zhan
FedML
201
0
0
13 Aug 2025
A Spin Glass Characterization of Neural Networks
A Spin Glass Characterization of Neural Networks
Jun Li
103
0
0
10 Aug 2025
Why Does Stochastic Gradient Descent Slow Down in Low-Precision Training?
Why Does Stochastic Gradient Descent Slow Down in Low-Precision Training?
Vincent-Daniel Yun
MQ
198
0
0
10 Aug 2025
Cumulative Learning Rate Adaptation: Revisiting Path-Based Schedules for SGD and Adam
Cumulative Learning Rate Adaptation: Revisiting Path-Based Schedules for SGD and Adam
Asma Atamna
Tom Maus
Fabian Kievelitz
Tobias Glasmachers
68
0
0
07 Aug 2025
Compressed Decentralized Momentum Stochastic Gradient Methods for Nonconvex Optimization
Compressed Decentralized Momentum Stochastic Gradient Methods for Nonconvex Optimization
Wei Liu
Anweshit Panda
Ujwal Pandey
Christopher Brissette
Yikang Shen
George M. Slota
Naigang Wang
Jie Chen
Yangyang Xu
118
1
0
07 Aug 2025
Learning Latent Graph Geometry via Fixed-Point Schrödinger-Type Activation: A Theoretical Study
Learning Latent Graph Geometry via Fixed-Point Schrödinger-Type Activation: A Theoretical Study
Dmitry Pasechnyuk-Vilensky
Daniil Doroshenko
66
0
0
27 Jul 2025
The Price equation reveals a universal force-metric-bias law of algorithmic learning and natural selection
The Price equation reveals a universal force-metric-bias law of algorithmic learning and natural selection
Steven A. Frank
FedML
380
1
0
24 Jul 2025
Physics-aware Truck and Drone Delivery Planning Using Optimization & Machine Learning
Physics-aware Truck and Drone Delivery Planning Using Optimization & Machine Learning
Yineng Sun
Armin Fügenschuh
Vikrant Vaze
112
0
0
22 Jul 2025
Whom to Trust? Adaptive Collaboration in Personalized Federated Learning
Whom to Trust? Adaptive Collaboration in Personalized Federated Learning
Amr Abourayya
Jens Kleesiek
Bharat Rao
Michael Kamp
FedML
387
0
0
30 Jun 2025
Rethinking LLM Training through Information Geometry and Quantum Metrics
Rethinking LLM Training through Information Geometry and Quantum Metrics
Riccardo Di Sipio
172
0
0
18 Jun 2025
ImprovDML: Improved Trade-off in Private Byzantine-Resilient Distributed Machine Learning
ImprovDML: Improved Trade-off in Private Byzantine-Resilient Distributed Machine Learning
Bing Liu
Chengcheng Zhao
L. Chai
Peng Cheng
Yaonan Wang
FedML
163
0
0
18 Jun 2025
Adjusted Shuffling SARAH: Advancing Complexity Analysis via Dynamic Gradient Weighting
Adjusted Shuffling SARAH: Advancing Complexity Analysis via Dynamic Gradient Weighting
Duc Toan Nguyen
Trang H. Tran
Lam M. Nguyen
146
0
0
14 Jun 2025
Convergence of Momentum-Based Optimization Algorithms with Time-Varying Parameters
Convergence of Momentum-Based Optimization Algorithms with Time-Varying Parameters
Mathukumalli Vidyasagar
303
1
0
13 Jun 2025
Complexity of normalized stochastic first-order methods with momentum under heavy-tailed noise
Complexity of normalized stochastic first-order methods with momentum under heavy-tailed noise
Chuan He
Zhaosong Lu
Defeng Sun
Zhanwang Deng
175
10
0
12 Jun 2025
NDCG-Consistent Softmax Approximation with Accelerated Convergence
Yuanhao Pu
Defu Lian
Xiaolong Chen
Xu Huang
Jin Chen
Enhong Chen
181
0
0
11 Jun 2025
Neural Tangent Kernel Analysis to Probe Convergence in Physics-informed Neural Solvers: PIKANs vs. PINNs
Neural Tangent Kernel Analysis to Probe Convergence in Physics-informed Neural Solvers: PIKANs vs. PINNs
Salah A Faroughi
Farinaz Mostajeran
129
5
0
09 Jun 2025
A Stable Whitening Optimizer for Efficient Neural Network Training
A Stable Whitening Optimizer for Efficient Neural Network Training
Kevin Frans
Sergey Levine
Pieter Abbeel
383
3
0
08 Jun 2025
Purifying Shampoo: Investigating Shampoo's Heuristics by Decomposing its Preconditioner
Purifying Shampoo: Investigating Shampoo's Heuristics by Decomposing its Preconditioner
Runa Eschenhagen
Aaron Defazio
Tsung-Hsien Lee
Richard Turner
Hao-Jun Michael Shi
296
5
0
04 Jun 2025
Deformable registration and generative modelling of aortic anatomies by auto-decoders and neural ODEs
Deformable registration and generative modelling of aortic anatomies by auto-decoders and neural ODEs
Riccardo Tenderini
Luca Pegolotti
Fanwei Kong
S. Pagani
Francesco Regazzoni
Alison L. Marsden
S. Deparis
MedImAI4CE
185
0
0
01 Jun 2025
Rethinking Regularization Methods for Knowledge Graph Completion
Rethinking Regularization Methods for Knowledge Graph Completion
Linyu Li
Zhi Jin
Yuanpeng He
Dongming Jin
Haoran Duan
Zhengwei Tao
Xuan Zhang
Jiandong Li
OffRL
201
5
0
29 May 2025
Moment Expansions of the Energy Distance
Moment Expansions of the Energy Distance
Ian Langmore
188
0
0
27 May 2025
Stationary MMD Points for Cubature
Stationary MMD Points for Cubature
Zonghao Chen
Toni Karvonen
Heishiro Kanagawa
F. Briol
Chris J. Oates
301
1
0
27 May 2025
Empirical Investigation of Latent Representational Dynamics in Large Language Models: A Manifold Evolution Perspective
Empirical Investigation of Latent Representational Dynamics in Large Language Models: A Manifold Evolution Perspective
Yukun Zhang
Qi Dong
AI4CE
185
0
0
24 May 2025
Implicit Neural Shape Optimization for 3D High-Contrast Electrical Impedance Tomography
Implicit Neural Shape Optimization for 3D High-Contrast Electrical Impedance Tomography
Junqing Chen
Haibo Liu
405
0
0
22 May 2025
MDVT: Enhancing Multimodal Recommendation with Model-Agnostic Multimodal-Driven Virtual Triplets
MDVT: Enhancing Multimodal Recommendation with Model-Agnostic Multimodal-Driven Virtual Triplets
Jinfeng Xu
Zheyu Chen
Jinze Li
Shuo Yang
Hewei Wang
Yijie Li
Mengran Li
Puzhen Wu
Edith C. H. Ngai
189
10
0
22 May 2025
TranSUN: A Preemptive Paradigm to Eradicate Retransformation Bias Intrinsically from Regression Models in Recommender Systems
TranSUN: A Preemptive Paradigm to Eradicate Retransformation Bias Intrinsically from Regression Models in Recommender Systems
Jiahao Yu
Haozhuang Liu
Yeqiu Yang
Lu Chen
Wu Jian
Yuning Jiang
Bo Zheng
AI4TS
291
2
0
20 May 2025
Never Skip a Batch: Continuous Training of Temporal GNNs via Adaptive Pseudo-Supervision
Never Skip a Batch: Continuous Training of Temporal GNNs via Adaptive Pseudo-Supervision
Alexander Panyshev
Dmitry Vinichenko
Oleg Travkin
Roman Alferov
Alexey Zaytsev
269
0
0
18 May 2025
On the $O(\frac{\sqrt{d}}{K^{1/4}})$ Convergence Rate of AdamW Measured by $\ell_1$ Norm
On the O(dK1/4)O(\frac{\sqrt{d}}{K^{1/4}})O(K1/4d​​) Convergence Rate of AdamW Measured by ℓ1\ell_1ℓ1​ Norm
Huan Li
Yiming Dong
Zhouchen Lin
456
0
0
17 May 2025
Dynamic Perturbed Adaptive Method for Infinite Task-Conflicting Time Series
Dynamic Perturbed Adaptive Method for Infinite Task-Conflicting Time Series
Jiang You
Xiaozhen Wang
Arben Cela
AI4TS
210
0
0
17 May 2025
Sharp Gaussian approximations for Decentralized Federated Learning
Sharp Gaussian approximations for Decentralized Federated Learning
Soham Bonnerjee
Sayar Karmakar
Wei Biao Wu
FedML
327
0
0
12 May 2025
A stochastic gradient method for trilevel optimization
A stochastic gradient method for trilevel optimization
Tommaso Giovannelli
G. Kent
Luis Nunes Vicente
184
0
0
11 May 2025
Entropy-Guided Sampling of Flat Modes in Discrete Spaces
Entropy-Guided Sampling of Flat Modes in Discrete Spaces
Pinaki Mohanty
Riddhiman Bhattacharya
Ruqi Zhang
994
0
0
05 May 2025
Online Functional Principal Component Analysis on a Multidimensional Domain
Online Functional Principal Component Analysis on a Multidimensional Domain
Muye Nanshan
Nan Zhang
Jiguo Cao
176
0
0
04 May 2025
DHO$_2$: Accelerating Distributed Hybrid Order Optimization via Model Parallelism and ADMM
DHO2_22​: Accelerating Distributed Hybrid Order Optimization via Model Parallelism and ADMM
Shunxian Gu
Chaoqun You
Bangbang Ren
Lailong Luo
Junxu Xia
Deke Guo
229
0
0
02 May 2025
Ultra-fast feature learning for the training of two-layer neural networks in the two-timescale regime
Ultra-fast feature learning for the training of two-layer neural networks in the two-timescale regime
Raphael Barboni
Gabriel Peyré
François-Xavier Vialard
MLT
313
2
0
25 Apr 2025
TACO: Tackling Over-correction in Federated Learning with Tailored Adaptive Correction
TACO: Tackling Over-correction in Federated Learning with Tailored Adaptive CorrectionIEEE International Conference on Distributed Computing Systems (ICDCS), 2025
Weijie Liu
Ziwei Zhan
Carlee Joe-Wong
Edith Ngai
Jingpu Duan
Deke Guo
Xu Chen
Xinsong Zhang
FedML
400
1
0
24 Apr 2025
OptimAI: Optimization from Natural Language Using LLM-Powered AI Agents
OptimAI: Optimization from Natural Language Using LLM-Powered AI Agents
Raghav Thind
Youran Sun
Ling Liang
Haizhao Yang
LLMAG
548
9
0
23 Apr 2025
MetaMolGen: A Neural Graph Motif Generation Model for De Novo Molecular Design
MetaMolGen: A Neural Graph Motif Generation Model for De Novo Molecular Design
Zimo Yan
Jie Zhang
Zheng Xie
Chang-rui Liu
Wenshu Fan
Yiping Song
427
2
0
22 Apr 2025
AlphaGrad: Non-Linear Gradient Normalization Optimizer
AlphaGrad: Non-Linear Gradient Normalization Optimizer
Soham Sane
ODL
405
0
0
22 Apr 2025
Mixed-Precision Conjugate Gradient Solvers with RL-Driven Precision Tuning
Mixed-Precision Conjugate Gradient Solvers with RL-Driven Precision Tuning
Xinye Chen
275
0
0
19 Apr 2025
Matrix-free Second-order Optimization of Gaussian Splats with Residual Sampling
Matrix-free Second-order Optimization of Gaussian Splats with Residual Sampling
Hamza Pehlivan
Andrea Boscolo Camiletto
Lin Geng Foo
Marc Habermann
Christian Theobalt
3DGS
465
4
0
17 Apr 2025
Stochastic Gradient Descent in Non-Convex Problems: Asymptotic Convergence with Relaxed Step-Size via Stopping Time Methods
Stochastic Gradient Descent in Non-Convex Problems: Asymptotic Convergence with Relaxed Step-Size via Stopping Time Methods
Ruinan Jin
Difei Cheng
Hong Qiao
Xin Shi
Shaodong Liu
Bo Zhang
206
0
0
17 Apr 2025
Towards Weaker Variance Assumptions for Stochastic Optimization
Towards Weaker Variance Assumptions for Stochastic Optimization
Ahmet Alacaoglu
Yura Malitsky
Stephen J. Wright
220
6
0
14 Apr 2025
A Tale of Two Learning Algorithms: Multiple Stream Random Walk and Asynchronous Gossip
A Tale of Two Learning Algorithms: Multiple Stream Random Walk and Asynchronous Gossip
Peyman Gholami
H. Seferoglu
174
0
0
14 Apr 2025
A Piecewise Lyapunov Analysis of Sub-quadratic SGD: Applications to Robust and Quantile Regression
A Piecewise Lyapunov Analysis of Sub-quadratic SGD: Applications to Robust and Quantile RegressionMeasurement and Modeling of Computer Systems (SIGMETRICS), 2025
Yixuan Zhang
Dongyan
Yudong Chen
Qiaomin Xie
285
1
0
11 Apr 2025
Min-Max Optimisation for Nonconvex-Nonconcave Functions Using a Random Zeroth-Order Extragradient Algorithm
Min-Max Optimisation for Nonconvex-Nonconcave Functions Using a Random Zeroth-Order Extragradient Algorithm
Amir Ali Farzin
Yuen-Man Pun
Philipp Braun
Antoine Lesage-Landry
Youssef Diouane
Iman Shames
292
2
0
10 Apr 2025
ZIP: An Efficient Zeroth-order Prompt Tuning for Black-box Vision-Language Models
ZIP: An Efficient Zeroth-order Prompt Tuning for Black-box Vision-Language ModelsInternational Conference on Learning Representations (ICLR), 2025
Seonghwan Park
Jaehyeon Jeong
Yongjun Kim
Jaeho Lee
Namhoon Lee
VLM
309
5
0
09 Apr 2025
Decentralized Domain Generalization with Style Sharing: Formal Model and Convergence Analysis
Decentralized Domain Generalization with Style Sharing: Formal Model and Convergence Analysis
Shahryar Zehtabi
Dong-Jun Han
Seyyedali Hosseinalipour
Christopher G. Brinton
FedMLAI4CE
510
0
0
08 Apr 2025
Universal Collection of Euclidean Invariants between Pairs of Position-Orientations
Universal Collection of Euclidean Invariants between Pairs of Position-Orientations
Gijs Bellaard
B. Smets
R. Duits
364
0
0
04 Apr 2025
Previous
12345...282930
Next
Page 2 of 30
Pageof 30