ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.04838
  4. Cited By
Optimization Methods for Large-Scale Machine Learning
v1v2v3 (latest)

Optimization Methods for Large-Scale Machine Learning

15 June 2016
Léon Bottou
Frank E. Curtis
J. Nocedal
ArXiv (abs)PDFHTML

Papers citing "Optimization Methods for Large-Scale Machine Learning"

50 / 1,490 papers shown
PORTAL: Controllable Landscape Generator for Continuous Optimization-Part I: Framework
PORTAL: Controllable Landscape Generator for Continuous Optimization-Part I: Framework
D. Yazdani
Mai Peng
Delaram Yazdani
Shima F. Yazdi
M. Omidvar
Yuan Sun
Trung Thanh Nguyen
Changhe Li
Xiaodong Li
56
1
0
29 Nov 2025
Accelerating Wireless Distributed Learning via Hybrid Split and Federated Learning Optimization
Accelerating Wireless Distributed Learning via Hybrid Split and Federated Learning Optimization
Kun Guo
X. Li
Xijun Wang
Howard H. Yang
W. Feng
Tony Q.S. Quek
FedML
312
1
0
25 Nov 2025
Gradient Descent Algorithm Survey
Gradient Descent Algorithm Survey
Deng Fucheng
Wang Wanjie
Gong Ao
Wang Xiaoqi
Wang Fan
ODL
212
0
0
25 Nov 2025
Designing Preconditioners for SGD: Local Conditioning, Noise Floors, and Basin Stability
Designing Preconditioners for SGD: Local Conditioning, Noise Floors, and Basin Stability
Mitchell Scott
Tianshi Xu
Z. Tang
Alexandra Pichette-Emmons
Qiang Ye
Y. Saad
Yuanzhe Xi
AI4CE
263
1
0
24 Nov 2025
CrossJEPA: Cross-Modal Joint-Embedding Predictive Architecture for Efficient 3D Representation Learning from 2D Images
CrossJEPA: Cross-Modal Joint-Embedding Predictive Architecture for Efficient 3D Representation Learning from 2D Images
Avishka Perera
Kumal Hewagamage
Saeedha Nazar
Kavishka Abeywardana
Hasitha Gallella
Ranga Rodrigo
Mohamed Afham
3DV
215
0
0
23 Nov 2025
OpenCML: End-to-End Framework of Open-world Machine Learning to Learn Unknown Classes Incrementally
OpenCML: End-to-End Framework of Open-world Machine Learning to Learn Unknown Classes Incrementally
Jitendra Parmar
Praveen Singh Thakur
CLLVLM
233
0
0
23 Nov 2025
Stable Coresets via Posterior Sampling: Aligning Induced and Full Loss Landscapes
Stable Coresets via Posterior Sampling: Aligning Induced and Full Loss Landscapes
Wei-Kai Chang
Rajiv Khanna
202
0
0
21 Nov 2025
Belief Net: A Filter-Based Framework for Learning Hidden Markov Models from Observations
Belief Net: A Filter-Based Framework for Learning Hidden Markov Models from Observations
Reginald Zhiyan Chen
Heng-Sheng Chang
P. Mehta
72
0
0
13 Nov 2025
Linear Gradient Prediction with Control Variates
Linear Gradient Prediction with Control Variates
K. Ciosek
Nicolò Felicioni
Juan Elenter Litwin
131
0
0
07 Nov 2025
Superpositional Gradient Descent: Harnessing Quantum Principles for Model Training
Superpositional Gradient Descent: Harnessing Quantum Principles for Model Training
Ahmet Erdem Pamuk
Emir Kaan Özdemir
Şuayp Talha Kocabay
88
0
0
01 Nov 2025
Exploring Landscapes for Better Minima along Valleys
Exploring Landscapes for Better Minima along Valleys
Tong Zhao
Jiacheng Li
Yuanchang Zhou
Guangming Tan
Weile Jia
102
0
0
31 Oct 2025
What Really Matters in Matrix-Whitening Optimizers?
What Really Matters in Matrix-Whitening Optimizers?
Kevin Frans
Pieter Abbeel
Sergey Levine
130
2
0
28 Oct 2025
SHA-256 Infused Embedding-Driven Generative Modeling of High-Energy Molecules in Low-Data Regimes
SHA-256 Infused Embedding-Driven Generative Modeling of High-Energy Molecules in Low-Data Regimes
Siddharth Verma
Alankar Alankar
130
0
0
28 Oct 2025
Self-induced stochastic resonance: A physics-informed machine learning approach
Self-induced stochastic resonance: A physics-informed machine learning approach
Divyesh Savaliya
Marius E. Yamakou
71
0
0
26 Oct 2025
Convergence Analysis of SGD under Expected Smoothness
Convergence Analysis of SGD under Expected Smoothness
Yuta Kawamoto
Hideaki Iiduka
152
0
0
23 Oct 2025
Statistical Inference for Linear Functionals of Online Least-squares SGD when $t \gtrsim d^{1+δ}$
Statistical Inference for Linear Functionals of Online Least-squares SGD when t≳d1+δt \gtrsim d^{1+δ}t≳d1+δ
Bhavya Agrawalla
Krishnakumar Balasubramanian
Promit Ghosal
91
0
0
22 Oct 2025
Geometric Convergence Analysis of Variational Inference via Bregman Divergences
Geometric Convergence Analysis of Variational Inference via Bregman Divergences
Sushil Bohara
Amedeo Roberto Esposito
119
0
0
17 Oct 2025
Exploring the Synergy of Quantitative Factors and Newsflow Representations from Large Language Models for Stock Return Prediction
Exploring the Synergy of Quantitative Factors and Newsflow Representations from Large Language Models for Stock Return Prediction
Tian Guo
E. Hauptmann
AIFinAI4TS
275
0
0
17 Oct 2025
Accelerated stochastic first-order method for convex optimization under heavy-tailed noise
Accelerated stochastic first-order method for convex optimization under heavy-tailed noise
Chuan He
Zhaosong Lu
101
1
0
13 Oct 2025
Bridging the Physics-Data Gap with FNO-Guided Conditional Flow Matching: Designing Inductive Bias through Hierarchical Physical Constraints
Bridging the Physics-Data Gap with FNO-Guided Conditional Flow Matching: Designing Inductive Bias through Hierarchical Physical Constraints
Tsuyoshi Okita
AI4CE
157
0
0
09 Oct 2025
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
Ruoyao Xiao
H. Zhang
Seungju Han
Sheng Liu
Jianwen Xie
Yu Zhang
Yejin Choi
James Zou
Pan Lu
AIFin
173
3
0
07 Oct 2025
Quantitative Convergence Analysis of Projected Stochastic Gradient Descent for Non-Convex Losses via the Goldstein Subdifferential
Quantitative Convergence Analysis of Projected Stochastic Gradient Descent for Non-Convex Losses via the Goldstein Subdifferential
Yuping Zheng
Andrew G. Lamperski
182
0
0
03 Oct 2025
Topological Invariance and Breakdown in Learning
Topological Invariance and Breakdown in Learning
Yongyi Yang
Tomaso Poggio
Isaac Chuang
Liu Ziyin
134
0
0
03 Oct 2025
In-memory Training on Analog Devices with Limited Conductance States via Multi-tile Residual Learning
In-memory Training on Analog Devices with Limited Conductance States via Multi-tile Residual Learning
Jindan Li
Zhaoxian Wu
Gaowen Liu
Tayfun Gokmen
Tianyi Chen
105
1
0
02 Oct 2025
Non-Euclidean Broximal Point Method: A Blueprint for Geometry-Aware Optimization
Non-Euclidean Broximal Point Method: A Blueprint for Geometry-Aware Optimization
Kaja Gruntkowska
Peter Richtárik
175
2
0
01 Oct 2025
Random Feature Spiking Neural Networks
Random Feature Spiking Neural Networks
Maximilian Gollwitzer
Felix Dietrich
151
0
0
01 Oct 2025
CurES: From Gradient Analysis to Efficient Curriculum Learning for Reasoning LLMs
CurES: From Gradient Analysis to Efficient Curriculum Learning for Reasoning LLMs
Yongcheng Zeng
Guoqing Liu
Bokai Ji
Erxue Min
Hengyi Cai
Shuaiqiang Wang
Dawei Yin
Haifeng Zhang
Xu Chen
Jun Wang
LRM
120
0
0
01 Oct 2025
Approximately Unimodal Likelihood Models for Ordinal Regression
Approximately Unimodal Likelihood Models for Ordinal RegressionIEEE Transactions on Knowledge and Data Engineering (TKDE), 2025
Ryoya Yamasaki
68
0
0
30 Sep 2025
TAP: Two-Stage Adaptive Personalization of Multi-Task and Multi-Modal Foundation Models in Federated Learning
TAP: Two-Stage Adaptive Personalization of Multi-Task and Multi-Modal Foundation Models in Federated Learning
Seohyun Lee
Wenzhi Fang
Dong-Jun Han
Seyyedali Hosseinalipour
Christopher G. Brinton
118
0
0
30 Sep 2025
Conda: Column-Normalized Adam for Training Large Language Models Faster
Conda: Column-Normalized Adam for Training Large Language Models Faster
Junjie Wang
Pan Zhou
Yiming Dong
Huan Li
Jia Li
Xun Zhou
Qicheng Lao
Cong Fang
Zhouchen Lin
AI4CE
254
0
0
29 Sep 2025
Deep Learning as the Disciplined Construction of Tame Objects
Deep Learning as the Disciplined Construction of Tame Objects
Gilles Bareilles
Allen Gehret
Johannes Aspman
Jana Lepšová
Jakub Mareˇcek
86
0
0
22 Sep 2025
Graph Coloring for Multi-Task Learning
Graph Coloring for Multi-Task Learning
Santosh Patapati
284
0
0
21 Sep 2025
Federated Learning with Ad-hoc Adapter Insertions: The Case of Soft-Embeddings for Training Classifier-as-Retriever
Federated Learning with Ad-hoc Adapter Insertions: The Case of Soft-Embeddings for Training Classifier-as-Retriever
Marijan Fofonjka
Shahryar Zehtabi
Alireza Behtash
Tyler Mauer
David Stout
FedML
145
0
0
20 Sep 2025
Generalization and Optimization of SGD with Lookahead
Generalization and Optimization of SGD with Lookahead
Kangcheng Li
Yunwen Lei
MLT
90
0
0
19 Sep 2025
Accelerated Gradient Methods with Biased Gradient Estimates: Risk Sensitivity, High-Probability Guarantees, and Large Deviation Bounds
Accelerated Gradient Methods with Biased Gradient Estimates: Risk Sensitivity, High-Probability Guarantees, and Large Deviation Bounds
Mert Gurbuzbalaban
Yasa Syed
Necdet Serhat Aybat
215
0
0
17 Sep 2025
MAPGD: Multi-Agent Prompt Gradient Descent for Collaborative Prompt Optimization
MAPGD: Multi-Agent Prompt Gradient Descent for Collaborative Prompt Optimization
Yichen Han
Bojun Liu
Zhengpeng Zhou
Zhengpeng Zhou
Zeng Zhang
...
Wenli Wang
Isaac Shi
Lewei He
Lewei He
Tianyu Shi
LLMAGAI4CE
206
1
0
14 Sep 2025
Convergence Rate in Nonlinear Two-Time-Scale Stochastic Approximation with State (Time)-Dependence
Convergence Rate in Nonlinear Two-Time-Scale Stochastic Approximation with State (Time)-DependenceAAAI Conference on Artificial Intelligence (AAAI), 2025
Zixi Chen
Yumin Xu
Ruixun Zhang
150
3
0
14 Sep 2025
Balancing Utility and Privacy: Dynamically Private SGD with Random Projection
Balancing Utility and Privacy: Dynamically Private SGD with Random Projection
Zhanhong Jiang
Md Zahid Hasan
Nastaran Saadati
Aditya Balu
Chao Liu
Soumik Sarkar
200
0
0
11 Sep 2025
Theoretical Analysis on how Learning Rate Warmup Accelerates Convergence
Theoretical Analysis on how Learning Rate Warmup Accelerates Convergence
Yuxing Liu
Yuze Ge
Rui Pan
An Kang
Tong Zhang
AI4CE
171
2
0
09 Sep 2025
BULL-ODE: Bullwhip Learning with Neural ODEs and Universal Differential Equations under Stochastic Demand
BULL-ODE: Bullwhip Learning with Neural ODEs and Universal Differential Equations under Stochastic Demand
Nachiket N. Naik
Prathamesh Dinesh Joshi
Raj Abhijit Dandekar
Rajat Dandekar
Sreedath Panat
84
0
0
09 Sep 2025
Distributed Deep Learning using Stochastic Gradient Staleness
Distributed Deep Learning using Stochastic Gradient Staleness
Viet Hoang Pham
Hyo-Sung Ahn
85
0
0
06 Sep 2025
Delayed Momentum Aggregation: Communication-efficient Byzantine-robust Federated Learning with Partial Participation
Delayed Momentum Aggregation: Communication-efficient Byzantine-robust Federated Learning with Partial Participation
Kaoru Otsuka
Yuki Takezawa
Makoto Yamada
FedML
151
1
0
03 Sep 2025
Insights from Gradient Dynamics: Gradient Autoscaled Normalization
Insights from Gradient Dynamics: Gradient Autoscaled Normalization
Vincent-Daniel Yun
226
0
0
03 Sep 2025
VASSO: Variance Suppression for Sharpness-Aware Minimization
Bingcong Li
Yilang Zhang
G. Giannakis
276
1
0
02 Sep 2025
Fast Convergence Rates for Subsampled Natural Gradient Algorithms on Quadratic Model Problems
Fast Convergence Rates for Subsampled Natural Gradient Algorithms on Quadratic Model Problems
Gil Goldshlager
Jiang Hu
Lin Lin
121
0
0
28 Aug 2025
Clustering-based Feature Representation Learning for Oracle Bone Inscriptions Detection
Clustering-based Feature Representation Learning for Oracle Bone Inscriptions Detection
Ye Tao
Xinran Fu
Honglin Pang
Xi Yang
Chuntao Li
87
3
0
26 Aug 2025
Learning with springs and sticks
Learning with springs and sticks
Luis Mantilla Calderón
Alán Aspuru-Guzik
155
0
0
26 Aug 2025
HierCVAE: Hierarchical Attention-Driven Conditional Variational Autoencoders for Multi-Scale Temporal Modeling
HierCVAE: Hierarchical Attention-Driven Conditional Variational Autoencoders for Multi-Scale Temporal Modeling
Yao Wu
BDL
58
0
0
26 Aug 2025
Stochastic Gradient Descent with Strategic Querying
Stochastic Gradient Descent with Strategic Querying
Nanfei Jiang
Hoi-To Wai
M. Alizadeh
127
0
0
23 Aug 2025
Escaping Saddle Points via Curvature-Calibrated Perturbations: A Complete Analysis with Explicit Constants and Empirical Validation
Escaping Saddle Points via Curvature-Calibrated Perturbations: A Complete Analysis with Explicit Constants and Empirical Validation
Faruk Alpay
Hamdi Alakkad
153
0
0
22 Aug 2025
1234...282930
Next
Page 1 of 30
Pageof 30