Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1606.04838
Cited By
v1
v2
v3 (latest)
Optimization Methods for Large-Scale Machine Learning
15 June 2016
Léon Bottou
Frank E. Curtis
J. Nocedal
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Optimization Methods for Large-Scale Machine Learning"
50 / 1,490 papers shown
Towards Exact Gradient-based Training on Analog In-memory Computing
Neural Information Processing Systems (NeurIPS), 2024
Zhaoxian Wu
Tayfun Gokmen
Malte J. Rasch
Tianyi Chen
294
6
0
18 Jun 2024
Generative vs. Discriminative modeling under the lens of uncertainty quantification
Elouan Argouarc'h
François Desbouvries
Eric Barat
Eiji Kawasaki
UQCV
216
1
0
13 Jun 2024
Loss Gradient Gaussian Width based Generalization and Optimization Guarantees
A. Banerjee
Qiaobo Li
Yingxue Zhou
442
0
0
11 Jun 2024
A Generalized Version of Chung's Lemma and its Applications
Li Jiang
Xiao Li
Andre Milzarek
Junwen Qiu
224
1
0
09 Jun 2024
Provable Complexity Improvement of AdaGrad over SGD: Upper and Lower Bounds in Stochastic Non-Convex Optimization
Annual Conference Computational Learning Theory (COLT), 2024
Devyani Maladkar
Ruichen Jiang
Aryan Mokhtari
421
6
0
07 Jun 2024
Efficient Data-Parallel Continual Learning with Asynchronous Distributed Rehearsal Buffers
Thomas Bouvier
Bogdan Nicolae
Hugo Chaugier
Alexandru Costan
Ian Foster
Gabriel Antoniu
198
2
0
05 Jun 2024
Demystifying SGD with Doubly Stochastic Gradients
Kyurae Kim
Joohwan Ko
Yian Ma
Jacob R. Gardner
401
2
0
03 Jun 2024
Privacy-Aware Randomized Quantization via Linear Programming
Zhongteng Cai
Xueru Zhang
Mohammad Mahdi Khalili
378
2
0
01 Jun 2024
Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation
Shangding Gu
Laixi Shi
Yuhao Ding
Alois Knoll
C. Spanos
Adam Wierman
Ming Jin
OffRL
288
7
0
31 May 2024
Symmetries in Overparametrized Neural Networks: A Mean-Field View
Javier Maass
Joaquin Fontbona
MLT
FedML
528
4
0
30 May 2024
A Pontryagin Perspective on Reinforcement Learning
Onno Eberhard
Claire Vernade
Michael Muehlebach
383
4
0
28 May 2024
WASH: Train your Ensemble with Communication-Efficient Weight Shuffling, then Average
Louis Fournier
Adel Nabli
Masih Aminbeidokhti
M. Pedersoli
Eugene Belilovsky
Edouard Oyallon
MoMe
FedML
340
7
0
27 May 2024
Derivatives of Stochastic Gradient Descent
F. Iutzeler
Edouard Pauwels
Samuel Vaiter
195
1
0
24 May 2024
Kronecker-Factored Approximate Curvature for Physics-Informed Neural Networks
Felix Dangel
Johannes Müller
Marius Zeinhofer
ODL
357
18
0
24 May 2024
Exact Gauss-Newton Optimization for Training Deep Neural Networks
Mikalai Korbit
Adeyemi Damilare Adeoye
Alberto Bemporad
Mario Zanon
ODL
301
6
0
23 May 2024
Thermodynamic Natural Gradient Descent
Kaelan Donatella
Samuel Duffield
Maxwell Aifer
Denis Melanson
Gavin Crooks
Patrick J. Coles
139
4
0
22 May 2024
Almost sure convergence rates of stochastic gradient methods under gradient domination
Simon Weissmann
Sara Klein
Waïss Azizian
Leif Döring
350
6
0
22 May 2024
Energy-Efficient Federated Edge Learning with Streaming Data: A Lyapunov Optimization Approach
Chung-Hsuan Hu
Zheng Chen
Erik G. Larsson
207
8
0
20 May 2024
Reinforcement learning
Florentin Wörgötter
628
2,932
0
16 May 2024
Minimisation of Polyak-Łojasewicz Functions Using Random Zeroth-Order Oracles
European Control Conference (ECC), 2024
Amir Ali Farzin
Iman Shames
150
3
0
15 May 2024
Robust Semi-supervised Learning by Wisely Leveraging Open-set Data
Yang Yang
Nan Jiang
Yi Tian Xu
De-Chuan Zhan
300
27
0
11 May 2024
Optimal Baseline Corrections for Off-Policy Contextual Bandits
ACM Conference on Recommender Systems (RecSys), 2024
Shashank Gupta
Olivier Jeunen
Harrie Oosterhuis
Maarten de Rijke
290
11
0
09 May 2024
Accelerating Legacy Numerical Solvers by Non-intrusive Gradient-based Meta-solving
International Conference on Machine Learning (ICML), 2024
S. Arisaka
Qianxiao Li
210
0
0
05 May 2024
A Full Adagrad algorithm with O(Nd) operations
Antoine Godichon-Baggioni
Wei Lu
Bruno Portier
ODL
291
0
0
03 May 2024
The Privacy Power of Correlated Noise in Decentralized Learning
Youssef Allouah
Anastasia Koloskova
Aymane El Firdoussi
Martin Jaggi
R. Guerraoui
274
18
0
02 May 2024
On the Relevance of Byzantine Robust Optimization Against Data Poisoning
Sadegh Farhadkhani
R. Guerraoui
Nirupam Gupta
Rafael Pinot
AAML
256
2
0
01 May 2024
IID Relaxation by Logical Expressivity: A Research Agenda for Fitting Logics to Neurosymbolic Requirements
M. Stol
Alessandra Mileo
207
1
0
30 Apr 2024
Advancing Supervised Learning with the Wave Loss Function: A Robust and Smooth Approach
M. Akhtar
Muhammad Tanveer
Mohd. Arshad
238
28
0
28 Apr 2024
Second-order Information Promotes Mini-Batch Robustness in Variance-Reduced Gradients
Sachin Garg
A. Berahas
Michal Dereziñski
232
2
0
23 Apr 2024
Rate Analysis of Coupled Distributed Stochastic Approximation for Misspecified Optimization
Yaqun Yang
Jinlong Lei
190
0
0
21 Apr 2024
FedMeS: Personalized Federated Continual Learning Leveraging Local Memory
Jingru Xie
Chenqi Zhu
Songze Li
FedML
CLL
262
1
0
19 Apr 2024
DACAD: Domain Adaptation Contrastive Learning for Anomaly Detection in Multivariate Time Series
Zahra Zamanzadeh Darban
Yiyuan Yang
Geoffrey I. Webb
Charu C. Aggarwal
Qingsong Wen
Xiaojun Jia
Mahsa Salehi
495
12
0
17 Apr 2024
I/O in Machine Learning Applications on HPC Systems: A 360-degree Survey
Noah Lewis
J. L. Bez
Suren Byna
538
4
0
16 Apr 2024
Minimizing Chebyshev Prototype Risk Magically Mitigates the Perils of Overfitting
Nathaniel R. Dean
Dilip Sarkar
AAML
207
0
0
10 Apr 2024
Unifying Low Dimensional Observations in Deep Learning Through the Deep Linear Unconstrained Feature Model
Connall Garrod
Jonathan P. Keating
356
10
0
09 Apr 2024
Stochastic Online Optimization for Cyber-Physical and Robotic Systems
Hao Ma
Melanie Zeilinger
Michael Muehlebach
233
3
0
08 Apr 2024
A Structure-Guided Gauss-Newton Method for Shallow ReLU Neural Network
Zhiqiang Cai
Tong Ding
Min Liu
Xinyu Liu
Jianlin Xia
869
2
0
07 Apr 2024
Optimal Batch Allocation for Wireless Federated Learning
IEEE Internet of Things Journal (IEEE IoT J.), 2024
Jaeyoung Song
Sang-Woon Jeon
211
1
0
03 Apr 2024
Satellite Federated Edge Learning: Architecture Design and Convergence Analysis
IEEE Transactions on Wireless Communications (IEEE TWC), 2024
Yuanming Shi
Li Zeng
Jingyang Zhu
Yong Zhou
Chunxiao Jiang
Khaled B. Letaief
246
25
0
02 Apr 2024
What Can Transformer Learn with Varying Depth? Case Studies on Sequence Learning Tasks
International Conference on Machine Learning (ICML), 2024
Xingwu Chen
Difan Zou
ViT
271
20
0
02 Apr 2024
DRIVE: Dual Gradient-Based Rapid Iterative Pruning
Dhananjay Saikumar
Blesson Varghese
215
3
0
01 Apr 2024
Convergence Guarantees for RMSProp and Adam in Generalized-smooth Non-convex Optimization with Affine Noise Variance
Qi Zhang
Yi Zhou
Shaofeng Zou
436
13
0
01 Apr 2024
Communication Efficient Distributed Training with Distributed Lion
Bo Liu
Lemeng Wu
Lizhang Chen
Kaizhao Liang
Jiaxu Zhu
Chen Liang
Raghuraman Krishnamoorthi
Qiang Liu
316
11
0
30 Mar 2024
HERTA: A High-Efficiency and Rigorous Training Algorithm for Unfolded Graph Neural Networks
Yongyi Yang
Jiaming Yang
Wei Hu
Michal Dereziñski
183
0
0
26 Mar 2024
Computational Approaches for Exponential-Family Factor Analysis
Liang Wang
Luis Carvalho
246
2
0
22 Mar 2024
AI and Memory Wall
A. Gholami
Z. Yao
Sehoon Kim
Coleman Hooper
Michael W. Mahoney
Kurt Keutzer
255
268
0
21 Mar 2024
PETScML: Second-order solvers for training regression problems in Scientific Machine Learning
Platform for Advanced Scientific Computing Conference (PASC), 2024
Stefano Zampini
Umberto Zerbinati
George Turkyyiah
David E. Keyes
225
6
0
18 Mar 2024
Nonsmooth Implicit Differentiation: Deterministic and Stochastic Convergence Rates
International Conference on Machine Learning (ICML), 2024
Riccardo Grazzi
Massimiliano Pontil
Saverio Salzo
344
3
0
18 Mar 2024
A Selective Review on Statistical Methods for Massive Data Computation: Distributed Computing, Subsampling, and Minibatch Techniques
Xuetong Li
Yuan Gao
Hong Chang
Danyang Huang
Yingying Ma
...
Ke Xu
Jing Zhou
Xuening Zhu
Yingqiu Zhu
Hansheng Wang
204
17
0
17 Mar 2024
Streamlining in the Riemannian Realm: Efficient Riemannian Optimization with Loopless Variance Reduction
Yury Demidovich
Grigory Malinovsky
Peter Richtárik
243
3
0
11 Mar 2024
Previous
1
2
3
4
5
6
...
28
29
30
Next
Page 5 of 30
Page
of 30
Go