Hessian based analysis of SGD for Deep Nets: Dynamics and Generalization

Hessian based analysis of SGD for Deep Nets: Dynamics and Generalization

SDM (SDM), 2019
    ODL

Papers citing "Hessian based analysis of SGD for Deep Nets: Dynamics and Generalization"

30 / 30 papers shown
Title
On the Overlooked Structure of Stochastic Gradients
On the Overlooked Structure of Stochastic GradientsNeural Information Processing Systems (NeurIPS), 2022
168
7
0
05 Dec 2022
FIT: A Metric for Model Sensitivity
FIT: A Metric for Model SensitivityInternational Conference on Learning Representations (ICLR), 2022
131
9
0
16 Oct 2022
Large Scale Private Learning via Low-rank Reparametrization
Large Scale Private Learning via Low-rank ReparametrizationInternational Conference on Machine Learning (ICML), 2021
180
114
0
17 Jun 2021
Privately Learning Subspaces
Privately Learning SubspacesNeural Information Processing Systems (NeurIPS), 2021
257
22
0
28 May 2021
Empirically explaining SGD from a line search perspective
Empirically explaining SGD from a line search perspectiveInternational Conference on Artificial Neural Networks (ICANN), 2021
140
4
0
31 Mar 2021
Hessian Eigenspectra of More Realistic Nonlinear Models
Hessian Eigenspectra of More Realistic Nonlinear ModelsNeural Information Processing Systems (NeurIPS), 2021
169
37
0
02 Mar 2021
Gradient Descent on Neural Networks Typically Occurs at the Edge of
  Stability
Gradient Descent on Neural Networks Typically Occurs at the Edge of StabilityInternational Conference on Learning Representations (ICLR), 2021
253
325
0
26 Feb 2021
Provable Super-Convergence with a Large Cyclical Learning Rate
Provable Super-Convergence with a Large Cyclical Learning RateIEEE Signal Processing Letters (IEEE SPL), 2021
113
13
0
22 Feb 2021
Bypassing the Ambient Dimension: Private SGD with Gradient Subspace
  Identification
Bypassing the Ambient Dimension: Private SGD with Gradient Subspace IdentificationInternational Conference on Learning Representations (ICLR), 2020
143
116
0
07 Jul 2020
Dissecting Non-Vacuous Generalization Bounds based on the Mean-Field
  Approximation
Dissecting Non-Vacuous Generalization Bounds based on the Mean-Field ApproximationInternational Conference on Machine Learning (ICML), 2019
126
8
0
06 Sep 2019

We use cookies and other tracking technologies to improve your browsing experience on our website, to show you personalized content and targeted ads, to analyze our website traffic, and to understand where our visitors are coming from. See our policy.