Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.03830
Cited By
Stochastic Mirror Descent on Overparameterized Nonlinear Models: Convergence, Implicit Regularization, and Generalization
10 June 2019
Navid Azizan
Sahin Lale
B. Hassibi
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Stochastic Mirror Descent on Overparameterized Nonlinear Models: Convergence, Implicit Regularization, and Generalization"
15 / 15 papers shown
Title
Primal-dual algorithm for contextual stochastic combinatorial optimization
Louis Bouvier
Thibault Prunet
Vincent Leclère
Axel Parmentier
37
0
0
07 May 2025
Implicit Geometry of Next-token Prediction: From Language Sparsity Patterns to Model Representations
Yize Zhao
Tina Behnia
V. Vakilian
Christos Thrampoulidis
70
9
0
20 Feb 2025
Peer-to-Peer Learning Dynamics of Wide Neural Networks
Shreyas Chaudhari
Srinivasa Pranav
Emile Anand
José M. F. Moura
42
3
0
23 Sep 2024
SketchOGD: Memory-Efficient Continual Learning
Benjamin Wright
Youngjae Min
Jeremy Bernstein
Navid Azizan
CLL
28
0
0
25 May 2023
Mirror descent of Hopfield model
Hyungjoon Soh
D. Kim
Juno Hwang
Junghyo Jo
25
0
0
29 Nov 2022
Stochastic Mirror Descent in Average Ensemble Models
Taylan Kargin
Fariborz Salehi
B. Hassibi
26
1
0
27 Oct 2022
Uncertainty-Aware Meta-Learning for Multimodal Task Distributions
Cesar Almecija
Apoorva Sharma
Navid Azizan
OOD
UQCV
26
3
0
04 Oct 2022
SYNTHESIS: A Semi-Asynchronous Path-Integrated Stochastic Gradient Method for Distributed Learning in Computing Clusters
Zhuqing Liu
Xin Zhang
Jia-Wei Liu
38
1
0
17 Aug 2022
Control-oriented meta-learning
Spencer M. Richards
Navid Azizan
Jean-Jacques E. Slotine
Marco Pavone
37
24
0
14 Apr 2022
Stochastic Mirror Descent: Convergence Analysis and Adaptive Variants via the Mirror Stochastic Polyak Stepsize
Ryan DÓrazio
Nicolas Loizou
I. Laradji
Ioannis Mitliagkas
34
30
0
28 Oct 2021
Adaptive-Control-Oriented Meta-Learning for Nonlinear Systems
Spencer M. Richards
Navid Azizan
Jean-Jacques E. Slotine
Marco Pavone
37
70
0
07 Mar 2021
Sketching Curvature for Efficient Out-of-Distribution Detection for Deep Neural Networks
Apoorva Sharma
Navid Azizan
Marco Pavone
UQCV
33
45
0
24 Feb 2021
When Does Preconditioning Help or Hurt Generalization?
S. Amari
Jimmy Ba
Roger C. Grosse
Xuechen Li
Atsushi Nitanda
Taiji Suzuki
Denny Wu
Ji Xu
36
32
0
18 Jun 2020
On the distance between two neural networks and the stability of learning
Jeremy Bernstein
Arash Vahdat
Yisong Yue
Xuan Li
ODL
200
57
0
09 Feb 2020
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Zhiwen Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
718
6,748
0
26 Sep 2016
1