Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2311.14120
Cited By
Weight fluctuations in (deep) linear neural networks and a derivation of the inverse-variance flatness relation
23 November 2023
Markus Gross
A. Raulf
Christoph Räth
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Weight fluctuations in (deep) linear neural networks and a derivation of the inverse-variance flatness relation"
4 / 4 papers shown
Title
Double Descent Demystified: Identifying, Interpreting & Ablating the Sources of a Deep Learning Puzzle
Rylan Schaeffer
Mikail Khona
Zachary Robertson
Akhilan Boopathy
Kateryna Pistunova
J. Rocks
Ila Rani Fiete
Oluwasanmi Koyejo
62
30
0
24 Mar 2023
What Happens after SGD Reaches Zero Loss? --A Mathematical Framework
Zhiyuan Li
Tianhao Wang
Sanjeev Arora
MLT
83
98
0
13 Oct 2021
Revisiting the Characteristics of Stochastic Gradient Noise and Dynamics
Yixin Wu
Rui Luo
Chen Zhang
Jun Wang
Yaodong Yang
43
7
0
20 Sep 2021
Cleaning large correlation matrices: tools from random matrix theory
J. Bun
J. Bouchaud
M. Potters
27
262
0
25 Oct 2016
1