ResearchTrend.AI
Weight fluctuations in (deep) linear neural networks and a derivation of the inverse-variance flatness relation

23 November 2023
Markus Gross
A. Raulf
Christoph Räth
arXiv: 2311.14120

Papers citing "Weight fluctuations in (deep) linear neural networks and a derivation of the inverse-variance flatness relation"

4 / 4 papers shown
Double Descent Demystified: Identifying, Interpreting & Ablating the Sources of a Deep Learning Puzzle
Rylan Schaeffer
Mikail Khona
Zachary Robertson
Akhilan Boopathy
Kateryna Pistunova
J. Rocks
Ila Rani Fiete
Oluwasanmi Koyejo
62
30
0
24 Mar 2023
What Happens after SGD Reaches Zero Loss? --A Mathematical Framework
Zhiyuan Li
Tianhao Wang
Sanjeev Arora
MLT
83
98
0
13 Oct 2021
Revisiting the Characteristics of Stochastic Gradient Noise and Dynamics
Yixin Wu
Rui Luo
Chen Zhang
Jun Wang
Yaodong Yang
43
7
0
20 Sep 2021
Cleaning large correlation matrices: tools from random matrix theory
J. Bun
J. Bouchaud
M. Potters
27
262
0
25 Oct 2016