ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.16441
  4. Cited By
Grokking in Linear Estimators -- A Solvable Model that Groks without
  Understanding

Grokking in Linear Estimators -- A Solvable Model that Groks without Understanding

25 October 2023
Noam Levi
Alon Beck
Yohai Bar-Sinai
ArXivPDFHTML

Papers citing "Grokking in Linear Estimators -- A Solvable Model that Groks without Understanding"

11 / 11 papers shown
Title
Deep Learning is Not So Mysterious or Different
Andrew Gordon Wilson
36
1
0
03 Mar 2025
Position: Solve Layerwise Linear Models First to Understand Neural Dynamical Phenomena (Neural Collapse, Emergence, Lazy/Rich Regime, and Grokking)
Position: Solve Layerwise Linear Models First to Understand Neural Dynamical Phenomena (Neural Collapse, Emergence, Lazy/Rich Regime, and Grokking)
Yoonsoo Nam
Seok Hyeong Lee
Clementine Domine
Yea Chan Park
Charles London
Wonyl Choi
Niclas Goring
Seungjai Lee
AI4CE
33
0
0
28 Feb 2025
Grokking Explained: A Statistical Phenomenon
Grokking Explained: A Statistical Phenomenon
B. W. Carvalho
Artur Garcez
Luís C. Lamb
Emílio Vital Brazil
59
0
0
03 Feb 2025
Grokking at the Edge of Linear Separability
Grokking at the Edge of Linear Separability
Alon Beck
Noam Levi
Yohai Bar-Sinai
24
0
0
06 Oct 2024
Why Do You Grok? A Theoretical Analysis of Grokking Modular Addition
Why Do You Grok? A Theoretical Analysis of Grokking Modular Addition
Mohamad Amin Mohamadi
Zhiyuan Li
Lei Wu
Danica J. Sutherland
33
10
0
17 Jul 2024
On Regularization via Early Stopping for Least Squares Regression
On Regularization via Early Stopping for Least Squares Regression
Rishi Sonthalia
Jackie Lok
E. Rebrova
20
2
0
06 Jun 2024
Phase Transitions in the Output Distribution of Large Language Models
Phase Transitions in the Output Distribution of Large Language Models
Julian Arnold
Flemming Holtorf
Frank Schafer
Niels Lörch
34
1
0
27 May 2024
Towards Uncovering How Large Language Model Works: An Explainability
  Perspective
Towards Uncovering How Large Language Model Works: An Explainability Perspective
Haiyan Zhao
Fan Yang
Bo Shen
Himabindu Lakkaraju
Mengnan Du
35
10
0
16 Feb 2024
Measuring Sharpness in Grokking
Measuring Sharpness in Grokking
Jack Miller
Patrick Gleeson
Charles OÑeill
Thang Bui
Noam Levi
19
1
0
14 Feb 2024
Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce
  Grokking
Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking
Kaifeng Lyu
Jikai Jin
Zhiyuan Li
Simon S. Du
Jason D. Lee
Wei Hu
AI4CE
22
32
0
30 Nov 2023
Grokking Beyond Neural Networks: An Empirical Exploration with Model
  Complexity
Grokking Beyond Neural Networks: An Empirical Exploration with Model Complexity
Jack Miller
Charles OÑeill
Thang Bui
19
9
0
26 Oct 2023
1