Grokking in Linear Estimators -- A Solvable Model that Groks without Understanding

25 October 2023

Papers citing "Grokking in Linear Estimators -- A Solvable Model that Groks without Understanding"

11 / 11 papers shown

Title
Deep Learning is Not So Mysterious or Different Andrew Gordon Wilson 36 1 0 03 Mar 2025
Position: Solve Layerwise Linear Models First to Understand Neural Dynamical Phenomena (Neural Collapse, Emergence, Lazy/Rich Regime, and Grokking) Yoonsoo Nam Seok Hyeong Lee Clementine Domine Yea Chan Park Charles London Wonyl Choi Niclas Goring Seungjai Lee AI4CE 33 0 0 28 Feb 2025
Grokking Explained: A Statistical Phenomenon B. W. Carvalho Artur Garcez Luís C. Lamb Emílio Vital Brazil 59 0 0 03 Feb 2025
Grokking at the Edge of Linear Separability Alon Beck Noam Levi Yohai Bar-Sinai 24 0 0 06 Oct 2024
Why Do You Grok? A Theoretical Analysis of Grokking Modular Addition Mohamad Amin Mohamadi Zhiyuan Li Lei Wu Danica J. Sutherland 33 10 0 17 Jul 2024
On Regularization via Early Stopping for Least Squares Regression Rishi Sonthalia Jackie Lok E. Rebrova 20 2 0 06 Jun 2024
Phase Transitions in the Output Distribution of Large Language Models Julian Arnold Flemming Holtorf Frank Schafer Niels Lörch 34 1 0 27 May 2024
Towards Uncovering How Large Language Model Works: An Explainability Perspective Haiyan Zhao Fan Yang Bo Shen Himabindu Lakkaraju Mengnan Du 35 10 0 16 Feb 2024
Measuring Sharpness in Grokking Jack Miller Patrick Gleeson Charles OÑeill Thang Bui Noam Levi 19 1 0 14 Feb 2024
Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking Kaifeng Lyu Jikai Jin Zhiyuan Li Simon S. Du Jason D. Lee Wei Hu AI4CE 22 32 0 30 Nov 2023
Grokking Beyond Neural Networks: An Empirical Exploration with Model Complexity Jack Miller Charles OÑeill Thang Bui 19 9 0 26 Oct 2023