Grokking Explained: A Statistical Phenomenon

3 February 2025

ArXiv (abs)PDF HTML Github

Papers citing "Grokking Explained: A Statistical Phenomenon"

10 / 10 papers shown

Grokking in Linear Estimators -- A Solvable Model that Groks without Understanding

Noam Levi

Alon Beck

Yohai Bar-Sinai

189

25 Oct 2023

Grokking as the Transition from Lazy to Rich Training DynamicsInternational Conference on Learning Representations (ICLR), 2023

413

09 Oct 2023

Are Emergent Abilities of Large Language Models a Mirage?Neural Information Processing Systems (NeurIPS), 2023

519

610

28 Apr 2023

Progress measures for grokking via mechanistic interpretabilityInternational Conference on Learning Representations (ICLR), 2023

643

728

12 Jan 2023

Grokking phase transitions in learning local rules with gradient descentJournal of machine learning research (JMLR), 2022

Bojan Žunkovič

E. Ilievski

346

26 Oct 2022

Towards Understanding Grokking: An Effective Theory of Representation LearningNeural Information Processing Systems (NeurIPS), 2022

392

230

20 May 2022

Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets

479

560

06 Jan 2022

Geometric Deep Learning: Grids, Groups, Graphs, Geodesics, and Gauges

Joan Bruna

1.2K

1,489

27 Apr 2021

Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift

Stephan Rabanser

Stephan Günnemann

Zachary Chase Lipton

397

434

29 Oct 2018

Microsoft COCO: Common Objects in ContextEuropean Conference on Computer Vision (ECCV), 2014

Piotr Dollár

27.0K

51,414

01 May 2014