Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.11873
Cited By
A Tale of Two Circuits: Grokking as Competition of Sparse and Dense Subnetworks
21 March 2023
William Merrill
Nikolaos Tsilivis
Aman Shukla
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Tale of Two Circuits: Grokking as Competition of Sparse and Dense Subnetworks"
7 / 7 papers shown
Title
Distributional Scaling Laws for Emergent Capabilities
Rosie Zhao
Tian Qin
David Alvarez-Melis
Sham Kakade
Naomi Saphra
LRM
37
0
0
24 Feb 2025
Information-Theoretic Progress Measures reveal Grokking is an Emergent Phase Transition
Kenzo Clauw
S. Stramaglia
Daniele Marinazzo
45
3
0
16 Aug 2024
Complexity Matters: Dynamics of Feature Learning in the Presence of Spurious Correlations
GuanWen Qiu
Da Kuang
Surbhi Goel
25
8
0
05 Mar 2024
Grokking as Compression: A Nonlinear Complexity Perspective
Ziming Liu
Ziqian Zhong
Max Tegmark
25
9
0
09 Oct 2023
Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck
Benjamin L. Edelman
Surbhi Goel
Sham Kakade
Eran Malach
Cyril Zhang
43
7
0
07 Sep 2023
Faith and Fate: Limits of Transformers on Compositionality
Nouha Dziri
Ximing Lu
Melanie Sclar
Xiang Lorraine Li
Liwei Jian
...
Sean Welleck
Xiang Ren
Allyson Ettinger
Zaïd Harchaoui
Yejin Choi
ReLM
LRM
28
324
0
29 May 2023
Omnigrok: Grokking Beyond Algorithmic Data
Ziming Liu
Eric J. Michaud
Max Tegmark
54
76
0
03 Oct 2022
1