Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2407.19353
Cited By
A spring-block theory of feature learning in deep neural networks
28 July 2024
Chengzhi Shi
Liming Pan
Ivan Dokmanić
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A spring-block theory of feature learning in deep neural networks"
3 / 3 papers shown
Title
A Law of Next-Token Prediction in Large Language Models
Hangfeng He
Weijie J. Su
27
5
0
24 Aug 2024
Asymptotics of feature learning in two-layer networks after one gradient-step
Hugo Cui
Luca Pesce
Yatin Dandi
Florent Krzakala
Yue M. Lu
Lenka Zdeborová
Bruno Loureiro
MLT
44
16
0
07 Feb 2024
The Benefits of Reusing Batches for Gradient Descent in Two-Layer Networks: Breaking the Curse of Information and Leap Exponents
Yatin Dandi
Emanuele Troiani
Luca Arnaboldi
Luca Pesce
Lenka Zdeborová
Florent Krzakala
MLT
59
24
0
05 Feb 2024
1