Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2506.01115
Cited By
v1
v2 (latest)
Attention Retrieves, MLP Memorizes: Disentangling Trainable Components in the Transformer
1 June 2025
Yihe Dong
Lorenzo Noci
Mikhail Khodak
Mufan Li
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Attention Retrieves, MLP Memorizes: Disentangling Trainable Components in the Transformer"
Title
No papers