Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2509.26030
Cited By
v1
v2 (latest)
Muon Outperforms Adam in Tail-End Associative Memory Learning
30 September 2025
Shuche Wang
Fengzhuo Zhang
Jiaxiang Li
Cunxiao Du
C. Du
Tianyu Pang
Zhuoran Yang
Mingyi Hong
Vincent Y. F. Tan
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (17 upvotes)
Papers citing
"Muon Outperforms Adam in Tail-End Associative Memory Learning"
2 / 2 papers shown
Isotropic Curvature Model for Understanding Deep Learning Optimization: Is Gradient Orthogonalization Optimal?
Weijie Su
145
0
0
01 Nov 2025
Optimal Scaling Needs Optimal Norm
Oleg Filatov
Jiangtao Wang
J. Ebert
Stefan Kesselheim
166
2
0
04 Oct 2025
1