Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2505.22014
Cited By
v1
v2 (latest)
Learning in Compact Spaces with Approximately Normalized Transformer
28 May 2025
Jörg Franke
Urs Spiegelhalter
Marianna Nezhurina
J. Jitsev
Katharina Eggensperger
Michael Hefenbrock
Re-assign community
ArXiv (abs)
PDF
HTML
Github (193★)
Papers citing
"Learning in Compact Spaces with Approximately Normalized Transformer"
4 / 4 papers shown
Towards Scaling Laws for Symbolic Regression
David Otte
Jörg Franke
Frank Hutter
131
0
0
30 Oct 2025
Transformers without Normalization
Computer Vision and Pattern Recognition (CVPR), 2025
Jiachen Zhu
Xinlei Chen
Kaiming He
Yann LeCun
Zhuang Liu
OffRL
ViT
491
92
0
13 Mar 2025
The Curse of Depth in Large Language Models
Wenfang Sun
Xinyuan Song
Pengxiang Li
Lu Yin
Yefeng Zheng
Shiwei Liu
415
22
0
09 Feb 2025
Resolving Discrepancies in Compute-Optimal Scaling of Language Models
Tomer Porian
Mitchell Wortsman
J. Jitsev
Ludwig Schmidt
Y. Carmon
471
53
0
27 Jun 2024
1
Page 1 of 1