ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.22014
  4. Cited By
Learning in Compact Spaces with Approximately Normalized Transformer
v1v2 (latest)

Learning in Compact Spaces with Approximately Normalized Transformer

28 May 2025
Jörg Franke
Urs Spiegelhalter
Marianna Nezhurina
J. Jitsev
Katharina Eggensperger
Michael Hefenbrock
ArXiv (abs)PDFHTMLGithub (193★)

Papers citing "Learning in Compact Spaces with Approximately Normalized Transformer"

4 / 4 papers shown
Towards Scaling Laws for Symbolic Regression
Towards Scaling Laws for Symbolic Regression
David Otte
Jörg Franke
Frank Hutter
131
0
0
30 Oct 2025
Transformers without Normalization
Transformers without NormalizationComputer Vision and Pattern Recognition (CVPR), 2025
Jiachen Zhu
Xinlei Chen
Kaiming He
Yann LeCun
Zhuang Liu
OffRLViT
491
92
0
13 Mar 2025
The Curse of Depth in Large Language Models
The Curse of Depth in Large Language Models
Wenfang Sun
Xinyuan Song
Pengxiang Li
Lu Yin
Yefeng Zheng
Shiwei Liu
415
22
0
09 Feb 2025
Resolving Discrepancies in Compute-Optimal Scaling of Language Models
Resolving Discrepancies in Compute-Optimal Scaling of Language Models
Tomer Porian
Mitchell Wortsman
J. Jitsev
Ludwig Schmidt
Y. Carmon
471
53
0
27 Jun 2024
1
Page 1 of 1