Papers
Communities
Organizations
Events
Blog
Pricing
Feedback
Contact Sales
Search
Open menu
Home
Papers
2502.06742
Cited By
Gradient Multi-Normalization for Stateless and Scalable LLM Training
10 February 2025
M. Scetbon
Chao Ma
Wenbo Gong
Edward Meeds
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Gradient Multi-Normalization for Stateless and Scalable LLM Training"
1 / 1 papers shown
Title
Deconstructing What Makes a Good Optimizer for Language Models
Rosie Zhao
Depen Morwani
David Brandfonbrener
Nikhil Vyas
Sham Kakade
188
28
0
10 Jul 2024
1