ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
  • Feedback
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.06742
  4. Cited By
Gradient Multi-Normalization for Stateless and Scalable LLM Training

Gradient Multi-Normalization for Stateless and Scalable LLM Training

10 February 2025
M. Scetbon
Chao Ma
Wenbo Gong
Edward Meeds
ArXiv (abs)PDFHTML

Papers citing "Gradient Multi-Normalization for Stateless and Scalable LLM Training"

1 / 1 papers shown
Title
Deconstructing What Makes a Good Optimizer for Language Models
Deconstructing What Makes a Good Optimizer for Language Models
Rosie Zhao
Depen Morwani
David Brandfonbrener
Nikhil Vyas
Sham Kakade
188
28
0
10 Jul 2024
1