ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.05052
  4. Cited By
Initialization of Large Language Models via Reparameterization to
  Mitigate Loss Spikes

Initialization of Large Language Models via Reparameterization to Mitigate Loss Spikes

7 October 2024
Kosuke Nishida
Kyosuke Nishida
Kuniko Saito
ArXivPDFHTML

Papers citing "Initialization of Large Language Models via Reparameterization to Mitigate Loss Spikes"

Title
No papers