arXiv:2410.05052
Initialization of Large Language Models via Reparameterization to Mitigate Loss Spikes
7 October 2024
Kosuke Nishida
Kyosuke Nishida
Kuniko Saito
Papers citing "Initialization of Large Language Models via Reparameterization to Mitigate Loss Spikes"
No papers