2501.06842
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training
12 January 2025
Tianjin Huang, Ziquan Zhu, Gaojie Jin, Lu Liu, Zhangyang Wang, Shiwei Liu
Papers citing "SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training"
1 / 1 papers shown
Stable-SPAM: How to Train in 4-Bit More Stably than 16-Bit Adam
Tianjin Huang, Haotian Hu, Zhenyu (Allen) Zhang, Gaojie Jin, X. Li, ..., Tianlong Chen, Lu Liu, Qingsong Wen, Zhangyang Wang, Shiwei Liu
24 Feb 2025