2506.00181
Cited By
On the Interaction of Noise, Compression Role, and Adaptivity under $(L_0, L_1)$-Smoothness: An SDE-based Approach
30 May 2025
Enea Monzio Compagnoni
Rustem Islamov
Antonio Orvieto
Eduard A. Gorbunov
Papers citing
"On the Interaction of Noise, Compression Role, and Adaptivity under $(L_0, L_1)$-Smoothness: An SDE-based Approach"
1 / 1 papers shown
Title
Is your batch size the problem? Revisiting the Adam-SGD gap in language modeling
Teodora Srećković
Jonas Geiping
Antonio Orvieto
MoE
14 Jun 2025