Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2506.01913
Cited By
v1
v2
v3 (latest)
Generalized Gradient Norm Clipping & Non-Euclidean
(
L
0
,
L
1
)
(L_0,L_1)
(
L
0
,
L
1
)
-Smoothness
2 June 2025
Thomas Pethick
Wanyun Xie
Mete Erdogan
Kimon Antonakopoulos
Tony Silveti-Falls
Volkan Cevher
Re-assign community
ArXiv (abs)
PDF
HTML
Github (3★)
Papers citing
"Generalized Gradient Norm Clipping & Non-Euclidean $(L_0,L_1)$-Smoothness"
5 / 5 papers shown
Beyond the Ideal: Analyzing the Inexact Muon Update
Egor Shulgin
Sultan AlRashed
Francesco Orabona
Peter Richtárik
170
8
0
22 Oct 2025
Robust Layerwise Scaling Rules by Proper Weight Decay Tuning
Zhiyuan Fan
Yifeng Liu
Qingyue Zhao
Angela Yuan
Quanquan Gu
149
3
0
17 Oct 2025
Adaptive Conditional Gradient Descent
Abbas Khademi
Antonio Silveti-Falls
122
3
0
13 Oct 2025
Optimal Scaling Needs Optimal Norm
Oleg Filatov
Jiangtao Wang
J. Ebert
Stefan Kesselheim
241
3
0
04 Oct 2025
LiMuon: Light and Fast Muon Optimizer for Large Models
Feihu Huang
Yuning Luo
Songcan Chen
263
10
0
18 Sep 2025
1
Page 1 of 1