Muon Optimizes Under Spectral Norm Constraints

18 June 2025
Lizhang Chen
Jonathan Li
Qiang Liu
Main: 25 Pages
7 Figures
Bibliography: 4 Pages
2 Tables
Appendix: 5 Pages
Abstract

The pursuit of faster optimization algorithms remains an active and important research direction in deep learning. Recently, the Muon optimizer [JJB+24] has demonstrated promising empirical performance, but its theoretical foundation remains less understood. In this paper, we bridge this gap and provide a theoretical analysis of Muon by placing it within the Lion-$\mathcal{K}$ family of optimizers [CLLL24]. Specifically, we show that Muon corresponds to Lion-$\mathcal{K}$ when equipped with the nuclear norm, and we leverage the theoretical results of Lion-$\mathcal{K}$ to establish that Muon (with decoupled weight decay) implicitly solves an optimization problem that enforces a constraint on the spectral norm of weight matrices. This perspective not only demystifies the implicit regularization effects of Muon but also leads to natural generalizations through varying the choice of convex map $\mathcal{K}$, allowing for the exploration of a broader class of implicitly regularized and constrained optimization algorithms.
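
The claim in the abstract can be illustrated with a small numerical sketch. The code below is not the authors' implementation. It assumes a Muon-style step of the form W ← W − lr · (msign(M) + λ W), where msign(M) = U Vᵀ is a subgradient of the nuclear norm at the momentum matrix M, and it computes msign with an exact SVD rather than the Newton-Schulz iteration used in practical Muon code. The radius 1/λ for the spectral norm ball is an assumption based on the Lion-$\mathcal{K}$ analysis the abstract refers to. The toy loss, the function names (`msign`, `muon_step`, `run_demo`), and all hyperparameters are hypothetical.

```python
import numpy as np


def msign(M):
    """Return U @ V^T from the thin SVD M = U diag(s) V^T.

    This is a subgradient of the nuclear norm at M and has spectral norm 1.
    """
    U, _, Vt = np.linalg.svd(M, full_matrices=False)
    return U @ Vt


def muon_step(W, M, G, lr, beta, weight_decay):
    """One Muon-style step with decoupled weight decay (illustrative only)."""
    M = beta * M + G                              # momentum accumulation
    W = W - lr * (msign(M) + weight_decay * W)    # orthogonalized update + decay
    return W, M


def run_demo(steps=300, lr=0.05, beta=0.9, weight_decay=0.5, seed=0):
    """Minimize the toy loss 0.5 * ||W - T||_F^2 with the step above.

    The unconstrained minimizer T has spectral norm 10, far outside the
    ball of radius 1 / weight_decay.
    """
    rng = np.random.default_rng(seed)
    T = rng.standard_normal((4, 3))
    T *= 10.0 / np.linalg.norm(T, 2)   # rescale so the top singular value is 10
    W = np.zeros_like(T)
    M = np.zeros_like(T)
    for _ in range(steps):
        G = W - T                      # gradient of the toy loss
        W, M = muon_step(W, M, G, lr, beta, weight_decay)
    return W


if __name__ == "__main__":
    W = run_demo()
    print("spectral norm of W:", np.linalg.norm(W, 2))
    print("assumed bound 1 / weight_decay:", 1.0 / 0.5)
```

Because msign(M) has spectral norm 1, each step satisfies ‖W_{t+1}‖₂ ≤ (1 − lr·λ)‖W_t‖₂ + lr whenever lr·λ < 1, so iterates that start inside the ball ‖W‖₂ ≤ 1/λ stay inside it. In this sketch the iterates therefore cannot reach T, even though T minimizes the unconstrained toy loss.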

@article{chen2025_2506.15054,
  title={Muon Optimizes Under Spectral Norm Constraints},
  author={Lizhang Chen and Jonathan Li and Qiang Liu},
  journal={arXiv preprint arXiv:2506.15054},
  year={2025}
}