Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2401.12033
Cited By
Momentum-SAM: Sharpness Aware Minimization without Computational Overhead
22 January 2024
Marlon Becker
Frederick Altrock
Benjamin Risse
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Momentum-SAM: Sharpness Aware Minimization without Computational Overhead"
9 / 9 papers shown
Title
SASSHA: Sharpness-aware Adaptive Second-order Optimization with Stable Hessian Approximation
Dahun Shin
Dongyeop Lee
Jinseok Chung
Namhoon Lee
ODL
AAML
90
0
0
25 Feb 2025
Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late in Training
Zhanpeng Zhou
Mingze Wang
Yuchen Mao
Bingrui Li
Junchi Yan
AAML
55
0
0
14 Oct 2024
Bilateral Sharpness-Aware Minimization for Flatter Minima
Jiaxin Deng
Junbiao Pang
Baochang Zhang
Qingming Huang
AAML
49
0
0
20 Sep 2024
Neighborhood and Global Perturbations Supported SAM in Federated Learning: From Local Tweaks To Global Awareness
Boyuan Li
Zihao Peng
Yafei Li
Mingliang Xu
Shengbo Chen
Baofeng Ji
Cong Shen
FedML
48
0
0
26 Aug 2024
Asymptotic Unbiased Sample Sampling to Speed Up Sharpness-Aware Minimization
Jiaxin Deng
Junbiao Pang
Baochang Zhang
64
1
0
12 Jun 2024
An Adaptive Policy to Employ Sharpness-Aware Minimization
Weisen Jiang
Hansi Yang
Yu Zhang
James T. Kwok
AAML
77
31
0
28 Apr 2023
Sharpness-Aware Minimization Improves Language Model Generalization
Dara Bahri
H. Mobahi
Yi Tay
116
82
0
16 Oct 2021
Efficient Sharpness-aware Minimization for Improved Training of Neural Networks
Jiawei Du
Hanshu Yan
Jiashi Feng
Joey Tianyi Zhou
Liangli Zhen
Rick Siow Mong Goh
Vincent Y. F. Tan
AAML
99
132
0
07 Oct 2021
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
273
2,696
0
15 Sep 2016
1