2102.11600
ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks
International Conference on Machine Learning (ICML), 2021
23 February 2021
Jungmin Kwon
Jeongseop Kim
Hyunseong Park
I. Choi
Papers citing
"ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks"
50 / 224 papers shown
Title
Frequency-Adaptive Sharpness Regularization for Improving 3D Gaussian Splatting Generalization
Youngsik Yun
Dongjun Gu
Youngjung Uh
120
0
0
22 Nov 2025
A Unified Stability Analysis of SAM vs SGD: Role of Data Coherence and Emergence of Simplicity Bias
Wei-Kai Chang
Rajiv Khanna
MLT
136
0
0
21 Nov 2025
Flat Minima and Generalization: Insights from Stochastic Convex Optimization
Matan Schliserman
Shira Vansover-Hager
Tomer Koren
76
0
0
05 Nov 2025
DP-FedPGN: Finding Global Flat Minima for Differentially Private Federated Learning via Penalizing Gradient Norm
Junkang Liu
Yuxuan Tian
Fanhua Shang
Yuanyuan Liu
Hongying Liu
Junchao Zhou
Daorui Ding
FedML
237
2
0
31 Oct 2025
Modality-Aware SAM: Sharpness-Aware-Minimization Driven Gradient Modulation for Harmonized Multimodal Learning
Hossein R. Nowdeh
Jie Ji
Xiaolong Ma
Fatemeh Afghah
112
0
0
28 Oct 2025
Position: Many generalization measures for deep learning are fragile
Shuofeng Zhang
A. Louis
AAML
218
0
0
21 Oct 2025
SAMOSA: Sharpness Aware Minimization for Open Set Active learning
Young In Kim
Andrea Agiollo
Rajiv Khanna
201
0
0
19 Oct 2025
Zeroth-Order Sharpness-Aware Learning with Exponential Tilting
Xuchen Gong
Tian Li
132
0
0
17 Oct 2025
When Flatness Does (Not) Guarantee Adversarial Robustness
Nils Philipp Walter
Linara Adilova
Jilles Vreeken
Michael Kamp
100
1
0
16 Oct 2025
AppForge: From Assistant to Independent Developer - Are GPTs Ready for Software Development?
Dezhi Ran
Yuan Cao
Mengzhou Wu
Simin Chen
Yuzhe Guo
...
Jialei Wei
Linyi Li
Wei Yang
Baishakhi Ray
Tao Xie
LLMAG
ALM
ELM
100
0
0
09 Oct 2025
Adjusting Initial Noise to Mitigate Memorization in Text-to-Image Diffusion Models
Hyeonggeun Han
Sehwan Kim
Hyungjun Joo
Sangwoo Hong
Jungwoo Lee
DiffM
157
1
0
08 Oct 2025
Adaptively Sampling-Reusing-Mixing Decomposed Gradients to Speed Up Sharpness Aware Minimization
Jiaxin Deng
Junbiao Pang
140
0
0
04 Oct 2025
Flatness-Aware Stochastic Gradient Langevin Dynamics
Stefano Bruno
Youngsik Hwang
Jaehyeon An
Sotirios Sabanis
Dong-Young Lim
144
0
0
02 Oct 2025
Beyond Magic Words: Sharpness-Aware Prompt Evolving for Robust Large Language Models with TARE
Guancheng Wan
Lucheng Fu
Haoxin Liu
Yiqiao Jin
Hui Yi Leong
...
Yunpu Ma
Xiangru Tang
B. A. Prakash
Yizhou Sun
Wei Wang
KELM
103
0
0
28 Sep 2025
Sharpness-Aware Minimization Can Hallucinate Minimizers
Chanwoong Park
Uijeong Jang
Ernest K. Ryu
Insoon Yang
103
0
0
26 Sep 2025
Development of Deep Learning Optimizers: Approaches, Concepts, and Update Rules
Doğay Altınel
116
0
0
22 Sep 2025
Adapt in the Wild: Test-Time Entropy Minimization with Sharpness and Feature Regularization
Shuaicheng Niu
Guohao Chen
Deyu Chen
Yifan Zhang
Jiaxiang Wu
Z. Wen
Yaofo Chen
P. Zhao
Chunyan Miao
Zhuliang Yu
AAML
160
1
0
05 Sep 2025
LSAM: Asynchronous Distributed Training with Landscape-Smoothed Sharpness-Aware Minimization
Yunfei Teng
Sixin Zhang
117
0
0
03 Sep 2025
VASSO: Variance Suppression for Sharpness-Aware Minimization
Bingcong Li
Yilang Zhang
G. Giannakis
232
1
0
02 Sep 2025
Adaptive Heavy-Tailed Stochastic Gradient Descent
Bodu Gong
Gustavo Enrique Batista
Pierre Lafaye de Micheaux
112
0
0
29 Aug 2025
Bi-LoRA: Efficient Sharpness-Aware Minimization for Fine-Tuning Large-Scale Models
Yuhang Liu
Tao Li
Zhehao Huang
Zuopeng Yang
Xiaolin Huang
76
0
0
27 Aug 2025
Flatness-aware Curriculum Learning via Adversarial Difficulty
Hiroaki Aizawa
Yoshikazu Hayashi
ODL
212
0
0
26 Aug 2025
Curvature Learning for Generalization of Hyperbolic Neural Networks
International Journal of Computer Vision (IJCV), 2025
Xiaomeng Fan
Yuwei Wu
Zhi Gao
Mehrtash Harandi
Yunde Jia
216
2
0
24 Aug 2025
Unpacking the Implicit Norm Dynamics of Sharpness-Aware Minimization in Tensorized Models
Tianxiao Cao
Kyohei Atarashi
H. Kashima
198
0
0
14 Aug 2025
Domain-Generalization to Improve Learning in Meta-Learning Algorithms
Usman Anjum
Chris Stockman
Cat Luong
J. Zhan
FedML
170
0
0
13 Aug 2025
Tractable Sharpness-Aware Learning of Probabilistic Circuits
Hrithik Suresh
Sahil Sidheekh
Vishnu Shreeram M.P
S. Natarajan
N. C. Krishnan
TPM
156
0
0
07 Aug 2025
Efficiently Seeking Flat Minima for Better Generalization in Fine-Tuning Large Language Models and Beyond
Jiaxin Deng
Qingcheng Zhu
Junbiao Pang
Linlin Yang
Zhongqian Fu
Baochang Zhang
125
0
0
01 Aug 2025
Communication-Efficient Distributed Training for Collaborative Flat Optima Recovery in Deep Learning
Tolga Dimlioglu
A. Choromańska
FedML
242
1
0
27 Jul 2025
Large Learning Rates Simultaneously Achieve Robustness to Spurious Correlations and Compressibility
Melih Barsbey
Lucas Prieto
Stefanos Zafeiriou
Tolga Birdal
256
0
0
23 Jul 2025
Pre-Training LLMs on a budget: A comparison of three optimizers
Joel Schlotthauer
Christian Kroos
Chris Hinze
Viktor Hangya
Luzian Hahn
Fabian Küch
157
0
0
11 Jul 2025
DGSAM: Domain Generalization via Individual Sharpness-Aware Minimization
Youngjun Song
Youngsik Hwang
Jonghun Lee
Heechang Lee
Dong-Young Lim
AAML
271
0
0
01 Jul 2025
LightSAM: Parameter-Agnostic Sharpness-Aware Minimization
Yifei Cheng
Li Shen
Hao Sun
Nan Yin
Xiaochun Cao
Enhong Chen
AAML
210
0
0
30 May 2025
RoGA: Towards Generalizable Deepfake Detection through Robust Gradient Alignment
Lingyu Qiu
Ke Jiang
Xiaoyang Tan
302
0
0
27 May 2025
Optimization-Inspired Few-Shot Adaptation for Large Language Models
Boyan Gao
Xin Wang
Jianlong Wu
David A. Clifton
252
0
0
25 May 2025
Mitigating Fine-tuning Risks in LLMs via Safety-Aware Probing Optimization
Chengcan Wu
Zhixin Zhang
Zeming Wei
Yihao Zhang
Meng Sun
AAML
216
8
0
22 May 2025
Sharpness-Aware Minimization with Z-Score Gradient Filtering
Juyoung Yun
573
0
0
05 May 2025
A Model Zoo on Phase Transitions in Neural Networks
Konstantin Schurholt
Léo Meynent
Yefan Zhou
Haiquan Lu
Yaoqing Yang
Damian Borth
330
2
0
25 Apr 2025
Mitigating Parameter Interference in Model Merging via Sharpness-Aware Fine-Tuning
International Conference on Learning Representations (ICLR), 2025
Yeoreum Lee
Jinwook Jung
Sungyong Baik
MoMe
348
6
0
20 Apr 2025
Layer-wise Adaptive Gradient Norm Penalizing Method for Efficient and Accurate Deep Learning
Knowledge Discovery and Data Mining (KDD), 2024
Sunwoo Lee
289
2
0
18 Mar 2025
Understanding Flatness in Generative Models: Its Role and Benefits
Taehwan Lee
Kyeongkook Seo
Jaejun Yoo
Sung Whan Yoon
DiffM
304
1
0
14 Mar 2025
Precise Event Spotting in Sports Videos: Solving Long-Range Dependency and Class Imbalance
Computer Vision and Pattern Recognition (CVPR), 2025
Sanchayan Santra
Vishal M. Chudasama
Pankaj Wasnik
Vineeth N. Balasubramanian
AI4TS
348
4
0
28 Feb 2025
Preconditioned Sharpness-Aware Minimization: Unifying Analysis and a Novel Learning Algorithm
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Yilang Zhang
Bingcong Li
G. Giannakis
AAML
183
0
0
11 Jan 2025
KALAHash: Knowledge-Anchored Low-Resource Adaptation for Deep Hashing
Shu Zhao
Tan Yu
Xiaoshuai Hao
Wenchao Ma
Vijaykrishnan Narayanan
192
3
0
27 Dec 2024
Can Stability be Detrimental? Better Generalization through Gradient Descent Instabilities
Lawrence Wang
Stephen J. Roberts
233
0
0
23 Dec 2024
Sharpness-Aware Minimization with Adaptive Regularization for Training Deep Neural Networks
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Jinping Zou
Xiaoge Deng
Tao Sun
288
1
0
22 Dec 2024
SSE-SAM: Balancing Head and Tail Classes Gradually through Stage-Wise SAM
AAAI Conference on Artificial Intelligence (AAAI), 2024
Xingyu Lyu
Qianqian Xu
Zhiyong Yang
Shaojie Lyu
Qingming Huang
432
1
0
18 Dec 2024
Seeking Consistent Flat Minima for Better Domain Generalization via Refining Loss Landscapes
Computer Vision and Pattern Recognition (CVPR), 2024
Aodi Li
Liansheng Zhuang
Xiao Long
Minghong Yao
Shafei Wang
1.1K
5
0
18 Dec 2024
Meta Curvature-Aware Minimization for Domain Generalization
Zhaoyu Chen
Yiwen Ye
Feilong Tang
Yongsheng Pan
Yong-quan Xia
BDL
907
1
0
16 Dec 2024
Towards Understanding the Role of Sharpness-Aware Minimization Algorithms for Out-of-Distribution Generalization
Samuel Schapiro
Han Zhao
333
1
0
06 Dec 2024
Reliable Poisoned Sample Detection against Backdoor Attacks Enhanced by Sharpness Aware Minimization
Ruotong Wang
Mingli Zhu
Zihao Zhu
Baoyuan Wu
AAML
342
4
0
18 Nov 2024