ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2102.11600
  4. Cited By
ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning
  of Deep Neural Networks
v1v2v3 (latest)

ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks

International Conference on Machine Learning (ICML), 2021
23 February 2021
Jungmin Kwon
Jeongseop Kim
Hyunseong Park
I. Choi
ArXiv (abs)PDFHTML

Papers citing "ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks"

50 / 224 papers shown
Is Aggregation the Only Choice? Federated Learning via Layer-wise Model
  Recombination
Is Aggregation the Only Choice? Federated Learning via Layer-wise Model RecombinationKnowledge Discovery and Data Mining (KDD), 2023
Ming Hu
Zhihao Yue
Zhiwei Ling
Cheng Chen
Yihao Huang
Xian Wei
Xiang Lian
Yang Liu
Xiao He
FedML
219
25
0
18 May 2023
Sharpness & Shift-Aware Self-Supervised Learning
Sharpness & Shift-Aware Self-Supervised Learning
Ngoc N. Tran
S. Duong
Hoang Phan
Tung Pham
Dinh Q. Phung
Trung Le
SSL
192
1
0
17 May 2023
CrAFT: Compression-Aware Fine-Tuning for Efficient Visual Task
  Adaptation
CrAFT: Compression-Aware Fine-Tuning for Efficient Visual Task Adaptation
J. Heo
S. Azizi
A. Fayyazi
Massoud Pedram
216
1
0
08 May 2023
Venn Diagram Multi-label Class Interpretation of Diabetic Foot Ulcer with Color and Sharpness Enhancement
M. Hasan
Moi Hoon Yap
M. Hasan
176
2
0
01 May 2023
Towards the Flatter Landscape and Better Generalization in Federated
  Learning under Client-level Differential Privacy
Towards the Flatter Landscape and Better Generalization in Federated Learning under Client-level Differential PrivacyIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yi Shi
Kang Wei
Li Shen
Yingqi Liu
Xueqian Wang
Bo Yuan
Dacheng Tao
FedML
265
5
0
01 May 2023
An Adaptive Policy to Employ Sharpness-Aware Minimization
An Adaptive Policy to Employ Sharpness-Aware MinimizationInternational Conference on Learning Representations (ICLR), 2023
Weisen Jiang
Hansi Yang
Yu Zhang
James T. Kwok
AAML
271
43
0
28 Apr 2023
Enhancing Fine-Tuning Based Backdoor Defense with Sharpness-Aware
  Minimization
Enhancing Fine-Tuning Based Backdoor Defense with Sharpness-Aware MinimizationIEEE International Conference on Computer Vision (ICCV), 2023
Mingli Zhu
Shaokui Wei
Li Shen
Yanbo Fan
Baoyuan Wu
AAML
209
78
0
24 Apr 2023
Going Further: Flatness at the Rescue of Early Stopping for Adversarial
  Example Transferability
Going Further: Flatness at the Rescue of Early Stopping for Adversarial Example Transferability
Martin Gubri
Maxime Cordy
Yves Le Traon
AAML
237
3
1
05 Apr 2023
Sample4Geo: Hard Negative Sampling For Cross-View Geo-Localisation
Sample4Geo: Hard Negative Sampling For Cross-View Geo-LocalisationIEEE International Conference on Computer Vision (ICCV), 2023
Fabian Deuser
Konrad Habel
Norbert Oswald
266
124
0
21 Mar 2023
Make Landscape Flatter in Differentially Private Federated Learning
Make Landscape Flatter in Differentially Private Federated LearningComputer Vision and Pattern Recognition (CVPR), 2023
Yi Shi
Yingqi Liu
Kang Wei
Li Shen
Xueqian Wang
Dacheng Tao
FedML
213
88
0
20 Mar 2023
Rethinking Model Ensemble in Transfer-based Adversarial Attacks
Rethinking Model Ensemble in Transfer-based Adversarial AttacksInternational Conference on Learning Representations (ICLR), 2023
Huanran Chen
Yichi Zhang
Yinpeng Dong
Xiao Yang
Hang Su
Junyi Zhu
AAML
366
96
0
16 Mar 2023
Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves
  Generalization
Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves GeneralizationComputer Vision and Pattern Recognition (CVPR), 2023
Xingxuan Zhang
Renzhe Xu
Han Yu
Hao Zou
Peng Cui
326
65
0
03 Mar 2023
AdaSAM: Boosting Sharpness-Aware Minimization with Adaptive Learning
  Rate and Momentum for Training Deep Neural Networks
AdaSAM: Boosting Sharpness-Aware Minimization with Adaptive Learning Rate and Momentum for Training Deep Neural NetworksNeural Networks (Neural Netw.), 2023
Hao Sun
Li Shen
Qihuang Zhong
Liang Ding
Shi-Yong Chen
Jingwei Sun
Jing Li
Guangzhong Sun
Dacheng Tao
163
41
0
01 Mar 2023
Towards Stable Test-Time Adaptation in Dynamic Wild World
Towards Stable Test-Time Adaptation in Dynamic Wild WorldInternational Conference on Learning Representations (ICLR), 2023
Shuaicheng Niu
Jiaxiang Wu
Yifan Zhang
Z. Wen
Yaofo Chen
P. Zhao
Zhuliang Yu
TTA
398
399
0
24 Feb 2023
mSAM: Micro-Batch-Averaged Sharpness-Aware Minimization
mSAM: Micro-Batch-Averaged Sharpness-Aware Minimization
Kayhan Behdin
Qingquan Song
Aman Gupta
S. Keerthi
Ayan Acharya
Borja Ocejo
Gregory Dexter
Rajiv Khanna
D. Durfee
Rahul Mazumder
AAML
303
12
0
19 Feb 2023
Improving Differentiable Architecture Search via Self-Distillation
Improving Differentiable Architecture Search via Self-DistillationNeural Networks (Neural Netw.), 2023
Xunyu Zhu
Jian Li
Yong Liu
Weiping Wang
295
10
0
11 Feb 2023
Flat Seeking Bayesian Neural Networks
Flat Seeking Bayesian Neural NetworksNeural Information Processing Systems (NeurIPS), 2023
Van-Anh Nguyen
L. Vuong
Hoang Phan
Thanh-Toan Do
Dinh Q. Phung
Trung Le
BDL
480
10
0
06 Feb 2023
Exploring the Effect of Multi-step Ascent in Sharpness-Aware
  Minimization
Exploring the Effect of Multi-step Ascent in Sharpness-Aware Minimization
Hoki Kim
Jinseong Park
Yujin Choi
Woojin Lee
Jaewook Lee
152
10
0
27 Jan 2023
An SDE for Modeling SAM: Theory and Insights
An SDE for Modeling SAM: Theory and InsightsInternational Conference on Machine Learning (ICML), 2023
Enea Monzio Compagnoni
Luca Biggio
Antonio Orvieto
F. Proske
Hans Kersting
Aurelien Lucchi
292
22
0
19 Jan 2023
Stability Analysis of Sharpness-Aware Minimization
Stability Analysis of Sharpness-Aware Minimization
Hoki Kim
Jinseong Park
Yujin Choi
Jaewook Lee
170
15
0
16 Jan 2023
GoogLe2Net: Going Transverse with Convolutions
GoogLe2Net: Going Transverse with Convolutions
Yuanpeng He
207
2
0
01 Jan 2023
A Generalization of ViT/MLP-Mixer to Graphs
A Generalization of ViT/MLP-Mixer to GraphsInternational Conference on Machine Learning (ICML), 2022
Xiaoxin He
Bryan Hooi
T. Laurent
Adam Perold
Yann LeCun
Xavier Bresson
243
122
0
27 Dec 2022
Improving Generalization of Pre-trained Language Models via Stochastic
  Weight Averaging
Improving Generalization of Pre-trained Language Models via Stochastic Weight AveragingConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Peng Lu
I. Kobyzev
Mehdi Rezagholizadeh
Ahmad Rashid
A. Ghodsi
Philippe Langlais
MoMe
200
12
0
12 Dec 2022
Adversarial Weight Perturbation Improves Generalization in Graph Neural
  Networks
Adversarial Weight Perturbation Improves Generalization in Graph Neural NetworksAAAI Conference on Artificial Intelligence (AAAI), 2022
Yihan Wu
Aleksandar Bojchevski
Heng Huang
AAML
338
35
0
09 Dec 2022
Beyond Losses Reweighting: Empowering Multi-Task Learning via the Generalization Perspective
Beyond Losses Reweighting: Empowering Multi-Task Learning via the Generalization Perspective
Hoang Phan
Lam C. Tran
Ngoc N. Tran
Nhat Ho
Tuan Truong
Qi Lei
Nhat Ho
Dinh Q. Phung
Trung Le
703
13
0
24 Nov 2022
Efficient Generalization Improvement Guided by Random Weight
  Perturbation
Efficient Generalization Improvement Guided by Random Weight Perturbation
Tao Li
Wei Yan
Zehao Lei
Yingwen Wu
Kun Fang
Ming-Hsuan Yang
Xiaolin Huang
AAML
140
8
0
21 Nov 2022
SAMSON: Sharpness-Aware Minimization Scaled by Outlier Normalization for
  Improving DNN Generalization and Robustness
SAMSON: Sharpness-Aware Minimization Scaled by Outlier Normalization for Improving DNN Generalization and Robustness
Gonçalo Mordido
Sébastien Henwood
Sarath Chandar
Franccois Leduc-Primeau
AAML
172
1
0
18 Nov 2022
How Does Sharpness-Aware Minimization Minimize Sharpness?
How Does Sharpness-Aware Minimization Minimize Sharpness?
Kaiyue Wen
Tengyu Ma
Zhiyuan Li
AAML
275
61
0
10 Nov 2022
Learning Cross-view Geo-localization Embeddings via Dynamic Weighted
  Decorrelation Regularization
Learning Cross-view Geo-localization Embeddings via Dynamic Weighted Decorrelation RegularizationIEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2022
Ting Wang
Zhedong Zheng
Zunjie Zhu
Yuhan Gao
Yi Yang
Chenggang Yan
161
55
0
10 Nov 2022
Sufficient Invariant Learning for Distribution Shift
Sufficient Invariant Learning for Distribution ShiftComputer Vision and Pattern Recognition (CVPR), 2022
Taero Kim
Sungjun Lim
Kyungwoo Song
Yonghan Jung
Krikamol Muandet
Kyungwoo Song
OOD
360
3
0
24 Oct 2022
K-SAM: Sharpness-Aware Minimization at the Speed of SGD
K-SAM: Sharpness-Aware Minimization at the Speed of SGD
Renkun Ni
Ping Yeh-Chiang
Jonas Geiping
Micah Goldblum
A. Wilson
Tom Goldstein
196
12
0
23 Oct 2022
Rethinking Sharpness-Aware Minimization as Variational Inference
Rethinking Sharpness-Aware Minimization as Variational Inference
Szilvia Ujváry
Zsigmond Telek
A. Kerekes
Anna Mészáros
Ferenc Huszár
121
8
0
19 Oct 2022
GA-SAM: Gradient-Strength based Adaptive Sharpness-Aware Minimization
  for Improved Generalization
GA-SAM: Gradient-Strength based Adaptive Sharpness-Aware Minimization for Improved GeneralizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Zhiyuan Zhang
Ruixuan Luo
Qi Su
Xueting Sun
216
17
0
13 Oct 2022
Improving Sharpness-Aware Minimization with Fisher Mask for Better
  Generalization on Language Models
Improving Sharpness-Aware Minimization with Fisher Mask for Better Generalization on Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Qihuang Zhong
Liang Ding
Li Shen
Peng Mi
Juhua Liu
Bo Du
Dacheng Tao
AAML
168
56
0
11 Oct 2022
Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation
  Approach
Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation ApproachNeural Information Processing Systems (NeurIPS), 2022
Peng Mi
Li Shen
Tianhe Ren
Weihao Ye
Xiaoshuai Sun
Rongrong Ji
Dacheng Tao
AAML
270
84
0
11 Oct 2022
SAM as an Optimal Relaxation of Bayes
SAM as an Optimal Relaxation of BayesInternational Conference on Learning Representations (ICLR), 2022
Thomas Möllenhoff
Mohammad Emtiyaz Khan
BDL
273
40
0
04 Oct 2022
Scale-invariant Bayesian Neural Networks with Connectivity Tangent
  Kernel
Scale-invariant Bayesian Neural Networks with Connectivity Tangent KernelInternational Conference on Learning Representations (ICLR), 2022
Sungyub Kim
Si-hun Park
Kyungsu Kim
Eunho Yang
BDL
183
7
0
30 Sep 2022
Relaxed Attention for Transformer Models
Relaxed Attention for Transformer ModelsIEEE International Joint Conference on Neural Network (IJCNN), 2022
Timo Lohrenz
Björn Möller
Zhengyang Li
Tim Fingscheidt
KELM
173
13
0
20 Sep 2022
Bootstrap Generalization Ability from Loss Landscape Perspective
Bootstrap Generalization Ability from Loss Landscape Perspective
Huanran Chen
Shitong Shao
Ziyi Wang
Zirui Shang
Jin Chen
Xiaofeng Ji
Xinxiao Wu
OOD
323
22
0
18 Sep 2022
Towards Bridging the Performance Gaps of Joint Energy-based Models
Towards Bridging the Performance Gaps of Joint Energy-based ModelsComputer Vision and Pattern Recognition (CVPR), 2022
Xiulong Yang
Qing Su
Shihao Ji
VLM
294
18
0
16 Sep 2022
Model Generalization: A Sharpness Aware Optimization Perspective
Model Generalization: A Sharpness Aware Optimization Perspective
Jozef Marus Coldenhoff
Chengkun Li
Yurui Zhu
77
3
0
14 Aug 2022
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep
  Models
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep ModelsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Xingyu Xie
Pan Zhou
Huan Li
Zhouchen Lin
Shuicheng Yan
ODL
432
244
0
13 Aug 2022
Deep is a Luxury We Don't Have
Deep is a Luxury We Don't HaveInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022
Ahmed Taha
Yen Nhi Truong Vu
Brent Mombourquette
Thomas P. Matthews
Jason Su
Sadanand Singh
ViTMedIm
157
3
0
11 Aug 2022
Symmetry Regularization and Saturating Nonlinearity for Robust
  Quantization
Symmetry Regularization and Saturating Nonlinearity for Robust QuantizationEuropean Conference on Computer Vision (ECCV), 2022
Sein Park
Yeongsang Jang
Eunhyeok Park
MQ
137
6
0
31 Jul 2022
CrAM: A Compression-Aware Minimizer
CrAM: A Compression-Aware MinimizerInternational Conference on Learning Representations (ICLR), 2022
Alexandra Peste
Adrian Vladu
Eldar Kurtic
Christoph H. Lampert
Dan Alistarh
272
11
0
28 Jul 2022
PoF: Post-Training of Feature Extractor for Improving Generalization
PoF: Post-Training of Feature Extractor for Improving GeneralizationInternational Conference on Machine Learning (ICML), 2022
Ikuro Sato
Ryota Yamada
Masayuki Tanaka
Nakamasa Inoue
Rei Kawakami
144
5
0
05 Jul 2022
Augment like there's no tomorrow: Consistently performing neural
  networks for medical imaging
Augment like there's no tomorrow: Consistently performing neural networks for medical imaging
J. Pohjonen
Carolin Sturenberg
Atte Fohr
Reija Randén-Brady
L. Luomala
J. Lohi
Esa Pitkanen
A. Rannikko
T. Mirtti
OOD
154
8
0
30 Jun 2022
Understanding and Extending Subgraph GNNs by Rethinking Their Symmetries
Understanding and Extending Subgraph GNNs by Rethinking Their SymmetriesNeural Information Processing Systems (NeurIPS), 2022
Fabrizio Frasca
Beatrice Bevilacqua
Michael M. Bronstein
Haggai Maron
309
153
0
22 Jun 2022
Towards Understanding Sharpness-Aware Minimization
Towards Understanding Sharpness-Aware MinimizationInternational Conference on Machine Learning (ICML), 2022
Maksym Andriushchenko
Nicolas Flammarion
AAML
312
177
0
13 Jun 2022
Fisher SAM: Information Geometry and Sharpness Aware Minimisation
Fisher SAM: Information Geometry and Sharpness Aware MinimisationInternational Conference on Machine Learning (ICML), 2022
Minyoung Kim
Da Li
S. Hu
Timothy M. Hospedales
AAML
293
85
0
10 Jun 2022
Previous
12345
Next