Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2102.11600
Cited By
v1
v2
v3 (latest)
ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks
International Conference on Machine Learning (ICML), 2021
23 February 2021
Jungmin Kwon
Jeongseop Kim
Hyunseong Park
I. Choi
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks"
50 / 224 papers shown
Is Aggregation the Only Choice? Federated Learning via Layer-wise Model Recombination
Knowledge Discovery and Data Mining (KDD), 2023
Ming Hu
Zhihao Yue
Zhiwei Ling
Cheng Chen
Yihao Huang
Xian Wei
Xiang Lian
Yang Liu
Xiao He
FedML
219
25
0
18 May 2023
Sharpness & Shift-Aware Self-Supervised Learning
Ngoc N. Tran
S. Duong
Hoang Phan
Tung Pham
Dinh Q. Phung
Trung Le
SSL
192
1
0
17 May 2023
CrAFT: Compression-Aware Fine-Tuning for Efficient Visual Task Adaptation
J. Heo
S. Azizi
A. Fayyazi
Massoud Pedram
216
1
0
08 May 2023
Venn Diagram Multi-label Class Interpretation of Diabetic Foot Ulcer with Color and Sharpness Enhancement
M. Hasan
Moi Hoon Yap
M. Hasan
176
2
0
01 May 2023
Towards the Flatter Landscape and Better Generalization in Federated Learning under Client-level Differential Privacy
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yi Shi
Kang Wei
Li Shen
Yingqi Liu
Xueqian Wang
Bo Yuan
Dacheng Tao
FedML
265
5
0
01 May 2023
An Adaptive Policy to Employ Sharpness-Aware Minimization
International Conference on Learning Representations (ICLR), 2023
Weisen Jiang
Hansi Yang
Yu Zhang
James T. Kwok
AAML
271
43
0
28 Apr 2023
Enhancing Fine-Tuning Based Backdoor Defense with Sharpness-Aware Minimization
IEEE International Conference on Computer Vision (ICCV), 2023
Mingli Zhu
Shaokui Wei
Li Shen
Yanbo Fan
Baoyuan Wu
AAML
209
78
0
24 Apr 2023
Going Further: Flatness at the Rescue of Early Stopping for Adversarial Example Transferability
Martin Gubri
Maxime Cordy
Yves Le Traon
AAML
237
3
1
05 Apr 2023
Sample4Geo: Hard Negative Sampling For Cross-View Geo-Localisation
IEEE International Conference on Computer Vision (ICCV), 2023
Fabian Deuser
Konrad Habel
Norbert Oswald
266
124
0
21 Mar 2023
Make Landscape Flatter in Differentially Private Federated Learning
Computer Vision and Pattern Recognition (CVPR), 2023
Yi Shi
Yingqi Liu
Kang Wei
Li Shen
Xueqian Wang
Dacheng Tao
FedML
213
88
0
20 Mar 2023
Rethinking Model Ensemble in Transfer-based Adversarial Attacks
International Conference on Learning Representations (ICLR), 2023
Huanran Chen
Yichi Zhang
Yinpeng Dong
Xiao Yang
Hang Su
Junyi Zhu
AAML
366
96
0
16 Mar 2023
Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization
Computer Vision and Pattern Recognition (CVPR), 2023
Xingxuan Zhang
Renzhe Xu
Han Yu
Hao Zou
Peng Cui
326
65
0
03 Mar 2023
AdaSAM: Boosting Sharpness-Aware Minimization with Adaptive Learning Rate and Momentum for Training Deep Neural Networks
Neural Networks (Neural Netw.), 2023
Hao Sun
Li Shen
Qihuang Zhong
Liang Ding
Shi-Yong Chen
Jingwei Sun
Jing Li
Guangzhong Sun
Dacheng Tao
163
41
0
01 Mar 2023
Towards Stable Test-Time Adaptation in Dynamic Wild World
International Conference on Learning Representations (ICLR), 2023
Shuaicheng Niu
Jiaxiang Wu
Yifan Zhang
Z. Wen
Yaofo Chen
P. Zhao
Zhuliang Yu
TTA
398
399
0
24 Feb 2023
mSAM: Micro-Batch-Averaged Sharpness-Aware Minimization
Kayhan Behdin
Qingquan Song
Aman Gupta
S. Keerthi
Ayan Acharya
Borja Ocejo
Gregory Dexter
Rajiv Khanna
D. Durfee
Rahul Mazumder
AAML
303
12
0
19 Feb 2023
Improving Differentiable Architecture Search via Self-Distillation
Neural Networks (Neural Netw.), 2023
Xunyu Zhu
Jian Li
Yong Liu
Weiping Wang
295
10
0
11 Feb 2023
Flat Seeking Bayesian Neural Networks
Neural Information Processing Systems (NeurIPS), 2023
Van-Anh Nguyen
L. Vuong
Hoang Phan
Thanh-Toan Do
Dinh Q. Phung
Trung Le
BDL
480
10
0
06 Feb 2023
Exploring the Effect of Multi-step Ascent in Sharpness-Aware Minimization
Hoki Kim
Jinseong Park
Yujin Choi
Woojin Lee
Jaewook Lee
152
10
0
27 Jan 2023
An SDE for Modeling SAM: Theory and Insights
International Conference on Machine Learning (ICML), 2023
Enea Monzio Compagnoni
Luca Biggio
Antonio Orvieto
F. Proske
Hans Kersting
Aurelien Lucchi
292
22
0
19 Jan 2023
Stability Analysis of Sharpness-Aware Minimization
Hoki Kim
Jinseong Park
Yujin Choi
Jaewook Lee
170
15
0
16 Jan 2023
GoogLe2Net: Going Transverse with Convolutions
Yuanpeng He
207
2
0
01 Jan 2023
A Generalization of ViT/MLP-Mixer to Graphs
International Conference on Machine Learning (ICML), 2022
Xiaoxin He
Bryan Hooi
T. Laurent
Adam Perold
Yann LeCun
Xavier Bresson
243
122
0
27 Dec 2022
Improving Generalization of Pre-trained Language Models via Stochastic Weight Averaging
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Peng Lu
I. Kobyzev
Mehdi Rezagholizadeh
Ahmad Rashid
A. Ghodsi
Philippe Langlais
MoMe
200
12
0
12 Dec 2022
Adversarial Weight Perturbation Improves Generalization in Graph Neural Networks
AAAI Conference on Artificial Intelligence (AAAI), 2022
Yihan Wu
Aleksandar Bojchevski
Heng Huang
AAML
338
35
0
09 Dec 2022
Beyond Losses Reweighting: Empowering Multi-Task Learning via the Generalization Perspective
Hoang Phan
Lam C. Tran
Ngoc N. Tran
Nhat Ho
Tuan Truong
Qi Lei
Nhat Ho
Dinh Q. Phung
Trung Le
703
13
0
24 Nov 2022
Efficient Generalization Improvement Guided by Random Weight Perturbation
Tao Li
Wei Yan
Zehao Lei
Yingwen Wu
Kun Fang
Ming-Hsuan Yang
Xiaolin Huang
AAML
140
8
0
21 Nov 2022
SAMSON: Sharpness-Aware Minimization Scaled by Outlier Normalization for Improving DNN Generalization and Robustness
Gonçalo Mordido
Sébastien Henwood
Sarath Chandar
Franccois Leduc-Primeau
AAML
172
1
0
18 Nov 2022
How Does Sharpness-Aware Minimization Minimize Sharpness?
Kaiyue Wen
Tengyu Ma
Zhiyuan Li
AAML
275
61
0
10 Nov 2022
Learning Cross-view Geo-localization Embeddings via Dynamic Weighted Decorrelation Regularization
IEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2022
Ting Wang
Zhedong Zheng
Zunjie Zhu
Yuhan Gao
Yi Yang
Chenggang Yan
161
55
0
10 Nov 2022
Sufficient Invariant Learning for Distribution Shift
Computer Vision and Pattern Recognition (CVPR), 2022
Taero Kim
Sungjun Lim
Kyungwoo Song
Yonghan Jung
Krikamol Muandet
Kyungwoo Song
OOD
360
3
0
24 Oct 2022
K-SAM: Sharpness-Aware Minimization at the Speed of SGD
Renkun Ni
Ping Yeh-Chiang
Jonas Geiping
Micah Goldblum
A. Wilson
Tom Goldstein
196
12
0
23 Oct 2022
Rethinking Sharpness-Aware Minimization as Variational Inference
Szilvia Ujváry
Zsigmond Telek
A. Kerekes
Anna Mészáros
Ferenc Huszár
121
8
0
19 Oct 2022
GA-SAM: Gradient-Strength based Adaptive Sharpness-Aware Minimization for Improved Generalization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Zhiyuan Zhang
Ruixuan Luo
Qi Su
Xueting Sun
216
17
0
13 Oct 2022
Improving Sharpness-Aware Minimization with Fisher Mask for Better Generalization on Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Qihuang Zhong
Liang Ding
Li Shen
Peng Mi
Juhua Liu
Bo Du
Dacheng Tao
AAML
168
56
0
11 Oct 2022
Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach
Neural Information Processing Systems (NeurIPS), 2022
Peng Mi
Li Shen
Tianhe Ren
Weihao Ye
Xiaoshuai Sun
Rongrong Ji
Dacheng Tao
AAML
270
84
0
11 Oct 2022
SAM as an Optimal Relaxation of Bayes
International Conference on Learning Representations (ICLR), 2022
Thomas Möllenhoff
Mohammad Emtiyaz Khan
BDL
273
40
0
04 Oct 2022
Scale-invariant Bayesian Neural Networks with Connectivity Tangent Kernel
International Conference on Learning Representations (ICLR), 2022
Sungyub Kim
Si-hun Park
Kyungsu Kim
Eunho Yang
BDL
183
7
0
30 Sep 2022
Relaxed Attention for Transformer Models
IEEE International Joint Conference on Neural Network (IJCNN), 2022
Timo Lohrenz
Björn Möller
Zhengyang Li
Tim Fingscheidt
KELM
173
13
0
20 Sep 2022
Bootstrap Generalization Ability from Loss Landscape Perspective
Huanran Chen
Shitong Shao
Ziyi Wang
Zirui Shang
Jin Chen
Xiaofeng Ji
Xinxiao Wu
OOD
323
22
0
18 Sep 2022
Towards Bridging the Performance Gaps of Joint Energy-based Models
Computer Vision and Pattern Recognition (CVPR), 2022
Xiulong Yang
Qing Su
Shihao Ji
VLM
294
18
0
16 Sep 2022
Model Generalization: A Sharpness Aware Optimization Perspective
Jozef Marus Coldenhoff
Chengkun Li
Yurui Zhu
77
3
0
14 Aug 2022
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Xingyu Xie
Pan Zhou
Huan Li
Zhouchen Lin
Shuicheng Yan
ODL
432
244
0
13 Aug 2022
Deep is a Luxury We Don't Have
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022
Ahmed Taha
Yen Nhi Truong Vu
Brent Mombourquette
Thomas P. Matthews
Jason Su
Sadanand Singh
ViT
MedIm
157
3
0
11 Aug 2022
Symmetry Regularization and Saturating Nonlinearity for Robust Quantization
European Conference on Computer Vision (ECCV), 2022
Sein Park
Yeongsang Jang
Eunhyeok Park
MQ
137
6
0
31 Jul 2022
CrAM: A Compression-Aware Minimizer
International Conference on Learning Representations (ICLR), 2022
Alexandra Peste
Adrian Vladu
Eldar Kurtic
Christoph H. Lampert
Dan Alistarh
272
11
0
28 Jul 2022
PoF: Post-Training of Feature Extractor for Improving Generalization
International Conference on Machine Learning (ICML), 2022
Ikuro Sato
Ryota Yamada
Masayuki Tanaka
Nakamasa Inoue
Rei Kawakami
144
5
0
05 Jul 2022
Augment like there's no tomorrow: Consistently performing neural networks for medical imaging
J. Pohjonen
Carolin Sturenberg
Atte Fohr
Reija Randén-Brady
L. Luomala
J. Lohi
Esa Pitkanen
A. Rannikko
T. Mirtti
OOD
154
8
0
30 Jun 2022
Understanding and Extending Subgraph GNNs by Rethinking Their Symmetries
Neural Information Processing Systems (NeurIPS), 2022
Fabrizio Frasca
Beatrice Bevilacqua
Michael M. Bronstein
Haggai Maron
309
153
0
22 Jun 2022
Towards Understanding Sharpness-Aware Minimization
International Conference on Machine Learning (ICML), 2022
Maksym Andriushchenko
Nicolas Flammarion
AAML
312
177
0
13 Jun 2022
Fisher SAM: Information Geometry and Sharpness Aware Minimisation
International Conference on Machine Learning (ICML), 2022
Minyoung Kim
Da Li
S. Hu
Timothy M. Hospedales
AAML
293
85
0
10 Jun 2022
Previous
1
2
3
4
5
Next