2102.11600
Cited By
ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks
International Conference on Machine Learning (ICML), 2021
23 February 2021
Jungmin Kwon
Jeongseop Kim
Hyunseong Park
I. Choi
Papers citing
"ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks"
50 / 224 papers shown
FlatNAS: optimizing Flatness in Neural Architecture Search for Out-of-Distribution Robustness
Matteo Gambella
Fabrizio Pittorino
Manuel Roveri
OOD
332
6
0
29 Feb 2024
Gradient Alignment for Cross-Domain Face Anti-Spoofing
B. Le
Simon S. Woo
CVBM
391
35
0
29 Feb 2024
Effective Gradient Sample Size via Variation Estimation for Accelerating Sharpness aware Minimization
Jiaxin Deng
Junbiao Pang
Baochang Zhang
Tian Wang
212
1
0
24 Feb 2024
Helen: Optimizing CTR Prediction Models with Frequency-wise Hessian Eigenvalue Regularization
Zirui Zhu
Yong Liu
Zangwei Zheng
Huifeng Guo
Yang You
149
0
0
23 Feb 2024
On the Duality Between Sharpness-Aware Minimization and Adversarial Training
Yihao Zhang
Hangzhou He
Jingyu Zhu
Huanran Chen
Yifei Wang
Zeming Wei
AAML
390
24
0
23 Feb 2024
Mirror Gradient: Towards Robust Multimodal Recommender Systems via Exploring Flat Local Minima
Shan Zhong
Zhongzhan Huang
Daifeng Li
Wushao Wen
Jinghui Qin
Guanbin Li
256
21
0
17 Feb 2024
Subgraphormer: Unifying Subgraph GNNs and Graph Transformers via Graph Products
Guy Bar-Shalom
Beatrice Bevilacqua
Haggai Maron
AI4CE
312
10
0
13 Feb 2024
Curvature-Informed SGD via General Purpose Lie-Group Preconditioners
Omead Brandon Pooladzandi
Xi-Lin Li
245
10
0
07 Feb 2024
A Precise Characterization of SGD Stability Using Loss Surface Geometry
International Conference on Learning Representations (ICLR), 2024
Gregory Dexter
Borja Ocejo
S. Keerthi
Aman Gupta
Ayan Acharya
Rajiv Khanna
MLT
246
1
0
22 Jan 2024
Momentum-SAM: Sharpness Aware Minimization without Computational Overhead
Marlon Becker
Frederick Altrock
Benjamin Risse
493
10
0
22 Jan 2024
Stabilizing Sharpness-aware Minimization Through A Simple Renormalization Strategy
Chengli Tan
Jiangshe Zhang
Junmin Liu
Yicheng Wang
Yunda Hao
AAML
312
5
0
14 Jan 2024
ELSA: Partial Weight Freezing for Overhead-Free Sparse Network Deployment
Paniz Halvachi
Alexandra Peste
Dan Alistarh
Christoph H. Lampert
182
0
0
11 Dec 2023
Generalization Bounds for Robust Contrastive Learning: From Theory to Practice
Ngoc N. Tran
Lam C. Tran
Hoang Phan
Anh-Vu Bui
Tung Pham
Toan M. Tran
Dinh Q. Phung
Trung Le
SSL
NoLa
375
0
0
16 Nov 2023
Using Stochastic Gradient Descent to Smooth Nonconvex Functions: Analysis of Implicit Graduated Optimization with Optimal Noise Scheduling
Naoki Sato
Hideaki Iiduka
384
4
0
15 Nov 2023
FlatMatch: Bridging Labeled Data and Unlabeled Data with Cross-Sharpness for Semi-Supervised Learning
Neural Information Processing Systems (NeurIPS), 2023
Zhuo Huang
Li Shen
Jun-chen Yu
Bo Han
Tongliang Liu
FedML
264
35
0
25 Oct 2023
Winning Prize Comes from Losing Tickets: Improve Invariant Learning by Exploring Variant Parameters for Out-of-Distribution Generalization
International Journal of Computer Vision (IJCV), 2023
Zhuo Huang
Muyang Li
Li Shen
Jun-chen Yu
Chen Gong
Bo Han
Tongliang Liu
OOD
291
16
0
25 Oct 2023
Why Does Sharpness-Aware Minimization Generalize Better Than SGD?
Neural Information Processing Systems (NeurIPS), 2023
Zixiang Chen
Junkai Zhang
Yiwen Kou
Xiangning Chen
Cho-Jui Hsieh
Quanquan Gu
319
24
0
11 Oct 2023
Asymmetrically Decentralized Federated Learning
IEEE Transactions on Computers (IEEE Trans. Comput.), 2023
Qinglun Li
Miao Zhang
Nan Yin
Quanjun Yin
Li Shen
FedML
313
6
0
08 Oct 2023
TRAM: Bridging Trust Regions and Sharpness Aware Minimization
International Conference on Learning Representations (ICLR), 2023
Tom Sherborne
Naomi Saphra
Pradeep Dasigi
Hao Peng
375
6
0
05 Oct 2023
A simple connection from loss flatness to compressed neural representations
Shirui Chen
Stefano Recanatesi
E. Shea-Brown
288
0
0
03 Oct 2023
Window-based Model Averaging Improves Generalization in Heterogeneous Federated Learning
Debora Caldarola
Barbara Caputo
Marco Ciccone
FedML
257
8
0
02 Oct 2023
Membership Privacy Risks of Sharpness Aware Minimization
Young In Kim
Pratiksha Agrawal
Johannes O. Royset
Rajiv Khanna
FedML
389
3
0
30 Sep 2023
Sharpness-Aware Teleportation on Riemannian Manifolds
Kenneth Allen
Hoang Nguyen
Haocheng Luo
Ming-Jun Lai
Mehrtash Harandi
Dinh Q. Phung
T. Le
AAML
363
3
0
29 Sep 2023
Enhancing Sharpness-Aware Optimization Through Variance Suppression
Neural Information Processing Systems (NeurIPS), 2023
Bingcong Li
G. Giannakis
AAML
448
34
0
27 Sep 2023
Accelerating Large Batch Training via Gradient Signal to Noise Ratio (GSNR)
Guo-qing Jiang
Jinlong Liu
Zixiang Ding
Lin Guo
W. Lin
AI4CE
221
2
0
24 Sep 2023
Create and Find Flatness: Building Flat Training Spaces in Advance for Continual Learning
European Conference on Artificial Intelligence (ECAI), 2023
Wenhang Shi
Yiren Chen
Zhe Zhao
Wei Lu
Kimmo Yan
Xiaoyong Du
CLL
238
5
0
20 Sep 2023
Gradient constrained sharpness-aware prompt learning for vision-language models
Liangchen Liu
Nannan Wang
Dawei Zhou
Xinbo Gao
Decheng Liu
Xi Yang
Tongliang Liu
VLM
228
3
0
14 Sep 2023
Adversarial Collaborative Filtering for Free
ACM Conference on Recommender Systems (RecSys), 2023
Huiyuan Chen
Xiaoting Li
Vivian Lai
Chin-Chia Michael Yeh
Yujie Fan
Yan Zheng
Mahashweta Das
Hao Yang
AAML
137
8
0
20 Aug 2023
DFedADMM: Dual Constraints Controlled Model Inconsistency for Decentralized Federated Learning
Qinglun Li
Li Shen
Guang-Ming Li
Quanjun Yin
Dacheng Tao
FedML
132
7
0
16 Aug 2023
ImbSAM: A Closer Look at Sharpness-Aware Minimization in Class-Imbalanced Recognition
IEEE International Conference on Computer Vision (ICCV), 2023
Yixuan Zhou
Yi Qu
Xing Xu
Hengtao Shen
142
29
0
15 Aug 2023
G-Mix: A Generalized Mixup Learning Framework Towards Flat Minima
IEEE Transactions on Artificial Intelligence (IEEE TAI), 2023
Xingyu Li
Bo Tang
AAML
214
1
0
07 Aug 2023
Improving Generalization of Adversarial Training via Robust Critical Fine-Tuning
IEEE International Conference on Computer Vision (ICCV), 2023
Lingyao Li
Yongfeng Zhang
Xixu Hu
Xingxu Xie
G. Yang
AAML
171
35
0
01 Aug 2023
Flatness-Aware Minimization for Domain Generalization
IEEE International Conference on Computer Vision (ICCV), 2023
Xingxuan Zhang
Renzhe Xu
Han Yu
Yancheng Dong
Pengfei Tian
Peng Cui
272
35
0
20 Jul 2023
Promoting Exploration in Memory-Augmented Adam using Critical Momenta
Pranshu Malviya
Gonçalo Mordido
A. Baratin
Reza Babanezhad Harikandeh
Jerry Huang
Damien Scieur
Razvan Pascanu
Sarath Chandar
ODL
242
1
0
18 Jul 2023
Sharpness-Aware Graph Collaborative Filtering
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2023
Huiyuan Chen
Chin-Chia Michael Yeh
Yujie Fan
Yan Zheng
Junpeng Wang
Vivian Lai
Mahashweta Das
Hao Yang
171
6
0
18 Jul 2023
FAM: Relative Flatness Aware Minimization
Linara Adilova
Amr Abourayya
Jianning Li
Amin Dada
Henning Petzka
Jan Egger
Jens Kleesiek
Michael Kamp
ODL
182
2
0
05 Jul 2023
Systematic Investigation of Sparse Perturbed Sharpness-Aware Minimization Optimizer
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Peng Mi
Li Shen
Tianhe Ren
Weihao Ye
Tianshuo Xu
Xiaoshuai Sun
Tongliang Liu
Rongrong Ji
Dacheng Tao
AAML
256
2
0
30 Jun 2023
Adaptive Sharpness-Aware Pruning for Robust Sparse Networks
International Conference on Learning Representations (ICLR), 2023
Anna Bair
Hongxu Yin
Maying Shen
Pavlo Molchanov
J. Álvarez
293
17
0
25 Jun 2023
The Inductive Bias of Flatness Regularization for Deep Matrix Factorization
Khashayar Gatmiry
Zhiyuan Li
Ching-Yao Chuang
Sashank J. Reddi
Tengyu Ma
Stefanie Jegelka
ODL
193
13
0
22 Jun 2023
Practical Sharpness-Aware Minimization Cannot Converge All the Way to Optima
Neural Information Processing Systems (NeurIPS), 2023
Dongkuk Si
Chulhee Yun
426
27
0
16 Jun 2023
The Split Matters: Flat Minima Methods for Improving the Performance of GNNs
International Cross-Domain Conference on Machine Learning and Knowledge Extraction (CD-MAKE), 2023
N. Lell
A. Scherp
232
2
0
15 Jun 2023
Tokenization with Factorized Subword Encoding
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
David Samuel
Lilja Øvrelid
192
2
0
13 Jun 2023
Normalization Layers Are All That Sharpness-Aware Minimization Needs
Neural Information Processing Systems (NeurIPS), 2023
Maximilian Mueller
Tiffany J. Vlaar
David Rolnick
Matthias Hein
283
32
0
07 Jun 2023
Optimal Transport Model Distributional Robustness
Neural Information Processing Systems (NeurIPS), 2023
Van-Anh Nguyen
Trung Le
Anh Tuan Bui
Thanh-Toan Do
Dinh Q. Phung
OOD
283
4
0
07 Jun 2023
Multi-Dataset Co-Training with Sharpness-Aware Optimization for Audio Anti-spoofing
Interspeech, 2023
Hye-jin Shim
Jee-weon Jung
Tomi Kinnunen
192
15
0
31 May 2023
Sharpness-Aware Minimization Leads to Low-Rank Features
Neural Information Processing Systems (NeurIPS), 2023
Maksym Andriushchenko
Dara Bahri
H. Mobahi
Nicolas Flammarion
AAML
391
35
0
25 May 2023
Sharpness-Aware Minimization Revisited: Weighted Sharpness as a Regularization Term
Knowledge Discovery and Data Mining (KDD), 2023
Yun Yue
Jiadi Jiang
Zhiling Ye
Ni Gao
Yongchao Liu
Kecheng Zhang
MLAU
ODL
256
20
0
25 May 2023
The Crucial Role of Normalization in Sharpness-Aware Minimization
Neural Information Processing Systems (NeurIPS), 2023
Yan Dai
Kwangjun Ahn
S. Sra
362
27
0
24 May 2023
Towards More Suitable Personalization in Federated Learning via Decentralized Partial Model Training
Yi Shi
Yingqi Liu
Yan Sun
Zihao Lin
Li Shen
Xueqian Wang
Dacheng Tao
FedML
233
13
0
24 May 2023
Improving Convergence and Generalization Using Parameter Symmetries
International Conference on Learning Representations (ICLR), 2023
Bo Zhao
Robert Mansel Gower
Robin Walters
Rose Yu
MoMe
399
22
0
22 May 2023