v1v2v3 (latest)

ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks

International Conference on Machine Learning (ICML), 2021

23 February 2021

Papers citing "ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks"

50 / 224 papers shown

Is Aggregation the Only Choice? Federated Learning via Layer-wise Model RecombinationKnowledge Discovery and Data Mining (KDD), 2023

Ming Hu

Yang Liu

219

18 May 2023

Sharpness & Shift-Aware Self-Supervised Learning

Tung Pham

Trung Le

192

17 May 2023

CrAFT: Compression-Aware Fine-Tuning for Efficient Visual Task Adaptation

216

08 May 2023

Venn Diagram Multi-label Class Interpretation of Diabetic Foot Ulcer with Color and Sharpness Enhancement

M. Hasan

Moi Hoon Yap

M. Hasan

176

01 May 2023

Towards the Flatter Landscape and Better Generalization in Federated Learning under Client-level Differential PrivacyIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

Li Shen

265

01 May 2023

An Adaptive Policy to Employ Sharpness-Aware MinimizationInternational Conference on Learning Representations (ICLR), 2023

271

28 Apr 2023

Enhancing Fine-Tuning Based Backdoor Defense with Sharpness-Aware MinimizationIEEE International Conference on Computer Vision (ICCV), 2023

209

24 Apr 2023

Going Further: Flatness at the Rescue of Early Stopping for Adversarial Example Transferability

237

05 Apr 2023

Sample4Geo: Hard Negative Sampling For Cross-View Geo-LocalisationIEEE International Conference on Computer Vision (ICCV), 2023

Fabian Deuser

Konrad Habel

Norbert Oswald

266

124

21 Mar 2023

Make Landscape Flatter in Differentially Private Federated LearningComputer Vision and Pattern Recognition (CVPR), 2023

Li Shen

213

20 Mar 2023

Rethinking Model Ensemble in Transfer-based Adversarial AttacksInternational Conference on Learning Representations (ICLR), 2023

Yinpeng Dong

366

16 Mar 2023

Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves GeneralizationComputer Vision and Pattern Recognition (CVPR), 2023

Peng Cui

326

03 Mar 2023

AdaSAM: Boosting Sharpness-Aware Minimization with Adaptive Learning Rate and Momentum for Training Deep Neural NetworksNeural Networks (Neural Netw.), 2023

Li Shen

Liang Ding

163

01 Mar 2023

Towards Stable Test-Time Adaptation in Dynamic Wild WorldInternational Conference on Learning Representations (ICLR), 2023

398

399

24 Feb 2023

mSAM: Micro-Batch-Averaged Sharpness-Aware Minimization

303

19 Feb 2023

Improving Differentiable Architecture Search via Self-DistillationNeural Networks (Neural Netw.), 2023

Xunyu Zhu

Jian Li

Yong Liu

Weiping Wang

295

11 Feb 2023

Flat Seeking Bayesian Neural NetworksNeural Information Processing Systems (NeurIPS), 2023

Van-Anh Nguyen

Trung Le

480

06 Feb 2023

Exploring the Effect of Multi-step Ascent in Sharpness-Aware Minimization

152

27 Jan 2023

An SDE for Modeling SAM: Theory and InsightsInternational Conference on Machine Learning (ICML), 2023

Enea Monzio Compagnoni

Antonio Orvieto

292

19 Jan 2023

Stability Analysis of Sharpness-Aware Minimization

170

16 Jan 2023

GoogLe2Net: Going Transverse with Convolutions

Yuanpeng He

207

01 Jan 2023

A Generalization of ViT/MLP-Mixer to GraphsInternational Conference on Machine Learning (ICML), 2022

Bryan Hooi

243

122

27 Dec 2022

Improving Generalization of Pre-trained Language Models via Stochastic Weight AveragingConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

200

12 Dec 2022

Adversarial Weight Perturbation Improves Generalization in Graph Neural NetworksAAAI Conference on Artificial Intelligence (AAAI), 2022

Yihan Wu

Aleksandar Bojchevski

Heng Huang

AAML

338

09 Dec 2022

Beyond Losses Reweighting: Empowering Multi-Task Learning via the Generalization Perspective

703

24 Nov 2022

Efficient Generalization Improvement Guided by Random Weight Perturbation

140

21 Nov 2022

SAMSON: Sharpness-Aware Minimization Scaled by Outlier Normalization for Improving DNN Generalization and Robustness

Gonçalo Mordido

Sébastien Henwood

Sarath Chandar

Franccois Leduc-Primeau

AAML

172

18 Nov 2022

How Does Sharpness-Aware Minimization Minimize Sharpness?

275

10 Nov 2022

Learning Cross-view Geo-localization Embeddings via Dynamic Weighted Decorrelation RegularizationIEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2022

Chenggang Yan

161

10 Nov 2022

Sufficient Invariant Learning for Distribution ShiftComputer Vision and Pattern Recognition (CVPR), 2022

360

24 Oct 2022

K-SAM: Sharpness-Aware Minimization at the Speed of SGD

196

23 Oct 2022

Rethinking Sharpness-Aware Minimization as Variational Inference

121

19 Oct 2022

GA-SAM: Gradient-Strength based Adaptive Sharpness-Aware Minimization for Improved GeneralizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

216

13 Oct 2022

Improving Sharpness-Aware Minimization with Fisher Mask for Better Generalization on Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Qihuang Zhong

Liang Ding

Li Shen

Bo Du

168

11 Oct 2022

Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation ApproachNeural Information Processing Systems (NeurIPS), 2022

Li Shen

270

11 Oct 2022

SAM as an Optimal Relaxation of BayesInternational Conference on Learning Representations (ICLR), 2022

Thomas Möllenhoff

Mohammad Emtiyaz Khan

BDL

273

04 Oct 2022

Scale-invariant Bayesian Neural Networks with Connectivity Tangent KernelInternational Conference on Learning Representations (ICLR), 2022

183

30 Sep 2022

Relaxed Attention for Transformer ModelsIEEE International Joint Conference on Neural Network (IJCNN), 2022

173

20 Sep 2022

Bootstrap Generalization Ability from Loss Landscape Perspective

323

18 Sep 2022

Towards Bridging the Performance Gaps of Joint Energy-based ModelsComputer Vision and Pattern Recognition (CVPR), 2022

294

16 Sep 2022

Model Generalization: A Sharpness Aware Optimization Perspective

Jozef Marus Coldenhoff

Chengkun Li

Yurui Zhu

14 Aug 2022

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep ModelsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

432

244

13 Aug 2022

Deep is a Luxury We Don't HaveInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022

157

11 Aug 2022

Symmetry Regularization and Saturating Nonlinearity for Robust QuantizationEuropean Conference on Computer Vision (ECCV), 2022

Sein Park

Yeongsang Jang

Eunhyeok Park

137

31 Jul 2022

CrAM: A Compression-Aware MinimizerInternational Conference on Learning Representations (ICLR), 2022

Dan Alistarh

272

28 Jul 2022

PoF: Post-Training of Feature Extractor for Improving GeneralizationInternational Conference on Machine Learning (ICML), 2022

144

05 Jul 2022

Augment like there's no tomorrow: Consistently performing neural networks for medical imaging

154

30 Jun 2022

Understanding and Extending Subgraph GNNs by Rethinking Their SymmetriesNeural Information Processing Systems (NeurIPS), 2022

309

153

22 Jun 2022

Towards Understanding Sharpness-Aware MinimizationInternational Conference on Machine Learning (ICML), 2022

Maksym Andriushchenko

Nicolas Flammarion

AAML

312

177

13 Jun 2022

Fisher SAM: Information Geometry and Sharpness Aware MinimisationInternational Conference on Machine Learning (ICML), 2022

Minyoung Kim

Da Li

S. Hu

Timothy M. Hospedales

AAML

293

10 Jun 2022