ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2309.15639
  4. Cited By
Enhancing Sharpness-Aware Optimization Through Variance Suppression
v1v2v3 (latest)

Enhancing Sharpness-Aware Optimization Through Variance Suppression

Neural Information Processing Systems (NeurIPS), 2023
27 September 2023
Bingcong Li
G. Giannakis
    AAML
ArXiv (abs)PDFHTML

Papers citing "Enhancing Sharpness-Aware Optimization Through Variance Suppression"

27 / 27 papers shown
Title
Flat Minima and Generalization: Insights from Stochastic Convex Optimization
Flat Minima and Generalization: Insights from Stochastic Convex Optimization
Matan Schliserman
Shira Vansover-Hager
Tomer Koren
52
0
0
05 Nov 2025
AppForge: From Assistant to Independent Developer - Are GPTs Ready for Software Development?
AppForge: From Assistant to Independent Developer - Are GPTs Ready for Software Development?
Dezhi Ran
Yuan Cao
Mengzhou Wu
Simin Chen
Yuzhe Guo
...
Jialei Wei
Linyi Li
Wei Yang
Baishakhi Ray
Tao Xie
LLMAGALMELM
88
0
0
09 Oct 2025
Sharpness-Aware Minimization Can Hallucinate Minimizers
Sharpness-Aware Minimization Can Hallucinate Minimizers
Chanwoong Park
Uijeong Jang
Ernest K. Ryu
Insoon Yang
91
0
0
26 Sep 2025
VASSO: Variance Suppression for Sharpness-Aware Minimization
Bingcong Li
Yilang Zhang
G. Giannakis
216
1
0
02 Sep 2025
Bi-LoRA: Efficient Sharpness-Aware Minimization for Fine-Tuning Large-Scale Models
Bi-LoRA: Efficient Sharpness-Aware Minimization for Fine-Tuning Large-Scale Models
Yuhang Liu
Tao Li
Zhehao Huang
Zuopeng Yang
Xiaolin Huang
68
0
0
27 Aug 2025
Communication-Efficient Distributed Training for Collaborative Flat Optima Recovery in Deep Learning
Communication-Efficient Distributed Training for Collaborative Flat Optima Recovery in Deep Learning
Tolga Dimlioglu
A. Choromańska
FedML
218
1
0
27 Jul 2025
Towards Understanding The Calibration Benefits of Sharpness-Aware Minimization
Towards Understanding The Calibration Benefits of Sharpness-Aware Minimization
C. Tan
Yubo Zhou
Haishan Ye
Guang Dai
Junmin Liu
Zengjie Song
Jiangshe Zhang
Zixiang Zhao
Yunda Hao
Yong Xu
AAML
230
0
0
29 May 2025
Towards Robust Influence Functions with Flat Validation Minima
Towards Robust Influence Functions with Flat Validation Minima
Xichen Ye
Yifan Wu
Weizhong Zhang
Cheng Jin
Yifan Chen
TDI
277
3
0
25 May 2025
RefLoRA: Refactored Low-Rank Adaptation for Efficient Fine-Tuning of Large Models
RefLoRA: Refactored Low-Rank Adaptation for Efficient Fine-Tuning of Large Models
Yilang Zhang
Bingcong Li
G. Giannakis
530
2
0
24 May 2025
Sharpness-Aware Minimization: General Analysis and Improved RatesInternational Conference on Learning Representations (ICLR), 2025
Dimitris Oikonomou
Nicolas Loizou
248
7
0
04 Mar 2025
Preconditioned Sharpness-Aware Minimization: Unifying Analysis and a Novel Learning Algorithm
Preconditioned Sharpness-Aware Minimization: Unifying Analysis and a Novel Learning AlgorithmIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Yilang Zhang
Bingcong Li
G. Giannakis
AAML
179
0
0
11 Jan 2025
Sharpness-Aware Minimization with Adaptive Regularization for Training
  Deep Neural Networks
Sharpness-Aware Minimization with Adaptive Regularization for Training Deep Neural NetworksIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Jinping Zou
Xiaoge Deng
Tao Sun
256
1
0
22 Dec 2024
GAQAT: gradient-adaptive quantization-aware training for domain
  generalization
GAQAT: gradient-adaptive quantization-aware training for domain generalization
Jiacheng Jiang
Yuan Meng
Chen Tang
Han Yu
Qun Li
Zhi Wang
Wenwu Zhu
MQ
221
1
0
07 Dec 2024
Tilted Sharpness-Aware Minimization
Tilted Sharpness-Aware Minimization
Tian Li
Wanrong Zhu
J. Bilmes
225
0
0
30 Oct 2024
On the Crucial Role of Initialization for Matrix Factorization
On the Crucial Role of Initialization for Matrix FactorizationInternational Conference on Learning Representations (ICLR), 2024
Bingcong Li
Liang Zhang
Aryan Mokhtari
Niao He
383
10
0
24 Oct 2024
Implicit Regularization of Sharpness-Aware Minimization for
  Scale-Invariant Problems
Implicit Regularization of Sharpness-Aware Minimization for Scale-Invariant ProblemsNeural Information Processing Systems (NeurIPS), 2024
Bingcong Li
Liang Zhang
Niao He
236
9
0
18 Oct 2024
Combinatorial Multi-armed Bandits: Arm Selection via Group Testing
Combinatorial Multi-armed Bandits: Arm Selection via Group Testing
Arpan Mukherjee
Shashanka Ubaru
K. Murugesan
Karthikeyan Shanmugam
A. Tajer
214
5
0
14 Oct 2024
How Learning Dynamics Drive Adversarially Robust Generalization?
How Learning Dynamics Drive Adversarially Robust Generalization?
Yuelin Xu
Xiao Zhang
AAML
270
1
0
10 Oct 2024
Convergence of Sharpness-Aware Minimization Algorithms using Increasing
  Batch Size and Decaying Learning Rate
Convergence of Sharpness-Aware Minimization Algorithms using Increasing Batch Size and Decaying Learning Rate
Hinata Harada
Hideaki Iiduka
208
1
0
16 Sep 2024
Neighborhood and Global Perturbations Supported SAM in Federated
  Learning: From Local Tweaks To Global Awareness
Neighborhood and Global Perturbations Supported SAM in Federated Learning: From Local Tweaks To Global Awareness
Boyuan Li
Zihao Peng
Yafei Li
Mingliang Xu
Shengbo Chen
Baofeng Ji
Cong Shen
FedML
286
1
0
26 Aug 2024
Efficient Sharpness-Aware Minimization for Molecular Graph Transformer
  Models
Efficient Sharpness-Aware Minimization for Molecular Graph Transformer ModelsInternational Conference on Learning Representations (ICLR), 2024
Yili Wang
Kaixiong Zhou
Ninghao Liu
Ying Wang
Xin Wang
146
12
0
19 Jun 2024
Locally Estimated Global Perturbations are Better than Local
  Perturbations for Federated Sharpness-aware Minimization
Locally Estimated Global Perturbations are Better than Local Perturbations for Federated Sharpness-aware Minimization
Ziqing Fan
Shengchao Hu
Jiangchao Yao
Gang Niu
Ya Zhang
Masashi Sugiyama
Yanfeng Wang
FedML
208
28
0
29 May 2024
Domain-Inspired Sharpness-Aware Minimization Under Domain Shifts
Domain-Inspired Sharpness-Aware Minimization Under Domain Shifts
Ruipeng Zhang
Ziqing Fan
Jiangchao Yao
Ya Zhang
Yanfeng Wang
222
8
0
29 May 2024
Revisiting Random Weight Perturbation for Efficiently Improving
  Generalization
Revisiting Random Weight Perturbation for Efficiently Improving Generalization
Tao Li
Qinghua Tao
Weihao Yan
Zehao Lei
Yingwen Wu
Kun Fang
Mingzhen He
Xiaolin Huang
AAML
305
10
0
30 Mar 2024
Friendly Sharpness-Aware Minimization
Friendly Sharpness-Aware MinimizationComputer Vision and Pattern Recognition (CVPR), 2024
Tao Li
Pan Zhou
Zhengbao He
Xinwen Cheng
Xiaolin Huang
AAML
208
34
0
19 Mar 2024
Momentum-SAM: Sharpness Aware Minimization without Computational Overhead
Momentum-SAM: Sharpness Aware Minimization without Computational Overhead
Marlon Becker
Frederick Altrock
Benjamin Risse
405
8
0
22 Jan 2024
Stabilizing Sharpness-aware Minimization Through A Simple
  Renormalization Strategy
Stabilizing Sharpness-aware Minimization Through A Simple Renormalization Strategy
Chengli Tan
Jiangshe Zhang
Junmin Liu
Yicheng Wang
Yunda Hao
AAML
275
5
0
14 Jan 2024
1