2010.01412
Sharpness-Aware Minimization for Efficiently Improving Generalization
3 October 2020
Pierre Foret
Ariel Kleiner
H. Mobahi
Behnam Neyshabur
AAML
Papers citing
"Sharpness-Aware Minimization for Efficiently Improving Generalization"
50 / 867 papers shown
Title
SASSHA: Sharpness-aware Adaptive Second-order Optimization with Stable Hessian Approximation
Dahun Shin
Dongyeop Lee
Jinseok Chung
Namhoon Lee
ODL
AAML
171
0
0
25 Feb 2025
Class-Conditional Neural Polarizer: A Lightweight and Effective Backdoor Defense by Purifying Poisoned Features
Mingli Zhu
Shaokui Wei
Hongyuan Zha
Baoyuan Wu
AAML
44
0
0
23 Feb 2025
DiffFake: Exposing Deepfakes using Differential Anomaly Detection
Sotirios Stamnas
Victor Sanchez
43
1
0
22 Feb 2025
High-dimensional manifold of solutions in neural networks: insights from statistical physics
Enrico M. Malatesta
46
4
0
20 Feb 2025
Improving the Stability of GNN Force Field Models by Reducing Feature Correlation
Y. Zeng
Wenlong He
Ihor Vasyltsov
Jiaxin Wei
Ying Zhang
Lin Chen
Yuehua Dai
34
0
0
18 Feb 2025
Cross-Domain Continual Learning for Edge Intelligence in Wireless ISAC Networks
Jingzhi Hu
Xin Li
Zhou Su
Jun-Jie Luo
67
0
0
18 Feb 2025
GSQ-Tuning: Group-Shared Exponents Integer in Fully Quantized Training for LLMs On-Device Fine-tuning
Sifan Zhou
Shuo Wang
Zhihang Yuan
Mingjia Shi
Yuzhang Shang
Dawei Yang
ALM
MQ
85
0
0
18 Feb 2025
Linear Mode Connectivity in Differentiable Tree Ensembles
Ryuichi Kanoh
M. Sugiyama
69
1
0
17 Feb 2025
Do we really have to filter out random noise in pre-training data for language models?
Jinghan Ru
Yuxin Xie
Xianwei Zhuang
Yuguo Yin
Zhihui Guo
Zhiming Liu
Qianli Ren
Yuexian Zou
83
2
0
10 Feb 2025
MedConv: Convolutions Beat Transformers on Long-Tailed Bone Density Prediction
Xuyin Qi
Zeyu Zhang
Huazhan Zheng
Mingxi Chen
Numan Kutaiba
...
Hongtao Mao
Y. Li
Zhibin Liao
Yang Zhao
Minh Nguyen Nhat To
MedIm
46
7
0
02 Feb 2025
Memory-Efficient Fine-Tuning of Transformers via Token Selection
Antoine Simoulin
Namyong Park
Xiaoyi Liu
Grey Yang
110
0
0
31 Jan 2025
QCS: Feature Refining from Quadruplet Cross Similarity for Facial Expression Recognition
C. Wang
Li Chen
Lili Wang
Zhaofan Li
Xuebin Lv
78
1
0
28 Jan 2025
With Great Backbones Comes Great Adversarial Transferability
Erik Arakelyan
Karen Hambardzumyan
Davit Papikyan
Pasquale Minervini
Albert Gordo
Isabelle Augenstein
Aram H. Markosyan
AAML
65
0
0
21 Jan 2025
Elucidating the Design Space of Dataset Condensation
Shitong Shao
Zikai Zhou
Huanran Chen
Zhiqiang Shen
DD
54
7
0
20 Jan 2025
Preconditioned Sharpness-Aware Minimization: Unifying Analysis and a Novel Learning Algorithm
Yilang Zhang
Bingcong Li
G. Giannakis
AAML
39
0
0
11 Jan 2025
Weber-Fechner Law in Temporal Difference learning derived from Control as Inference
Keiichiro Takahashi
Taisuke Kobayashi
Tomoya Yamanokuchi
Takamitsu Matsubara
26
0
0
31 Dec 2024
Functional Risk Minimization
Ferran Alet
Clement Gehring
Tomás Lozano-Pérez
Kenji Kawaguchi
Joshua B. Tenenbaum
Leslie Pack Kaelbling
OffRL
60
0
0
31 Dec 2024
Memory-Centric Computing: Recent Advances in Processing-in-DRAM
O. Mutlu
Ataberk Olgun
Geraldo F. Oliveira
Ismail Emir Yüksel
44
3
0
26 Dec 2024
Computational Analysis of Yaredawi YeZema Silt in Ethiopian Orthodox Tewahedo Church Chants
Mequanent Argaw Muluneh
Yan-Tsung Peng
Li Su
44
0
0
25 Dec 2024
Towards Unsupervised Model Selection for Domain Adaptive Object Detection
Hengfu Yu
Jinhong Deng
Wen Li
Lixin Duan
40
0
0
23 Dec 2024
Parameter-Efficient Interventions for Enhanced Model Merging
Marcin Osial
Daniel Marczak
Bartosz Zieliński
MoMe
84
1
0
22 Dec 2024
Sharpness-Aware Minimization with Adaptive Regularization for Training Deep Neural Networks
Jinping Zou
Xiaoge Deng
Tao Sun
74
0
0
22 Dec 2024
Grams: Gradient Descent with Adaptive Momentum Scaling
Yang Cao
Xiaoyu Li
Zhao-quan Song
ODL
87
2
0
22 Dec 2024
SSE-SAM: Balancing Head and Tail Classes Gradually through Stage-Wise SAM
Xingyu Lyu
Qianqian Xu
Zhiyong Yang
Shaojie Lyu
Qingming Huang
74
0
0
18 Dec 2024
Seeking Consistent Flat Minima for Better Domain Generalization via Refining Loss Landscapes
Aodi Li
Liansheng Zhuang
Xiao Long
Minghong Yao
Shafei Wang
180
0
0
18 Dec 2024
The Impact of Generalization Techniques on the Interplay Among Privacy, Utility, and Fairness in Image Classification
Ahmad Hassanpour
Amir Zarei
Khawla Mallat
Anderson Santana de Oliveira
Bian Yang
77
0
0
16 Dec 2024
Meta Curvature-Aware Minimization for Domain Generalization
Z. Chen
Yiwen Ye
Feilong Tang
Yongsheng Pan
Yong-quan Xia
BDL
191
1
0
16 Dec 2024
Towards Understanding the Role of Sharpness-Aware Minimization Algorithms for Out-of-Distribution Generalization
Samuel Schapiro
Han Zhao
71
0
0
06 Dec 2024
Noisy Ostracods: A Fine-Grained, Imbalanced Real-World Dataset for Benchmarking Robust Machine Learning and Label Correction Methods
Jiamian Hu
Yuanyuan Hong
Yihua Chen
He Wang
Moriaki Yasuhara
63
0
0
03 Dec 2024
Federated Motor Imagery Classification for Privacy-Preserving Brain-Computer Interfaces
Tianwang Jia
Lubin Meng
Siyang Li
Jiajing Liu
Dongrui Wu
102
2
0
02 Dec 2024
Task Arithmetic Through The Lens Of One-Shot Federated Learning
Zhixu Tao
I. Mason
Sanjeev R. Kulkarni
Xavier Boix
MoMe
FedML
84
3
0
27 Nov 2024
Reward Fine-Tuning Two-Step Diffusion Models via Learning Differentiable Latent-Space Surrogate Reward
Zhiwei Jia
Yuesong Nan
Huixi Zhao
Gengdai Liu
EGVM
88
0
0
22 Nov 2024
DiM: f-Divergence Minimization Guided Sharpness-Aware Optimization for Semi-supervised Medical Image Segmentation
Bingli Wang
Houcheng Su
Nan Yin
Mengzhu Wang
Li Shen
85
0
0
19 Nov 2024
Reliable Poisoned Sample Detection against Backdoor Attacks Enhanced by Sharpness Aware Minimization
Mingda Zhang
Mingli Zhu
Zihao Zhu
Baoyuan Wu
AAML
76
1
0
18 Nov 2024
Towards Accurate and Efficient Sub-8-Bit Integer Training
Wenjin Guo
Donglai Liu
Weiying Xie
Yunsong Li
Xuefei Ning
Zihan Meng
Shulin Zeng
Jie Lei
Zhenman Fang
Yu Wang
MQ
34
1
0
17 Nov 2024
Enhancing generalization in high energy physics using white-box adversarial attacks
Franck Rothen
Samuel Klein
Matthew Leigh
T. Golling
AAML
31
1
0
14 Nov 2024
Deferred Poisoning: Making the Model More Vulnerable via Hessian Singularization
Yuhao He
Jinyu Tian
Xianwei Zheng
Li Dong
Yuanman Li
L. Zhang
AAML
23
0
0
06 Nov 2024
Adaptive Consensus Gradients Aggregation for Scaled Distributed Training
Yoni Choukroun
Shlomi Azoulay
P. Kisilev
31
0
0
06 Nov 2024
Theoretical characterisation of the Gauss-Newton conditioning in Neural Networks
Jim Zhao
Sidak Pal Singh
Aurélien Lucchi
AI4CE
43
0
0
04 Nov 2024
1st-Order Magic: Analysis of Sharpness-Aware Minimization
Nalin Tiwary
Siddarth Aananth
23
0
0
03 Nov 2024
PSformer: Parameter-efficient Transformer with Segment Attention for Time Series Forecasting
Yanlong Wang
J. Xu
Fei Ma
Shao-Lun Huang
Danny Dongning Sun
Xiao-Ping Zhang
AI4TS
45
1
0
03 Nov 2024
Does the Definition of Difficulty Matter? Scoring Functions and their Role for Curriculum Learning
Simon Rampp
M. Milling
Andreas Triantafyllopoulos
Björn Schuller
31
1
0
01 Nov 2024
Label Noise: Ignorance Is Bliss
Yilun Zhu
Jianxin Zhang
Aditya Gangrade
Clayton Scott
NoLa
34
2
0
31 Oct 2024
DASH: Warm-Starting Neural Network Training in Stationary Settings without Loss of Plasticity
Baekrok Shin
Junsoo Oh
Hanseul Cho
Chulhee Yun
AI4CE
52
1
0
30 Oct 2024
(FL)^2: Overcoming Few Labels in Federated Semi-Supervised Learning
Seungjoo Lee
Thanh-Long V. Le
Jaemin Shin
Sung-Ju Lee
FedML
34
1
0
30 Oct 2024
Reweighting Local Minima with Tilted SAM
Tian Li
Tianyi Zhou
J. Bilmes
33
0
0
30 Oct 2024
A Fresh Look at Generalized Category Discovery through Non-negative Matrix Factorization
Zhong Ji
Simon X. Yang
Jingren Liu
Yanwei Pang
Jungong Han
31
0
0
29 Oct 2024
Improving Visual Prompt Tuning by Gaussian Neighborhood Minimization for Long-Tailed Visual Recognition
Mengke Li
Y. Liu
Yang Lu
Yiqun Zhang
Yiu-ming Cheung
Hui Huang
VLM
33
2
0
28 Oct 2024
Fast Graph Sharpness-Aware Minimization for Enhancing and Accelerating Few-Shot Node Classification
Yihong Luo
Yuhan Chen
Siya Qiu
Yiwei Wang
Chen Zhang
Yan Zhou
Xiaochun Cao
Jing Tang
AAML
30
2
0
22 Oct 2024
Simplicity Bias via Global Convergence of Sharpness Minimization
Khashayar Gatmiry
Zhiyuan Li
Sashank J. Reddi
Stefanie Jegelka
26
1
0
21 Oct 2024