ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.01412
  4. Cited By
Sharpness-Aware Minimization for Efficiently Improving Generalization

Sharpness-Aware Minimization for Efficiently Improving Generalization

3 October 2020
Pierre Foret
Ariel Kleiner
H. Mobahi
Behnam Neyshabur
    AAML
ArXivPDFHTML

Papers citing "Sharpness-Aware Minimization for Efficiently Improving Generalization"

50 / 867 papers shown
Title
Implicit Regularization of Sharpness-Aware Minimization for
  Scale-Invariant Problems
Implicit Regularization of Sharpness-Aware Minimization for Scale-Invariant Problems
Bingcong Li
Liang Zhang
Niao He
41
3
0
18 Oct 2024
Stochastic Gradient Descent Jittering for Inverse Problems: Alleviating
  the Accuracy-Robustness Tradeoff
Stochastic Gradient Descent Jittering for Inverse Problems: Alleviating the Accuracy-Robustness Tradeoff
Peimeng Guan
Mark A. Davenport
28
0
0
18 Oct 2024
Feature Augmentation based Test-Time Adaptation
Feature Augmentation based Test-Time Adaptation
Y. Cho
Youngrae Kim
Junho Yoon
Seunghoon Hong
Dongman Lee
TPM
TTA
32
0
0
18 Oct 2024
Transformer-Based Approaches for Sensor-Based Human Activity
  Recognition: Opportunities and Challenges
Transformer-Based Approaches for Sensor-Based Human Activity Recognition: Opportunities and Challenges
Clayton Frederick Souza Leite
Henry Mauranen
Aziza Zhanabatyrova
Yu Xiao
24
1
0
17 Oct 2024
Sharpness-Aware Black-Box Optimization
Sharpness-Aware Black-Box Optimization
Feiyang Ye
Yueming Lyu
Xuehao Wang
Masashi Sugiyama
Yu-Jie Zhang
Ivor W. Tsang
AAML
42
0
0
16 Oct 2024
Model Balancing Helps Low-data Training and Fine-tuning
Model Balancing Helps Low-data Training and Fine-tuning
Zihang Liu
Y. Hu
Tianyu Pang
Yefan Zhou
Pu Ren
Yaoqing Yang
34
2
0
16 Oct 2024
Is Less More? Exploring Token Condensation as Training-free Test-time Adaptation
Is Less More? Exploring Token Condensation as Training-free Test-time Adaptation
Zixin Wang
Dong Gong
Sen Wang
Zi Huang
Yadan Luo
VLM
34
0
0
16 Oct 2024
Combinatorial Multi-armed Bandits: Arm Selection via Group Testing
Combinatorial Multi-armed Bandits: Arm Selection via Group Testing
Arpan Mukherjee
Shashanka Ubaru
K. Murugesan
Karthikeyan Shanmugam
A. Tajer
39
1
0
14 Oct 2024
Domain-Conditioned Transformer for Fully Test-time Adaptation
Domain-Conditioned Transformer for Fully Test-time Adaptation
Yushun Tang
Shuoshuo Chen
Jiyuan Jia
Yi Zhang
Zhihai He
23
2
0
14 Oct 2024
Sampling from Bayesian Neural Network Posteriors with Symmetric Minibatch Splitting Langevin Dynamics
Sampling from Bayesian Neural Network Posteriors with Symmetric Minibatch Splitting Langevin Dynamics
Daniel Paulin
P. Whalley
Neil K. Chada
B. Leimkuhler
BDL
46
4
0
14 Oct 2024
Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late in Training
Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late in Training
Zhanpeng Zhou
Mingze Wang
Yuchen Mao
Bingrui Li
Junchi Yan
AAML
62
0
0
14 Oct 2024
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement
  Learning
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning
Hojoon Lee
Dongyoon Hwang
Donghu Kim
Hyunseung Kim
Jun Jet Tai
K. Subramanian
Peter R. Wurman
Jaegul Choo
Peter Stone
Takuma Seno
OffRL
62
6
0
13 Oct 2024
S$^4$ST: A Strong, Self-transferable, faSt, and Simple Scale Transformation for Transferable Targeted Attack
S4^44ST: A Strong, Self-transferable, faSt, and Simple Scale Transformation for Transferable Targeted Attack
Yongxiang Liu
Bowen Peng
Li Liu
X. Li
110
0
0
13 Oct 2024
Understanding Adversarially Robust Generalization via Weight-Curvature
  Index
Understanding Adversarially Robust Generalization via Weight-Curvature Index
Yuelin Xu
Xiao Zhang
AAML
29
0
0
10 Oct 2024
Boosting Deep Ensembles with Learning Rate Tuning
Boosting Deep Ensembles with Learning Rate Tuning
Hongpeng Jin
Yanzhao Wu
24
0
0
10 Oct 2024
QT-DoG: Quantization-aware Training for Domain Generalization
QT-DoG: Quantization-aware Training for Domain Generalization
Saqib Javed
Hieu Le
Mathieu Salzmann
OOD
MQ
28
1
0
08 Oct 2024
Leveraging free energy in pretraining model selection for improved
  fine-tuning
Leveraging free energy in pretraining model selection for improved fine-tuning
Michael Munn
Susan Wei
32
0
0
08 Oct 2024
Improving Generalization with Flat Hilbert Bayesian Inference
Improving Generalization with Flat Hilbert Bayesian Inference
Tuan Truong
Quyen Tran
Quan Pham-Ngoc
Nhat Ho
Dinh Q. Phung
Trung Le
18
0
0
05 Oct 2024
Training Over a Distribution of Hyperparameters for Enhanced Performance
  and Adaptability on Imbalanced Classification
Training Over a Distribution of Hyperparameters for Enhanced Performance and Adaptability on Imbalanced Classification
Kelsey Lieberman
Swarna Kamlam Ravindran
Shuai Yuan
Carlo Tomasi
OOD
35
0
0
04 Oct 2024
MONICA: Benchmarking on Long-tailed Medical Image Classification
MONICA: Benchmarking on Long-tailed Medical Image Classification
Lie Ju
Siyuan Yan
Yukun Zhou
Yang Nan
Xiaodan Xing
Peibo Duan
Zongyuan Ge
57
0
0
02 Oct 2024
Fisher Information-based Efficient Curriculum Federated Learning with
  Large Language Models
Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models
Ji Liu
Jiaxiang Ren
Ruoming Jin
Zijie Zhang
Yang Zhou
P. Valduriez
Dejing Dou
FedML
31
1
0
30 Sep 2024
Scalable Fine-tuning from Multiple Data Sources:A First-Order
  Approximation Approach
Scalable Fine-tuning from Multiple Data Sources:A First-Order Approximation Approach
Dongyue Li
Ziniu Zhang
Lu Wang
Hongyang R. Zhang
38
0
0
28 Sep 2024
A-FedPD: Aligning Dual-Drift is All Federated Primal-Dual Learning Needs
A-FedPD: Aligning Dual-Drift is All Federated Primal-Dual Learning Needs
Yan Sun
Li Shen
Dacheng Tao
FedML
25
0
0
27 Sep 2024
KALE-LM: Unleash The Power Of AI For Science Via Knowledge And Logic Enhanced Large Model
KALE-LM: Unleash The Power Of AI For Science Via Knowledge And Logic Enhanced Large Model
Weichen Dai
Yezeng Chen
Zijie Dai
Zhijie Huang
Y. Liu
...
Chengli Zhong
Xinhe Li
Zeyu Wang
Zhuoying Feng
Yi Zhou
35
0
0
27 Sep 2024
Bi-TTA: Bidirectional Test-Time Adapter for Remote Physiological
  Measurement
Bi-TTA: Bidirectional Test-Time Adapter for Remote Physiological Measurement
Haodong Li
Hao Lu
Ying-Cong Chen
30
1
0
25 Sep 2024
Neural Network Plasticity and Loss Sharpness
Neural Network Plasticity and Loss Sharpness
Max Koster
Jude Kukla
18
0
0
25 Sep 2024
PACE: Marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularization
PACE: Marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularization
Yao Ni
Shan Zhang
Piotr Koniusz
143
2
0
25 Sep 2024
Learning Representation for Multitask learning through Self Supervised
  Auxiliary learning
Learning Representation for Multitask learning through Self Supervised Auxiliary learning
Seokwon Shin
Hyungrok Do
Youngdoo Son
SSL
26
1
0
25 Sep 2024
Revisiting Video Quality Assessment from the Perspective of
  Generalization
Revisiting Video Quality Assessment from the Perspective of Generalization
Xinli Yue
Jianhui Sun
Liangchao Yao
Fan Xia
Yuetang Deng
...
Lei Li
Fengyun Rao
Jing Lv
Qian Wang
Lingchen Zhao
MoMe
23
0
0
23 Sep 2024
Flat-LoRA: Low-Rank Adaption over a Flat Loss Landscape
Flat-LoRA: Low-Rank Adaption over a Flat Loss Landscape
Tao Li
Zhengbao He
Yujun Li
Yasheng Wang
Lifeng Shang
X. Huang
51
0
0
22 Sep 2024
UU-Mamba: Uncertainty-aware U-Mamba for Cardiovascular Segmentation
UU-Mamba: Uncertainty-aware U-Mamba for Cardiovascular Segmentation
Ting Yu Tsai
Li Lin
Shu Hu
Connie W. Tsao
Xin Li
Ming-Ching Chang
Hongtu Zhu
Xin Wang
Mamba
43
1
0
22 Sep 2024
DP$^2$-FedSAM: Enhancing Differentially Private Federated Learning
  Through Personalized Sharpness-Aware Minimization
DP2^22-FedSAM: Enhancing Differentially Private Federated Learning Through Personalized Sharpness-Aware Minimization
Zhenxiao Zhang
Yuanxiong Guo
Yanmin Gong
FedML
38
0
0
20 Sep 2024
Bilateral Sharpness-Aware Minimization for Flatter Minima
Bilateral Sharpness-Aware Minimization for Flatter Minima
Jiaxin Deng
Junbiao Pang
Baochang Zhang
Qingming Huang
AAML
110
0
0
20 Sep 2024
Convergence of Sharpness-Aware Minimization Algorithms using Increasing
  Batch Size and Decaying Learning Rate
Convergence of Sharpness-Aware Minimization Algorithms using Increasing Batch Size and Decaying Learning Rate
Hinata Harada
Hideaki Iiduka
30
1
0
16 Sep 2024
Flash STU: Fast Spectral Transform Units
Flash STU: Fast Spectral Transform Units
Y. Isabel Liu
Windsor Nguyen
Yagiz Devre
Evan Dogariu
Anirudha Majumdar
Elad Hazan
AI4TS
72
1
0
16 Sep 2024
COSCO: A Sharpness-Aware Training Framework for Few-shot Multivariate
  Time Series Classification
COSCO: A Sharpness-Aware Training Framework for Few-shot Multivariate Time Series Classification
Jesus Barreda
Ashley Gomez
Ruben Puga
Kaixiong Zhou
Li Zhang
AI4TS
16
0
0
15 Sep 2024
Robust Training of Neural Networks at Arbitrary Precision and Sparsity
Robust Training of Neural Networks at Arbitrary Precision and Sparsity
Chengxi Ye
Grace Chu
Yanfeng Liu
Yichi Zhang
Lukasz Lew
Andrew G. Howard
MQ
27
2
0
14 Sep 2024
HTR-VT: Handwritten Text Recognition with Vision Transformer
HTR-VT: Handwritten Text Recognition with Vision Transformer
Yuting Li
Dexiong Chen
Tinglong Tang
Xi Shen
ViT
21
7
0
13 Sep 2024
TriplePlay: Enhancing Federated Learning with CLIP for Non-IID Data and
  Resource Efficiency
TriplePlay: Enhancing Federated Learning with CLIP for Non-IID Data and Resource Efficiency
Ahmed Imteaj
Md Zarif Hossain
Saika Zaman
Abdur R. Shahid
VLM
21
1
0
09 Sep 2024
WaterMAS: Sharpness-Aware Maximization for Neural Network Watermarking
WaterMAS: Sharpness-Aware Maximization for Neural Network Watermarking
Carl De Sousa Trias
Mihai P. Mitrea
A. Fiandrotti
Marco Cagnazzo
Sumanta Chaudhuri
Enzo Tartaglione
AAML
30
1
0
05 Sep 2024
CLIBE: Detecting Dynamic Backdoors in Transformer-based NLP Models
CLIBE: Detecting Dynamic Backdoors in Transformer-based NLP Models
Rui Zeng
Xi Chen
Yuwen Pu
Xuhong Zhang
Tianyu Du
Shouling Ji
41
2
0
02 Sep 2024
Fisher Information guided Purification against Backdoor Attacks
Fisher Information guided Purification against Backdoor Attacks
Nazmul Karim
Abdullah Al Arafat
Adnan Siraj Rakin
Zhishan Guo
Nazanin Rahnavard
AAML
48
1
0
01 Sep 2024
PSLF: A PID Controller-incorporated Second-order Latent Factor Analysis
  Model for Recommender System
PSLF: A PID Controller-incorporated Second-order Latent Factor Analysis Model for Recommender System
Jialiang Wang
Yan Xia
Ye Yuan
16
0
0
31 Aug 2024
AASIST3: KAN-Enhanced AASIST Speech Deepfake Detection using SSL
  Features and Additional Regularization for the ASVspoof 2024 Challenge
AASIST3: KAN-Enhanced AASIST Speech Deepfake Detection using SSL Features and Additional Regularization for the ASVspoof 2024 Challenge
Kirill Borodin
Vasiliy Kudryavtsev
Dmitrii Korzh
Alexey Efimenko
Grach Mkrtchian
Mikhail Gorodnichev
Oleg Y. Rogov
41
1
0
30 Aug 2024
Towards reliable respiratory disease diagnosis based on cough sounds and
  vision transformers
Towards reliable respiratory disease diagnosis based on cough sounds and vision transformers
Qian Wang
Zhaoyang Bu
Jiaxuan Mao
Wenyu Zhu
Jingya Zhao
Wei Du
Guochao Shi
Min Zhou
Si Chen
Jieming Qu
MedIm
39
0
0
28 Aug 2024
Neighborhood and Global Perturbations Supported SAM in Federated
  Learning: From Local Tweaks To Global Awareness
Neighborhood and Global Perturbations Supported SAM in Federated Learning: From Local Tweaks To Global Awareness
Boyuan Li
Zihao Peng
Yafei Li
Mingliang Xu
Shengbo Chen
Baofeng Ji
Cong Shen
FedML
60
0
0
26 Aug 2024
Dual-Path Adversarial Lifting for Domain Shift Correction in Online
  Test-time Adaptation
Dual-Path Adversarial Lifting for Domain Shift Correction in Online Test-time Adaptation
Yushun Tang
Shuoshuo Chen
Zhihe Lu
Xinchao Wang
Zhihai He
36
1
0
26 Aug 2024
FungiTastic: A multi-modal dataset and benchmark for image categorization
FungiTastic: A multi-modal dataset and benchmark for image categorization
Lukás Picek
Klara Janouskova
Milan Šulc
Jirí Matas
77
1
0
24 Aug 2024
SZU-AFS Antispoofing System for the ASVspoof 5 Challenge
SZU-AFS Antispoofing System for the ASVspoof 5 Challenge
Yuxiong Xu
Jiafeng Zhong
Sengui Zheng
Zefeng Liu
Bin Li
37
2
0
19 Aug 2024
Enhancing Adversarial Transferability with Adversarial Weight Tuning
Enhancing Adversarial Transferability with Adversarial Weight Tuning
Jiahao Chen
Zhou Feng
Rui Zeng
Yuwen Pu
Chunyi Zhou
Yi Jiang
Yuyou Gan
Jinbao Li
Shouling Ji
AAML
35
0
0
18 Aug 2024
Previous
123456...161718
Next