Stochastic Weight Averaging Revisited

Applied Sciences (Appl. Sci.), 2022
3 January 2022
Hao Guo
Jiyong Jin
B. Liu

Papers citing "Stochastic Weight Averaging Revisited"

21 / 21 papers shown
Foundational Models and Federated Learning: Survey, Taxonomy, Challenges and Practical Insights
PeerJ Computer Science (PeerJ CS), 2025
Cosmin Hatfaludi
Alex Serban
FedML
84
0
0
05 Sep 2025
PADAM: Parallel averaged Adam reduces the error for stochastic optimization in scientific machine learning
Arnulf Jentzen
Julian Kranz
Adrian Riekert
ODL
179
0
0
28 May 2025
A Model Zoo of Vision Transformers
Damian Falk
Léo Meynent
Florence Pfammatter
Konstantin Schurholt
Damian Borth
352
2
0
14 Apr 2025
Aggregation on Learnable Manifolds for Asynchronous Federated Optimization
Archie Licudi
A. Thakur
Soheila Molaei
Danielle Belgrave
David Clifton
FedML
77
0
0
18 Mar 2025
Enhancing Accuracy and Parameter-Efficiency of Neural Representations for Network Parameterization
Hongjun Choi
Jayaraman J. Thiagarajan
Ruben Glatt
Shusen Liu
195
2
0
29 Jun 2024
Switch EMA: A Free Lunch for Better Flatness and Sharpness
Siyuan Li
Zicheng Liu
Juanxi Tian
Ge Wang
Zedong Wang
...
Cheng Tan
Tao Lin
Yang Liu
Baigui Sun
Stan Z. Li
112
10
0
14 Feb 2024
Interpretable Time Series Models for Wastewater Modeling in Combined Sewer Overflows
Procedia Computer Science (PCS), 2024
Teodor Chiaburu
Felix Bießmann
AI4TS, AI4CE
107
3
0
04 Jan 2024
Relearning Forgotten Knowledge: on Forgetting, Overfit and Training-Free Ensembles of DNNs
Uri Stern
D. Weinshall
CLL
130
0
0
17 Oct 2023
Deep Model Fusion: A Survey
Weishi Li
Yong Peng
Miao Zhang
Liang Ding
Han Hu
Li Shen
FedML, MoMe
197
79
0
27 Sep 2023
The Split Matters: Flat Minima Methods for Improving the Performance of GNNs
International Cross-Domain Conference on Machine Learning and Knowledge Extraction (CD-MAKE), 2023
N. Lell
A. Scherp
144
2
0
15 Jun 2023
A Boosted Model Ensembling Approach to Ball Action Spotting in Videos: The Runner-Up Solution to CVPR'23 SoccerNet Challenge
Luping Wang
Hao Guo
B. Liu
176
3
0
09 Jun 2023
Improving Energy Conserving Descent for Machine Learning: Theory and Practice
G. Luca
Alice Gatti
E. Silverstein
114
1
0
01 Jun 2023
A Survey of Historical Learning: Learning Models with Learning History
Xiang Li
Ge Wu
Lingfeng Yang
Wenzhe Wang
Renjie Song
Jian Yang
MU, AI4TS
149
2
0
23 Mar 2023
REPAIR: REnormalizing Permuted Activations for Interpolation Repair
International Conference on Learning Representations (ICLR), 2022
Keller Jordan
Hanie Sedghi
O. Saukh
R. Entezari
Behnam Neyshabur
MoMe
286
113
0
15 Nov 2022
Stop Wasting My Time! Saving Days of ImageNet and BERT Training with Latest Weight Averaging
Jean Kaddour
MoMe, 3DH
196
47
0
29 Sep 2022
Learning Gradient-based Mixup towards Flatter Minima for Domain Generalization
Danni Peng
Sinno Jialin Pan
125
3
0
29 Sep 2022
Two-Tailed Averaging: Anytime, Adaptive, Once-in-a-While Optimal Weight Averaging for Better Generalization
Gábor Melis
MoMe
172
1
0
26 Sep 2022
Improving Predictive Performance and Calibration by Weight Fusion in Semantic Segmentation
Timo Sämann
A. Hammam
Andrei Bursuc
Christoph Stiller
H. Groß
FedML
105
1
0
22 Jul 2022
Diverse Weight Averaging for Out-of-Distribution Generalization
Neural Information Processing Systems (NeurIPS), 2022
Alexandre Ramé
Matthieu Kirchmeyer
Thibaud Rahier
A. Rakotomamonjy
Patrick Gallinari
Matthieu Cord
OOD
373
153
0
19 May 2022
PFGE: Parsimonious Fast Geometric Ensembling of DNNs
International Conference on Intelligent Computing (ICIC), 2022
Hao Guo
Jiyong Jin
B. Liu
FedML
253
1
0
14 Feb 2022
When Do Flat Minima Optimizers Work?
Neural Information Processing Systems (NeurIPS), 2022
Jean Kaddour
Linqing Liu
Ricardo M. A. Silva
Matt J. Kusner
ODL
370
83
0
01 Feb 2022