Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2407.03641
Cited By
Learning Scalable Model Soup on a Single GPU: An Efficient Subspace Training Strategy
4 July 2024
Tao Li
Weisen Jiang
Fanghui Liu
X. Huang
James T. Kwok
MoMe
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Learning Scalable Model Soup on a Single GPU: An Efficient Subspace Training Strategy"
4 / 4 papers shown
Title
Friendly Sharpness-Aware Minimization
Tao Li
Pan Zhou
Zhengbao He
Xinwen Cheng
Xiaolin Huang
AAML
33
15
0
19 Mar 2024
WARM: On the Benefits of Weight Averaged Reward Models
Alexandre Ramé
Nino Vieillard
Léonard Hussenot
Robert Dadashi
Geoffrey Cideron
Olivier Bachem
Johan Ferret
92
92
0
22 Jan 2024
An Adaptive Policy to Employ Sharpness-Aware Minimization
Weisen Jiang
Hansi Yang
Yu Zhang
James T. Kwok
AAML
74
31
0
28 Apr 2023
Diverse Weight Averaging for Out-of-Distribution Generalization
Alexandre Ramé
Matthieu Kirchmeyer
Thibaud Rahier
A. Rakotomamonjy
Patrick Gallinari
Matthieu Cord
OOD
186
128
0
19 May 2022
1