Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time

10 March 2022
Mitchell Wortsman
Gabriel Ilharco
S. Gadre
Rebecca Roelofs
Raphael Gontijo-Lopes
Ari S. Morcos
Hongseok Namkoong
Ali Farhadi
Y. Carmon
Simon Kornblith
Ludwig Schmidt
    MoMe

Papers citing "Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time"

17 / 667 papers shown
Unlocking High-Accuracy Differentially Private Image Classification through Scale
Soham De
Leonard Berrada
Jamie Hayes
Samuel L. Smith
Borja Balle
11
213
0
28 Apr 2022
Learning to Scaffold: Optimizing Model Explanations for Teaching
Patrick Fernandes
Marcos Vinícius Treviso
Danish Pruthi
André F. T. Martins
Graham Neubig
FAtt
17
22
0
22 Apr 2022
NAFSSR: Stereo Image Super-Resolution Using NAFNet
Xiaojie Chu
Liangyu Chen
Wenqing Yu
SupR
6
113
0
19 Apr 2022
Fusing finetuned models for better pretraining
Leshem Choshen
Elad Venezian
Noam Slonim
Yoav Katz
FedML
AI4CE
MoMe
31
86
0
06 Apr 2022
Beyond Separability: Analyzing the Linear Transferability of Contrastive Representations to Related Subpopulations
Jeff Z. HaoChen
Colin Wei
Ananya Kumar
Tengyu Ma
20
37
0
06 Apr 2022
Self-Distribution Distillation: Efficient Uncertainty Estimation
Yassir Fathullah
Mark J. F. Gales
UQCV
14
11
0
15 Mar 2022
CECILIA: Comprehensive Secure Machine Learning Framework
Ali Burak Ünal
Nícolas Pfeifer
Mete Akgun
12
2
0
07 Feb 2022
Problem-dependent attention and effort in neural networks with applications to image resolution and model selection
Chris Rohlfs
14
4
0
05 Jan 2022
Merging Models with Fisher-Weighted Averaging
Michael Matena
Colin Raffel
FedML
MoMe
19
347
0
18 Nov 2021
Exploiting all samples in low-resource sentence classification: early stopping and initialization parameters
Hongseok Choi
Hyunju Lee
10
3
0
12 Nov 2021
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling
Renrui Zhang
Rongyao Fang
Wei Zhang
Peng Gao
Kunchang Li
Jifeng Dai
Yu Qiao
Hongsheng Li
VLM
184
384
0
06 Nov 2021
Learning to Prompt for Vision-Language Models
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
VPVLM
CLIP
VLM
322
2,249
0
02 Sep 2021
The Evolution of Out-of-Distribution Robustness Throughout Fine-Tuning
Anders Andreassen
Yasaman Bahri
Behnam Neyshabur
Rebecca Roelofs
OOD
OODD
8
78
0
30 Jun 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
293
3,683
0
11 Feb 2021
PFGDF: Pruning Filter via Gaussian Distribution Feature for Deep Neural Networks Acceleration
Jianrong Xu
Boyu Diao
Bifeng Cui
Kang Yang
Chao Li
H. Hong
8
4
0
23 Jun 2020
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
294
6,927
0
20 Apr 2018
Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles
Balaji Lakshminarayanan
Alexander Pritzel
Charles Blundell
UQCV
BDL
268
5,635
0
05 Dec 2016