ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1704.00109
  4. Cited By
Snapshot Ensembles: Train 1, get M for free

Snapshot Ensembles: Train 1, get M for free

1 April 2017
Gao Huang
Shouqing Yang
Geoff Pleiss
Zhuang Liu
John E. Hopcroft
Kilian Q. Weinberger
    OODFedMLUQCV
ArXiv (abs)PDFHTML

Papers citing "Snapshot Ensembles: Train 1, get M for free"

50 / 461 papers shown
Fighting Randomness with Randomness: Mitigating Optimisation Instability
  of Fine-Tuning using Delayed Ensemble and Noisy Interpolation
Fighting Randomness with Randomness: Mitigating Optimisation Instability of Fine-Tuning using Delayed Ensemble and Noisy Interpolation
Branislav Pecher
Ján Cegin
Róbert Belanec
Jakub Simko
Ivan Srba
Maria Bielikova
213
1
0
18 Jun 2024
Improving child speech recognition with augmented child-like speech
Improving child speech recognition with augmented child-like speech
Yuanyuan Zhang
Zhengjun Yue
T. Patel
O. Scharenborg
168
11
0
12 Jun 2024
MeGA: Merging Multiple Independently Trained Neural Networks Based on
  Genetic Algorithm
MeGA: Merging Multiple Independently Trained Neural Networks Based on Genetic Algorithm
Daniel Yun
FedMLMoMe
232
1
0
07 Jun 2024
PrevMatch: Revisiting and Maximizing Temporal Knowledge in Semi-Supervised Semantic Segmentation
PrevMatch: Revisiting and Maximizing Temporal Knowledge in Semi-Supervised Semantic Segmentation
Wooseok Shin
Hyun Joon Park
Jin Sob Kim
Sung Won Han
Se Hong Park
Sung Won Han
VLM
224
10
0
31 May 2024
Kernel Semi-Implicit Variational Inference
Kernel Semi-Implicit Variational Inference
Ziheng Cheng
Longlin Yu
Tianyu Xie
Shiyue Zhang
Cheng Zhang
246
7
0
29 May 2024
Probabilistic Contrastive Learning with Explicit Concentration on the
  Hypersphere
Probabilistic Contrastive Learning with Explicit Concentration on the Hypersphere
H. Li
Ouyang Cheng
Tamaz Amiranashvili
Matthew S. Rosen
Bjoern Menze
J. Iglesias
327
1
0
26 May 2024
LoRA-Ensemble: Efficient Uncertainty Modelling for Self-Attention Networks
LoRA-Ensemble: Efficient Uncertainty Modelling for Self-Attention Networks
Michelle Halbheer
Dominik J. Mühlematter
Alexander Becker
Dominik Narnhofer
Helge Aasen
Konrad Schindler
Mehmet Özgür Türkoglu
UQCV
475
11
0
23 May 2024
Exploring and Exploiting the Asymmetric Valley of Deep Neural Networks
Exploring and Exploiting the Asymmetric Valley of Deep Neural Networks
Xin-Chun Li
Jinli Tang
Bo Zhang
Lan Li
De-Chuan Zhan
301
2
0
21 May 2024
Green AI in Action: Strategic Model Selection for Ensembles in Production
Green AI in Action: Strategic Model Selection for Ensembles in Production
Nienke Nijkamp
June Sallou
Niels van der Heijden
Luís Cruz
226
6
0
21 May 2024
Automated Deep Learning for Load Forecasting
Automated Deep Learning for Load Forecasting
Julie Keisler
Sandra Claudel
Gilles Cabriel
Margaux Brégère
AI4TS
202
4
0
14 May 2024
Tiny Deep Ensemble: Uncertainty Estimation in Edge AI Accelerators via
  Ensembling Normalization Layers with Shared Weights
Tiny Deep Ensemble: Uncertainty Estimation in Edge AI Accelerators via Ensembling Normalization Layers with Shared Weights
Soyed Tuhin Ahmed
Michael Hefenbrock
M. Tahoori
UQCV
176
2
0
07 May 2024
Reliable Model Watermarking: Defending Against Theft without
  Compromising on Evasion
Reliable Model Watermarking: Defending Against Theft without Compromising on Evasion
Markus Frey
Sichu Liang
Wentao Hu
Matthias Nau
Ju Jia
Shilin Wang
AAML
295
10
0
21 Apr 2024
DIMAT: Decentralized Iterative Merging-And-Training for Deep Learning
  Models
DIMAT: Decentralized Iterative Merging-And-Training for Deep Learning Models
Nastaran Saadati
Minh Pham
Nasla Saleem
Joshua R. Waite
Aditya Balu
Zhanhong Jiang
Chinmay Hegde
Soumik Sarkar
MoMe
244
5
0
11 Apr 2024
Post-Hoc Reversal: Are We Selecting Models Prematurely?
Post-Hoc Reversal: Are We Selecting Models Prematurely?
Rishabh Ranjan
Saurabh Garg
Mrigank Raman
Carlos Guestrin
Zachary Chase Lipton
233
3
0
11 Apr 2024
UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection
UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection
Yingsen Zeng
Yujie Zhong
Chengjian Feng
Lin Ma
485
14
0
07 Apr 2024
On the Learnability of Out-of-distribution Detection
On the Learnability of Out-of-distribution Detection
Zhen Fang
Shouqing Yang
Yifan Zhang
Bo Han
Jie Lu
223
9
0
07 Apr 2024
BEM: Balanced and Entropy-based Mix for Long-Tailed Semi-Supervised
  Learning
BEM: Balanced and Entropy-based Mix for Long-Tailed Semi-Supervised Learning
Hongwei Zheng
Linyuan Zhou
Han Li
Jinming Su
Xiaoming Wei
Xiaoming Xu
218
9
0
01 Apr 2024
Dual DETRs for Multi-Label Temporal Action Detection
Dual DETRs for Multi-Label Temporal Action Detection
Yuhan Zhu
Guozhen Zhang
Jing Tan
Gangshan Wu
Limin Wang
247
22
0
31 Mar 2024
Stitching for Neuroevolution: Recombining Deep Neural Networks without
  Breaking Them
Stitching for Neuroevolution: Recombining Deep Neural Networks without Breaking Them
Arthur Guijt
D. Thierens
Tanja Alderliesten
Peter A. N. Bosman
191
2
0
21 Mar 2024
Diversity-Aware Agnostic Ensemble of Sharpness Minimizers
Diversity-Aware Agnostic Ensemble of Sharpness Minimizers
Anh-Vu Bui
Vy Vo
Tung Pham
Dinh Q. Phung
Trung Le
FedMLUQCV
254
1
0
19 Mar 2024
DiTMoS: Delving into Diverse Tiny-Model Selection on Microcontrollers
DiTMoS: Delving into Diverse Tiny-Model Selection on MicrocontrollersAnnual IEEE International Conference on Pervasive Computing and Communications (PerCom), 2024
Xiao Ma
Shengfeng He
Hezhe Qiao
Dong-Lai Ma
181
2
0
14 Mar 2024
Bayesian Uncertainty Estimation by Hamiltonian Monte Carlo: Applications
  to Cardiac MRI Segmentation
Bayesian Uncertainty Estimation by Hamiltonian Monte Carlo: Applications to Cardiac MRI Segmentation
Yidong Zhao
João Tourais
Iain Pierce
Christian Nitsche
T. Treibel
Sebastian Weingartner
Artur M. Schweidtmann
Qian Tao
BDLUQCV
336
5
0
04 Mar 2024
A prediction rigidity formalism for low-cost uncertainties in trained
  neural networks
A prediction rigidity formalism for low-cost uncertainties in trained neural networks
Filippo Bigi
Sanggyu Chong
Michele Ceriotti
Federico Grasselli
208
12
0
04 Mar 2024
Training Neural Networks from Scratch with Parallel Low-Rank Adapters
Training Neural Networks from Scratch with Parallel Low-Rank Adapters
Minyoung Huh
Brian Cheung
Jeremy Bernstein
Phillip Isola
Pulkit Agrawal
270
15
0
26 Feb 2024
Forecasting Events in Soccer Matches Through Language
Forecasting Events in Soccer Matches Through Language
Tiago Mendes-Neves
Luís Meireles
João Mendes-Moreira
179
5
0
09 Feb 2024
Towards a Systematic Approach to Design New Ensemble Learning Algorithms
Towards a Systematic Approach to Design New Ensemble Learning Algorithms
João Mendes-Moreira
Tiago Mendes-Neves
FedML
159
2
0
09 Feb 2024
A Bandit Approach with Evolutionary Operators for Model Selection
A Bandit Approach with Evolutionary Operators for Model Selection
Margaux Brégere Lpsm
Julie Keisler
198
1
0
07 Feb 2024
How Good is a Single Basin?
How Good is a Single Basin?International Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Kai Lion
Lorenzo Noci
Thomas Hofmann
Gregor Bachmann
UQCV
177
3
0
05 Feb 2024
eXplainable Bayesian Multi-Perspective Generative Retrieval
eXplainable Bayesian Multi-Perspective Generative Retrieval
EuiYul Song
Philhoon Oh
Sangryul Kim
Hyunjung Shim
BDL
236
0
0
04 Feb 2024
ARGS: Alignment as Reward-Guided Search
ARGS: Alignment as Reward-Guided SearchInternational Conference on Learning Representations (ICLR), 2024
Maxim Khanov
Jirayu Burapacheep
Yixuan Li
435
94
0
23 Jan 2024
Stochastic Subnetwork Annealing: A Regularization Technique for Fine
  Tuning Pruned Subnetworks
Stochastic Subnetwork Annealing: A Regularization Technique for Fine Tuning Pruned Subnetworks
Tim Whitaker
Darrell Whitley
292
1
0
16 Jan 2024
DiffHybrid-UQ: Uncertainty Quantification for Differentiable Hybrid
  Neural Modeling
DiffHybrid-UQ: Uncertainty Quantification for Differentiable Hybrid Neural Modeling
Deepak Akhare
Tengfei Luo
Jian-Xun Wang
230
9
0
30 Dec 2023
FedSDD: Scalable and Diversity-enhanced Distillation for Model
  Aggregation in Federated Learning
FedSDD: Scalable and Diversity-enhanced Distillation for Model Aggregation in Federated Learning
Ho Man Kwan
Shenghui Song
FedML
129
4
0
28 Dec 2023
TinyGSM: achieving >80% on GSM8k with small language models
TinyGSM: achieving >80% on GSM8k with small language models
Bingbin Liu
Sébastien Bubeck
Ronen Eldan
Janardhan Kulkarni
Yuanzhi Li
Anh Nguyen
Rachel A. Ward
Yi Zhang
ALM
251
56
0
14 Dec 2023
Mini-batch Gradient Descent with Buffer
Mini-batch Gradient Descent with Buffer
Haobo Qi
Du Huang
Yingqiu Zhu
Danyang Huang
Hansheng Wang
190
1
0
14 Dec 2023
Federated Full-Parameter Tuning of Billion-Sized Language Models with
  Communication Cost under 18 Kilobytes
Federated Full-Parameter Tuning of Billion-Sized Language Models with Communication Cost under 18 Kilobytes
Zhen Qin
Daoyuan Chen
Bingchen Qian
Bolin Ding
Yaliang Li
Shuiguang Deng
FedML
365
56
0
11 Dec 2023
PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness
PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty AwarenessComputer Vision and Pattern Recognition (CVPR), 2023
Anh-Quan Cao
Angela Dai
Raoul de Charette
UQCV
270
37
0
04 Dec 2023
ADM-Loc: Actionness Distribution Modeling for Point-supervised Temporal
  Action Localization
ADM-Loc: Actionness Distribution Modeling for Point-supervised Temporal Action Localization
Elahe Vahdani
Yingli Tian
185
0
0
27 Nov 2023
MonoProb: Self-Supervised Monocular Depth Estimation with Interpretable
  Uncertainty
MonoProb: Self-Supervised Monocular Depth Estimation with Interpretable UncertaintyIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Rémi Marsal
F. Chabot
Angélique Loesch
William Grolleau
H. Sahbi
MDEUQCV
229
16
0
10 Nov 2023
A comprehensive survey on deep active learning in medical image analysis
A comprehensive survey on deep active learning in medical image analysis
Haoran Wang
Q. Jin
Shiman Li
Siyu Liu
Manning Wang
Zhijian Song
VLM
395
72
0
22 Oct 2023
Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from
  a Parametric Perspective
Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric PerspectiveInternational Conference on Learning Representations (ICLR), 2023
Ming Zhong
Chenxin An
Weizhu Chen
Jiawei Han
Pengcheng He
357
16
0
17 Oct 2023
Relearning Forgotten Knowledge: on Forgetting, Overfit and Training-Free
  Ensembles of DNNs
Relearning Forgotten Knowledge: on Forgetting, Overfit and Training-Free Ensembles of DNNs
Uri Stern
D. Weinshall
CLL
219
0
0
17 Oct 2023
Why Do We Need Weight Decay in Modern Deep Learning?
Why Do We Need Weight Decay in Modern Deep Learning?Neural Information Processing Systems (NeurIPS), 2023
Maksym Andriushchenko
Francesco DÁngelo
Aditya Varre
Nicolas Flammarion
294
57
0
06 Oct 2023
RRR-Net: Reusing, Reducing, and Recycling a Deep Backbone Network
RRR-Net: Reusing, Reducing, and Recycling a Deep Backbone NetworkIEEE International Joint Conference on Neural Network (IJCNN), 2023
Haozhe Sun
Isabelle M Guyon
F. Mohr
Hedi Tabia
CVBM
179
3
0
02 Oct 2023
A Theoretical Analysis of Noise Geometry in Stochastic Gradient Descent
A Theoretical Analysis of Noise Geometry in Stochastic Gradient Descent
Mingze Wang
Lei Wu
399
3
0
01 Oct 2023
LoRA ensembles for large language model fine-tuning
LoRA ensembles for large language model fine-tuning
Xi Wang
Laurence Aitchison
Maja Rudolph
UQCV
475
55
0
29 Sep 2023
A Primer on Bayesian Neural Networks: Review and Debates
A Primer on Bayesian Neural Networks: Review and Debates
Federico Danieli
Konstantinos Pitas
M. Vladimirova
Vincent Fortuin
BDLAAML
281
35
0
28 Sep 2023
Deep Model Fusion: A Survey
Deep Model Fusion: A Survey
Weishi Li
Yong Peng
Miao Zhang
Liang Ding
Han Hu
Li Shen
FedMLMoMe
296
87
0
27 Sep 2023
Adapt then Unlearn: Exploring Parameter Space Semantics for Unlearning in Generative Adversarial Networks
Adapt then Unlearn: Exploring Parameter Space Semantics for Unlearning in Generative Adversarial Networks
Piyush Tiwary
Atri Guha
Subhodip Panda
Prathosh A.P.
MUGAN
402
12
0
25 Sep 2023
Improve Deep Forest with Learnable Layerwise Augmentation Policy
  Schedule
Improve Deep Forest with Learnable Layerwise Augmentation Policy ScheduleIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Hongyu Zhu
Sichu Liang
Wentao Hu
Fangqi Li
Yali Yuan
Shi-Lin Wang
Guang Cheng
122
3
0
16 Sep 2023
Previous
12345...8910
Next