ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.10026
  4. Cited By
Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs
v1v2v3v4 (latest)

Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs

Neural Information Processing Systems (NeurIPS), 2018
27 February 2018
T. Garipov
Pavel Izmailov
Dmitrii Podoprikhin
Dmitry Vetrov
A. Wilson
    UQCV
ArXiv (abs)PDFHTML

Papers citing "Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs"

50 / 548 papers shown
Dormant Neural Trojans
Dormant Neural TrojansInternational Conference on Machine Learning and Applications (ICMLA), 2022
Feisi Fu
Panagiota Kiourti
Wenchao Li
AAML
227
0
0
02 Nov 2022
Symmetries, flat minima, and the conserved quantities of gradient flow
Symmetries, flat minima, and the conserved quantities of gradient flowInternational Conference on Learning Representations (ICLR), 2022
Bo Zhao
I. Ganev
Robin Walters
Rose Yu
Nima Dehmamy
374
28
0
31 Oct 2022
A picture of the space of typical learnable tasks
A picture of the space of typical learnable tasksInternational Conference on Machine Learning (ICML), 2022
Rahul Ramesh
Jialin Mao
Itay Griniasty
Rubing Yang
H. Teoh
Mark K. Transtrum
James P. Sethna
Pratik Chaudhari
SSLDRL
422
6
0
31 Oct 2022
Flatter, faster: scaling momentum for optimal speedup of SGD
Flatter, faster: scaling momentum for optimal speedup of SGD
Aditya Cowsik
T. Can
Paolo Glorioso
361
6
0
28 Oct 2022
Exploring Mode Connectivity for Pre-trained Language Models
Exploring Mode Connectivity for Pre-trained Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yujia Qin
Cheng Qian
Jing Yi
Weize Chen
Yankai Lin
Xu Han
Zhiyuan Liu
Maosong Sun
Jie Zhou
226
26
0
25 Oct 2022
Augmentation by Counterfactual Explanation -- Fixing an Overconfident
  Classifier
Augmentation by Counterfactual Explanation -- Fixing an Overconfident ClassifierIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Sumedha Singla
Nihal Murali
Forough Arabshahi
Sofia Triantafyllou
Kayhan Batmanghelich
CML
229
6
0
21 Oct 2022
lo-fi: distributed fine-tuning without communication
lo-fi: distributed fine-tuning without communication
Mitchell Wortsman
Suchin Gururangan
Shen Li
Ali Farhadi
Ludwig Schmidt
Michael G. Rabbat
Ari S. Morcos
349
24
0
19 Oct 2022
Pareto Manifold Learning: Tackling multiple tasks via ensembles of
  single-task models
Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task modelsInternational Conference on Machine Learning (ICML), 2022
Nikolaos Dimitriadis
P. Frossard
Franccois Fleuret
304
29
0
18 Oct 2022
Packed-Ensembles for Efficient Uncertainty Estimation
Packed-Ensembles for Efficient Uncertainty EstimationInternational Conference on Learning Representations (ICLR), 2022
Olivier Laurent
Adrien Lafage
Enzo Tartaglione
Geoffrey Daniel
Jean-Marc Martinez
Andrei Bursuc
Gianni Franchi
OODD
469
43
0
17 Oct 2022
RoS-KD: A Robust Stochastic Knowledge Distillation Approach for Noisy
  Medical Imaging
RoS-KD: A Robust Stochastic Knowledge Distillation Approach for Noisy Medical ImagingIndustrial Conference on Data Mining (IDM), 2022
A. Jaiswal
Kumar Ashutosh
Justin F. Rousseau
Yifan Peng
Zinan Lin
Ying Ding
163
11
0
15 Oct 2022
Mean-field analysis for heavy ball methods: Dropout-stability,
  connectivity, and global convergence
Mean-field analysis for heavy ball methods: Dropout-stability, connectivity, and global convergence
Diyuan Wu
Vyacheslav Kungurtsev
Marco Mondelli
197
3
0
13 Oct 2022
Wasserstein Barycenter-based Model Fusion and Linear Mode Connectivity
  of Neural Networks
Wasserstein Barycenter-based Model Fusion and Linear Mode Connectivity of Neural Networks
A. K. Akash
Sixu Li
Nicolas García Trillos
207
15
0
13 Oct 2022
Deep Combinatorial Aggregation
Deep Combinatorial AggregationNeural Information Processing Systems (NeurIPS), 2022
Yuesong Shen
Zorah Lähner
OODUQCV
155
6
0
12 Oct 2022
Stable and Efficient Adversarial Training through Local Linearization
Stable and Efficient Adversarial Training through Local Linearization
Zhuorong Li
Daiwei Yu
AAML
113
0
0
11 Oct 2022
On the Importance of Calibration in Semi-supervised Learning
On the Importance of Calibration in Semi-supervised Learning
Charlotte Loh
Rumen Dangovski
Shivchander Sudalairaj
Seung-Jun Han
Ligong Han
Leonid Karlinsky
Marin Soljacic
Akash Srivastava
189
7
0
10 Oct 2022
Plateau in Monotonic Linear Interpolation -- A "Biased" View of Loss
  Landscape for Deep Networks
Plateau in Monotonic Linear Interpolation -- A "Biased" View of Loss Landscape for Deep NetworksInternational Conference on Learning Representations (ICLR), 2022
Xiang Wang
Annie Wang
Mo Zhou
Rong Ge
MoMe
520
10
0
03 Oct 2022
Multiple Modes for Continual Learning
Multiple Modes for Continual Learning
Siddhartha Datta
N. Shadbolt
CLLMoMe
214
2
0
29 Sep 2022
Learning Gradient-based Mixup towards Flatter Minima for Domain
  Generalization
Learning Gradient-based Mixup towards Flatter Minima for Domain Generalization
Danni Peng
Sinno Jialin Pan
192
5
0
29 Sep 2022
On Quantum Speedups for Nonconvex Optimization via Quantum Tunneling
  Walks
On Quantum Speedups for Nonconvex Optimization via Quantum Tunneling WalksQuantum (Quantum), 2022
Yizhou Liu
Weijie J. Su
Tongyang Li
288
23
0
29 Sep 2022
Simplifying Model-based RL: Learning Representations, Latent-space
  Models, and Policies with One Objective
Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One ObjectiveInternational Conference on Learning Representations (ICLR), 2022
Raj Ghugare
Homanga Bharadhwaj
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
OffRL
340
28
0
18 Sep 2022
Random initialisations performing above chance and how to find them
Random initialisations performing above chance and how to find them
Frederik Benzing
Simon Schug
Robert Meier
J. Oswald
Yassir Akram
Nicolas Zucchet
Laurence Aitchison
Angelika Steger
ODL
424
28
0
15 Sep 2022
Git Re-Basin: Merging Models modulo Permutation Symmetries
Git Re-Basin: Merging Models modulo Permutation SymmetriesInternational Conference on Learning Representations (ICLR), 2022
Samuel K. Ainsworth
J. Hayase
S. Srinivasa
MoMe
958
422
0
11 Sep 2022
Lottery Pools: Winning More by Interpolating Tickets without Increasing
  Training or Inference Cost
Lottery Pools: Winning More by Interpolating Tickets without Increasing Training or Inference CostAAAI Conference on Artificial Intelligence (AAAI), 2022
Lu Yin
Shiwei Liu
Fang Meng
Tianjin Huang
Vlado Menkovski
Mykola Pechenizkiy
133
14
0
23 Aug 2022
A Unified Analysis of Mixed Sample Data Augmentation: A Loss Function
  Perspective
A Unified Analysis of Mixed Sample Data Augmentation: A Loss Function PerspectiveNeural Information Processing Systems (NeurIPS), 2022
Chanwoo Park
Sangdoo Yun
Sanghyuk Chun
AAML
222
39
0
21 Aug 2022
Quantifying the Knowledge in a DNN to Explain Knowledge Distillation for
  Classification
Quantifying the Knowledge in a DNN to Explain Knowledge Distillation for ClassificationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Quanshi Zhang
Feng He
Yilan Chen
Zhefan Rao
189
45
0
18 Aug 2022
On the generalization of learning algorithms that do not converge
On the generalization of learning algorithms that do not convergeNeural Information Processing Systems (NeurIPS), 2022
N. Chandramoorthy
Andreas Loukas
Khashayar Gatmiry
Stefanie Jegelka
MLT
379
12
0
16 Aug 2022
Uncertainty Quantification for Traffic Forecasting: A Unified Approach
Uncertainty Quantification for Traffic Forecasting: A Unified ApproachIEEE International Conference on Data Engineering (ICDE), 2022
Weizhu Qian
Dalin Zhang
Yan Zhao
Kai Zheng
James Jianqiao Yu
BDLAI4TS
181
33
0
11 Aug 2022
Improving Predictive Performance and Calibration by Weight Fusion in
  Semantic Segmentation
Improving Predictive Performance and Calibration by Weight Fusion in Semantic Segmentation
Timo Sämann
A. Hammam
Andrei Bursuc
Christoph Stiller
H. Groß
FedML
182
1
0
22 Jul 2022
On the Subspace Structure of Gradient-Based Meta-Learning
On the Subspace Structure of Gradient-Based Meta-Learning
Gustaf Tegnér
Alfredo Reichlin
Hang Yin
Mårten Björkman
Danica Kragic
255
0
0
08 Jul 2022
YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for
  real-time object detectors
YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectorsComputer Vision and Pattern Recognition (CVPR), 2022
Chien-Yao Wang
Alexey Bochkovskiy
H. Liao
ObjD
478
9,312
0
06 Jul 2022
Effective training-time stacking for ensembling of deep neural networks
Effective training-time stacking for ensembling of deep neural networksInternational Conference on Artificial Intelligence and Pattern Recognition (AIPR), 2022
P. Proskura
Alexey Zaytsev
96
11
0
27 Jun 2022
Transfer learning for ensembles: reducing computation time and keeping
  the diversity
Transfer learning for ensembles: reducing computation time and keeping the diversityInternational Conference on Artificial Intelligence and Pattern Recognition (AIPR), 2022
Ilya Shashkov
Nikita Balabin
Evgeny Burnaev
Alexey Zaytsev
234
2
0
27 Jun 2022
A Geometric Method for Improved Uncertainty Estimation in Real-time
A Geometric Method for Improved Uncertainty Estimation in Real-timeConference on Uncertainty in Artificial Intelligence (UAI), 2022
Gabriella Chouraqui
L. Cohen
Gil Einziger
Liel Leman
143
0
0
23 Jun 2022
Disentangling Model Multiplicity in Deep Learning
Disentangling Model Multiplicity in Deep Learning
Ari Heljakka
Martin Trapp
Arno Solin
Arno Solin
181
6
0
17 Jun 2022
Geometrically Guided Integrated Gradients
Geometrically Guided Integrated Gradients
Md. Mahfuzur Rahman
N. Lewis
Sergey Plis
FAttAAML
141
0
0
13 Jun 2022
Lottery Tickets on a Data Diet: Finding Initializations with Sparse
  Trainable Networks
Lottery Tickets on a Data Diet: Finding Initializations with Sparse Trainable NetworksNeural Information Processing Systems (NeurIPS), 2022
Mansheej Paul
Brett W. Larsen
Surya Ganguli
Jonathan Frankle
Gintare Karolina Dziugaite
148
24
0
02 Jun 2022
Star algorithm for NN ensembling
Star algorithm for NN ensembling
Sergey Zinchenko
Dmitry Lishudi
FedML
110
0
0
01 Jun 2022
Superposing Many Tickets into One: A Performance Booster for Sparse
  Neural Network Training
Superposing Many Tickets into One: A Performance Booster for Sparse Neural Network TrainingConference on Uncertainty in Artificial Intelligence (UAI), 2022
Lu Yin
Vlado Menkovski
Meng Fang
Tianjin Huang
Yulong Pei
Mykola Pechenizkiy
Decebal Constantin Mocanu
Shiwei Liu
279
9
0
30 May 2022
The Missing Invariance Principle Found -- the Reciprocal Twin of
  Invariant Risk Minimization
The Missing Invariance Principle Found -- the Reciprocal Twin of Invariant Risk MinimizationNeural Information Processing Systems (NeurIPS), 2022
Dongsung Huh
A. Baidya
OOD
144
9
0
29 May 2022
Laplace HypoPINN: Physics-Informed Neural Network for hypocenter
  localization and its predictive uncertainty
Laplace HypoPINN: Physics-Informed Neural Network for hypocenter localization and its predictive uncertainty
M. Izzatullah
I. Yildirim
U. Waheed
T. Alkhalifah
211
20
0
28 May 2022
How Tempering Fixes Data Augmentation in Bayesian Neural Networks
How Tempering Fixes Data Augmentation in Bayesian Neural NetworksInternational Conference on Machine Learning (ICML), 2022
Gregor Bachmann
Lorenzo Noci
Thomas Hofmann
BDLAAML
285
11
0
27 May 2022
Linear Connectivity Reveals Generalization Strategies
Linear Connectivity Reveals Generalization StrategiesInternational Conference on Learning Representations (ICLR), 2022
Jeevesh Juneja
Rachit Bansal
Kyunghyun Cho
João Sedoc
Naomi Saphra
856
55
0
24 May 2022
Quarantine: Sparsity Can Uncover the Trojan Attack Trigger for Free
Quarantine: Sparsity Can Uncover the Trojan Attack Trigger for FreeComputer Vision and Pattern Recognition (CVPR), 2022
Tianlong Chen
Zhenyu Zhang
Yihua Zhang
Shiyu Chang
Sijia Liu
Zinan Lin
AAML
201
28
0
24 May 2022
The Unreasonable Effectiveness of Deep Evidential Regression
The Unreasonable Effectiveness of Deep Evidential RegressionAAAI Conference on Artificial Intelligence (AAAI), 2022
N. Meinert
J. Gawlikowski
Alexander Lavin
UQCVEDL
640
49
0
20 May 2022
Interpolating Compressed Parameter Subspaces
Interpolating Compressed Parameter Subspaces
Siddhartha Datta
N. Shadbolt
234
5
0
19 May 2022
Diverse Weight Averaging for Out-of-Distribution Generalization
Diverse Weight Averaging for Out-of-Distribution GeneralizationNeural Information Processing Systems (NeurIPS), 2022
Alexandre Ramé
Matthieu Kirchmeyer
Thibaud Rahier
A. Rakotomamonjy
Patrick Gallinari
Matthieu Cord
OOD
566
160
0
19 May 2022
ELODI: Ensemble Logit Difference Inhibition for Positive-Congruent
  Training
ELODI: Ensemble Logit Difference Inhibition for Positive-Congruent TrainingIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yue Zhao
Yantao Shen
Yuanjun Xiong
Shuo Yang
Wei Xia
Zhuowen Tu
Bernt Shiele
Stefano Soatto
BDL
229
8
0
12 May 2022
One-shot Federated Learning without Server-side Training
One-shot Federated Learning without Server-side TrainingNeural Networks (NN), 2022
Shangchao Su
Bin Li
Xiangyang Xue
FedML
161
38
0
26 Apr 2022
Federated Geometric Monte Carlo Clustering to Counter Non-IID Datasets
Federated Geometric Monte Carlo Clustering to Counter Non-IID Datasets
Federico Lucchetti
Jérémie Decouchant
Maria Fernandes
L. Chen
Marcus Volp
FedML
134
1
0
23 Apr 2022
A Simple Approach to Adversarial Robustness in Few-shot Image
  Classification
A Simple Approach to Adversarial Robustness in Few-shot Image Classification
Akshayvarun Subramanya
Hamed Pirsiavash
VLM
146
6
0
11 Apr 2022
Previous
123...10116789
Next
Page 7 of 11
Pageof 11