v1v2v3v4 (latest)

Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs

Neural Information Processing Systems (NeurIPS), 2018

27 February 2018

Dmitry Vetrov

Papers citing "Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs"

50 / 548 papers shown

Dormant Neural TrojansInternational Conference on Machine Learning and Applications (ICMLA), 2022

227

02 Nov 2022

Symmetries, flat minima, and the conserved quantities of gradient flowInternational Conference on Learning Representations (ICLR), 2022

374

31 Oct 2022

A picture of the space of typical learnable tasksInternational Conference on Machine Learning (ICML), 2022

422

31 Oct 2022

Flatter, faster: scaling momentum for optimal speedup of SGD

Aditya Cowsik

T. Can

Paolo Glorioso

361

28 Oct 2022

Exploring Mode Connectivity for Pre-trained Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Yankai Lin

Xu Han

Zhiyuan Liu

Maosong Sun

Jie Zhou

226

25 Oct 2022

Augmentation by Counterfactual Explanation -- Fixing an Overconfident ClassifierIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

Kayhan Batmanghelich

229

21 Oct 2022

lo-fi: distributed fine-tuning without communication

349

19 Oct 2022

Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task modelsInternational Conference on Machine Learning (ICML), 2022

Nikolaos Dimitriadis

P. Frossard

Franccois Fleuret

304

18 Oct 2022

Packed-Ensembles for Efficient Uncertainty EstimationInternational Conference on Learning Representations (ICLR), 2022

469

17 Oct 2022

RoS-KD: A Robust Stochastic Knowledge Distillation Approach for Noisy Medical ImagingIndustrial Conference on Data Mining (IDM), 2022

163

15 Oct 2022

Mean-field analysis for heavy ball methods: Dropout-stability, connectivity, and global convergence

Diyuan Wu

Vyacheslav Kungurtsev

Marco Mondelli

197

13 Oct 2022

Wasserstein Barycenter-based Model Fusion and Linear Mode Connectivity of Neural Networks

A. K. Akash

Sixu Li

Nicolas García Trillos

207

13 Oct 2022

Deep Combinatorial AggregationNeural Information Processing Systems (NeurIPS), 2022

Yuesong Shen

Zorah Lähner

OOD UQCV

155

12 Oct 2022

Stable and Efficient Adversarial Training through Local Linearization

Zhuorong Li

Daiwei Yu

AAML

113

11 Oct 2022

On the Importance of Calibration in Semi-supervised Learning

Charlotte Loh

Rumen Dangovski

Shivchander Sudalairaj

Akash Srivastava

189

10 Oct 2022

Plateau in Monotonic Linear Interpolation -- A "Biased" View of Loss Landscape for Deep NetworksInternational Conference on Learning Representations (ICLR), 2022

520

03 Oct 2022

Multiple Modes for Continual Learning

Siddhartha Datta

N. Shadbolt

CLL MoMe

214

29 Sep 2022

Learning Gradient-based Mixup towards Flatter Minima for Domain Generalization

Danni Peng

Sinno Jialin Pan

192

29 Sep 2022

On Quantum Speedups for Nonconvex Optimization via Quantum Tunneling WalksQuantum (Quantum), 2022

Yizhou Liu

Weijie J. Su

Tongyang Li

288

29 Sep 2022

Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One ObjectiveInternational Conference on Learning Representations (ICLR), 2022

Homanga Bharadhwaj

340

18 Sep 2022

Random initialisations performing above chance and how to find them

424

15 Sep 2022

Git Re-Basin: Merging Models modulo Permutation SymmetriesInternational Conference on Learning Representations (ICLR), 2022

958

422

11 Sep 2022

Lottery Pools: Winning More by Interpolating Tickets without Increasing Training or Inference CostAAAI Conference on Artificial Intelligence (AAAI), 2022

Lu Yin

Vlado Menkovski

133

23 Aug 2022

A Unified Analysis of Mixed Sample Data Augmentation: A Loss Function PerspectiveNeural Information Processing Systems (NeurIPS), 2022

222

21 Aug 2022

Quantifying the Knowledge in a DNN to Explain Knowledge Distillation for ClassificationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

189

18 Aug 2022

On the generalization of learning algorithms that do not convergeNeural Information Processing Systems (NeurIPS), 2022

379

16 Aug 2022

Uncertainty Quantification for Traffic Forecasting: A Unified ApproachIEEE International Conference on Data Engineering (ICDE), 2022

181

11 Aug 2022

Improving Predictive Performance and Calibration by Weight Fusion in Semantic Segmentation

182

22 Jul 2022

On the Subspace Structure of Gradient-Based Meta-Learning

Danica Kragic

255

08 Jul 2022

YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectorsComputer Vision and Pattern Recognition (CVPR), 2022

478

9,312

06 Jul 2022

Effective training-time stacking for ensembling of deep neural networksInternational Conference on Artificial Intelligence and Pattern Recognition (AIPR), 2022

P. Proskura

Alexey Zaytsev

27 Jun 2022

Transfer learning for ensembles: reducing computation time and keeping the diversityInternational Conference on Artificial Intelligence and Pattern Recognition (AIPR), 2022

Ilya Shashkov

Nikita Balabin

Evgeny Burnaev

Alexey Zaytsev

234

27 Jun 2022

A Geometric Method for Improved Uncertainty Estimation in Real-timeConference on Uncertainty in Artificial Intelligence (UAI), 2022

143

23 Jun 2022

Disentangling Model Multiplicity in Deep Learning

Ari Heljakka

Martin Trapp

Arno Solin

181

17 Jun 2022

Geometrically Guided Integrated Gradients

141

13 Jun 2022

Lottery Tickets on a Data Diet: Finding Initializations with Sparse Trainable NetworksNeural Information Processing Systems (NeurIPS), 2022

Gintare Karolina Dziugaite

148

02 Jun 2022

Star algorithm for NN ensembling

Sergey Zinchenko

Dmitry Lishudi

FedML

110

01 Jun 2022

Superposing Many Tickets into One: A Performance Booster for Sparse Neural Network TrainingConference on Uncertainty in Artificial Intelligence (UAI), 2022

Lu Yin

Vlado Menkovski

Decebal Constantin Mocanu

Shiwei Liu

279

30 May 2022

The Missing Invariance Principle Found -- the Reciprocal Twin of Invariant Risk MinimizationNeural Information Processing Systems (NeurIPS), 2022

Dongsung Huh

A. Baidya

OOD

144

29 May 2022

Laplace HypoPINN: Physics-Informed Neural Network for hypocenter localization and its predictive uncertainty

211

28 May 2022

How Tempering Fixes Data Augmentation in Bayesian Neural NetworksInternational Conference on Machine Learning (ICML), 2022

285

27 May 2022

Linear Connectivity Reveals Generalization StrategiesInternational Conference on Learning Representations (ICLR), 2022

856

24 May 2022

Quarantine: Sparsity Can Uncover the Trojan Attack Trigger for FreeComputer Vision and Pattern Recognition (CVPR), 2022

201

24 May 2022

The Unreasonable Effectiveness of Deep Evidential RegressionAAAI Conference on Artificial Intelligence (AAAI), 2022

640

20 May 2022

Interpolating Compressed Parameter Subspaces

Siddhartha Datta

N. Shadbolt

234

19 May 2022

Diverse Weight Averaging for Out-of-Distribution GeneralizationNeural Information Processing Systems (NeurIPS), 2022

566

160

19 May 2022

ELODI: Ensemble Logit Difference Inhibition for Positive-Congruent TrainingIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Yantao Shen

229

12 May 2022

One-shot Federated Learning without Server-side TrainingNeural Networks (NN), 2022

161

26 Apr 2022

Federated Geometric Monte Carlo Clustering to Counter Non-IID Datasets

134

23 Apr 2022

A Simple Approach to Adversarial Robustness in Few-shot Image Classification

Akshayvarun Subramanya

Hamed Pirsiavash

VLM

146

11 Apr 2022