v1v2v3v4 (latest)

Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs

Neural Information Processing Systems (NeurIPS), 2018

27 February 2018

Dmitry Vetrov

Papers citing "Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs"

50 / 548 papers shown

Deep learning, stochastic gradient descent and diffusion mapsJournal of Computational Mathematics and Data Science (JCMDS), 2022

Carmina Fjellström

Kaj Nyström

DiffM

223

04 Apr 2022

On Uncertainty, Tempering, and Data Augmentation in Bayesian ClassificationNeural Information Processing Systems (NeurIPS), 2022

Sanyam Kapoor

240

30 Mar 2022

Improving Generalization in Federated Learning by Seeking Flat MinimaEuropean Conference on Computer Vision (ECCV), 2022

377

139

22 Mar 2022

A Local Convergence Theory for the Stochastic Gradient Descent Method in Non-Convex Optimization With Non-isolated Local Minima

Tae-Eon Ko

Xiantao Li

212

21 Mar 2022

Self-Ensemble Adversarial Training for Improved RobustnessInternational Conference on Learning Representations (ICLR), 2022

Hongjun Wang

Yisen Wang

OOD AAML

241

18 Mar 2022

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference timeInternational Conference on Machine Learning (ICML), 2022

Raphael Gontijo-Lopes

...

728

1,290

10 Mar 2022

Low-Loss Subspace Compression for Clean Gains against Multi-Agent Backdoor Attacks

Siddhartha Datta

N. Shadbolt

AAML

218

07 Mar 2022

Embedded Ensembles: Infinite Width Limit and Operating RegimesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022

Alexey Zaytsev

138

24 Feb 2022

Prune and Tune Ensembles: Low-Cost Ensemble Learning With Sparse Independent SubnetworksAAAI Conference on Artificial Intelligence (AAAI), 2022

Tim Whitaker

L. D. Whitley

UQCV

167

23 Feb 2022

Continual Learning Beyond a Single Model

361

20 Feb 2022

Ensemble Learning techniques for object detection in high-resolution satellite images

16 Feb 2022

PFGE: Parsimonious Fast Geometric Ensembling of DNNsInternational Conference on Intelligent Computing (ICIC), 2022

437

14 Feb 2022

Deep Networks on Toroids: Removing Symmetries Reveals the Structure of Flat Regions in the Landscape GeometryInternational Conference on Machine Learning (ICML), 2022

501

07 Feb 2022

Evaluating natural language processing models with generalization metrics that do not need access to any training or testing data

327

06 Feb 2022

When Do Flat Minima Optimizers Work?Neural Information Processing Systems (NeurIPS), 2022

526

01 Feb 2022

Learning Proximal Operators to Discover Multiple OptimaInternational Conference on Learning Representations (ICLR), 2022

333

28 Jan 2022

Improving robustness and calibration in ensembles with diversity regularizationGerman Conference on Pattern Recognition (GCPR), 2022

124

26 Jan 2022

Generalization in Supervised Learning Through Riemannian Contraction

L. Kozachkov

Patrick M. Wensing

Jean-Jacques E. Slotine

MLT

252

17 Jan 2022

Complexity from Adaptive-Symmetries Breaking: Global Minima in the Statistical Mechanics of Deep Neural Networks

Shaun Li

AI4CE

229

03 Jan 2022

Stochastic Weight Averaging RevisitedApplied Sciences (Appl. Sci.), 2022

Hao Guo

Jiyong Jin

B. Liu

376

03 Jan 2022

Representation Topology Divergence: A Method for Comparing Neural Network RepresentationsInternational Conference on Machine Learning (ICML), 2021

281

31 Dec 2021

SAE: Sequential Anchored Ensembles

Arnaud Delaunoy

Gilles Louppe

UQCV BDL

176

30 Dec 2021

MVDG: A Unified Multi-view Framework for Domain GeneralizationEuropean Conference on Computer Vision (ECCV), 2021

Jian Zhang

Lei Qi

Yinghuan Shi

Yang Gao

262

23 Dec 2021

Hypernet-Ensemble Learning of Segmentation Probability for Medical Image Segmentation with Ambiguous Labels

195

13 Dec 2021

Efficient Self-Ensemble for Semantic SegmentationBritish Machine Vision Conference (BMVC), 2021

291

26 Nov 2021

Backdoor Attack through Frequency Domain

261

22 Nov 2021

MS-nowcasting: Operational Precipitation Nowcasting with Convolutional LSTMs at Microsoft Weather

Haiyu Dong

198

18 Nov 2021

Data Augmentation Can Improve RobustnessNeural Information Processing Systems (NeurIPS), 2021

Sylvestre-Alvise Rebuffi

249

362

09 Nov 2021

Mode connectivity in the loss landscape of parameterized quantum circuitsQuantum Machine Intelligence (QMI), 2021

Kathleen E. Hamilton

E. Lynn

R. Pooser

204

09 Nov 2021

Exponential escape efficiency of SGD from sharp minima in non-stationary regime

Hikaru Ibayashi

Masaaki Imaizumi

291

07 Nov 2021

Mean-field Analysis of Piecewise Linear Solutions for Wide ReLU NetworksJournal of machine learning research (JMLR), 2021

Aleksandr Shevchenko

Vyacheslav Kungurtsev

Marco Mondelli

MLT

288

03 Nov 2021

Deep learning via message passing algorithms based on belief propagation

437

27 Oct 2021

Towards Better Plasticity-Stability Trade-off in Incremental Learning: A Simple Linear Connector

225

15 Oct 2021

What Happens after SGD Reaches Zero Loss? --A Mathematical Framework

Zhiyuan Li

Tianhao Wang

Sanjeev Arora

MLT

353

114

13 Oct 2021

The Role of Permutation Invariance in Linear Mode Connectivity of Neural NetworksInternational Conference on Learning Representations (ICLR), 2021

601

273

12 Oct 2021

Learning a subspace of policies for online adaptation in Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2021

311

11 Oct 2021

Tighter Sparse Approximation Bounds for ReLU Neural Networks

Carles Domingo-Enrich

Youssef Mroueh

309

07 Oct 2021

Which Shortcut Cues Will DNNs Choose? A Study from the Parameter-Space Perspective

1.6K

06 Oct 2021

Prior and Posterior Networks: A Survey on Evidential Deep Learning Methods For Uncertainty Estimation

Dennis Ulmer

Christian Hardmeier

J. Frellsen

BDL UQCV UD EDL PER

335

06 Oct 2021

Boost Neural Networks by Checkpoints

155

03 Oct 2021

A Physics inspired Functional Operator for Model Uncertainty Quantification in the RKHS

Rishabh Singh

José C. Príncipe

222

22 Sep 2021

Connecting Low-Loss Subspace for Personalized Federated Learning

216

16 Sep 2021

SanitAIs: Unsupervised Data Augmentation to Sanitize Trojaned Neural Networks

217

09 Sep 2021

Expressive Power and Loss Surfaces of Deep Learning Models

S. Dube

134

08 Aug 2021

Quantum Continual Learning Overcoming Catastrophic ForgettingChinese Physics Letters (CPL), 2021

Wenjie Jiang

Zhide Lu

D. Deng

227

05 Aug 2021

AdvRush: Searching for Adversarially Robust Neural Architectures

215

03 Aug 2021

Taxonomizing local versus global structure in neural network loss landscapesNeural Information Processing Systems (NeurIPS), 2021

367

23 Jul 2021

Fed-ensemble: Improving Generalization through Model Ensembling in Federated LearningIEEE Transactions on Automation Science and Engineering (T-ASE), 2021

169

21 Jul 2021

The Limiting Dynamics of SGD: Modified Loss, Phase Space Oscillations, and Anomalous DiffusionNeural Computation (Neural Comput.), 2021

D. Kunin

Javier Sagastuy-Breña

510

19 Jul 2021

Structured Directional Pruning via Perturbation Orthogonal Projection

194

12 Jul 2021