Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs

Neural Information Processing Systems (NeurIPS), 2018

27 February 2018

Dmitry Vetrov

Papers citing "Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs"

48 / 548 papers shown

A study of local optima for learning feature interactions using neural networks

IEEE International Joint Conference on Neural Networks (IJCNN), 2020

Yangzi Guo

Adrian Barbu

239

11 Feb 2020

SQWA: Stochastic Quantized Weight Averaging for Improving the Generalization Capability of Low-Precision Deep Neural Networks

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Sungho Shin

Yoonho Boo

Wonyong Sung

141

02 Feb 2020

Parameter Space Factorization for Zero-Shot Learning across Tasks and Languages

Transactions of the Association for Computational Linguistics (TACL), 2020

331

30 Jan 2020

The Case for Bayesian Deep Learning

A. Wilson

UQCV BDL OOD

288

121

29 Jan 2020

On Last-Layer Algorithms for Classification: Decoupling Representation from Uncertainty Estimation

169

22 Jan 2020

Stochastic Weight Averaging in Parallel: Large-Batch Training that Generalizes Well

International Conference on Learning Representations (ICLR), 2020

290

07 Jan 2020

Landscape Connectivity and Dropout Stability of SGD Solutions for Over-parameterized Neural Networks

International Conference on Machine Learning (ICML), 2019

Aleksandr Shevchenko

Marco Mondelli

433

20 Dec 2019

Optimization for deep learning: theory and algorithms

Tian Ding

ODL

343

178

19 Dec 2019

Linear Mode Connectivity and the Lottery Ticket Hypothesis

International Conference on Machine Learning (ICML), 2019

Jonathan Frankle

Gintare Karolina Dziugaite

Daniel M. Roy

Michael Carbin

MoMe

799

706

11 Dec 2019

Deep Ensembles: A Loss Landscape Perspective

Stanislav Fort

Huiyi Hu

Balaji Lakshminarayanan

OOD UQCV

445

700

05 Dec 2019

Semi-Supervised Learning for Text Classification by Layer Partitioning

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

Alexander Hanbo Li

A. Sethy

139

26 Nov 2019

Rigging the Lottery: Making All Tickets Winners

International Conference on Machine Learning (ICML), 2019

539

688

25 Nov 2019

Sub-Optimal Local Minima Exist for Neural Networks with Almost All Non-Linear Activations

Tian Ding

Dawei Li

343

04 Nov 2019

Loss Patterns of Neural Networks

Ivan Skorokhodov

Andrey Kravchenko

3DPC

174

09 Oct 2019

Pure and Spurious Critical Points: a Geometric Study of Linear Networks

International Conference on Learning Representations (ICLR), 2019

Matthew Trager

Kathlén Kohn

Joan Bruna

185

03 Oct 2019

Generalization Bounds for Convolutional Neural Networks

Shan Lin

Jingwei Zhang

MLT

139

03 Oct 2019

How noise affects the Hessian spectrum in overparameterized neural networks

Ming-Bo Wei

D. Schwab

259

01 Oct 2019

Lookahead Optimizer: k steps forward, 1 step back

Neural Information Processing Systems (NeurIPS), 2019

Jimmy Ba

491

816

19 Jul 2019

Subspace Inference for Bayesian Deep Learning

Conference on Uncertainty in Artificial Intelligence (UAI), 2019

Dmitry Vetrov

275

155

17 Jul 2019

Towards Understanding Generalization in Gradient-Based Meta-Learning

Simon Guiroy

Vikas Verma

C. Pal

172

16 Jul 2019

Weight-space symmetry in deep networks gives rise to permutation saddles, connected by equal-loss valleys across the loss landscape

298

05 Jul 2019

The Difficulty of Training Sparse Neural Networks

336

108

25 Jun 2019

Homogeneous Vector Capsules Enable Adaptive Gradient Descent in Convolutional Neural Networks

IEEE Access, 2019

Adam Byerly

T. Kalganova

244

20 Jun 2019

Finding the Needle in the Haystack with Convolutions: on the benefits of architectural bias

Neural Information Processing Systems (NeurIPS), 2019

Stéphane d'Ascoli

Levent Sagun

Joan Bruna

Giulio Biroli

183

16 Jun 2019

Explaining Landscape Connectivity of Low-cost Solutions for Multilayer Nets

Neural Information Processing Systems (NeurIPS), 2019

424

101

14 Jun 2019

Large Scale Structure of Neural Network Loss Landscapes

Neural Information Processing Systems (NeurIPS), 2019

Stanislav Fort

Stanislaw Jastrzebski

230

11 Jun 2019

A Direct Approach to Robust Deep Learning Using Adversarial Networks

International Conference on Learning Representations (ICLR), 2019

Huaxia Wang

Chun-Nam Yu

GAN AAML OOD

167

23 May 2019

Budgeted Training: Rethinking Deep Neural Network Training Under Resource Constraints

International Conference on Learning Representations (ICLR), 2019

Mengtian Li

Ersin Yumer

Deva Ramanan

258

12 May 2019

A Generative Model for Sampling High-Performance and Diverse Weights for Neural Networks

Lior Deutsch

Erik Nijkamp

Yu Yang

118

07 May 2019

Ensemble Distribution Distillation

International Conference on Learning Representations (ICLR), 2019

527

263

30 Apr 2019

Uniform convergence may be unable to explain generalization in deep learning

Neural Information Processing Systems (NeurIPS), 2019

Vaishnavh Nagarajan

J. Zico Kolter

MoMe AI4CE

436

336

13 Feb 2019

Cyclical Stochastic Gradient MCMC for Bayesian Deep Learning

International Conference on Learning Representations (ICLR), 2019

Jianyi Zhang

289

291

11 Feb 2019

A Simple Baseline for Bayesian Uncertainty in Deep Learning

Dmitry Vetrov

753

913

07 Feb 2019

Asymmetric Valleys: Beyond Sharp and Flat Local Minima

Neural Information Processing Systems (NeurIPS), 2019

Haowei He

Gao Huang

Yang Yuan

ODL MLT

271

158

02 Feb 2019

Loss Landscapes of Regularized Linear Autoencoders

368

23 Jan 2019

On Connected Sublevel Sets in Deep Learning

Quynh N. Nguyen

332

106

22 Jan 2019

Enhancing Discrete Choice Models with Representation Learning

Brian Sifringer

Virginie Lurkin

Alexandre Alahi

23 Dec 2018

Projected BNNs: Avoiding weight-space pathologies by learning latent representations of neural network weights

Finale Doshi-Velez

228

16 Nov 2018

A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation

Akhilesh Deepak Gotmare

251

304

29 Oct 2018

Good Initializations of Variational Bayes for Deep Models

332

18 Oct 2018

MotherNets: Rapid Deep Ensemble Learning

174

12 Sep 2018

Make (Nearly) Every Neural Network Better: Generating Neural Network Ensembles by Weight Parameter Resampling

104

02 Jul 2018

Using Mode Connectivity for Loss Landscape Analysis

Akhilesh Deepak Gotmare

N. Keskar

Caiming Xiong

R. Socher

174

18 Jun 2018

The global optimum of shallow neural network is attained by ridgelet transform

169

19 May 2018

Averaging Weights Leads to Wider Optima and Better Generalization

Conference on Uncertainty in Artificial Intelligence (UAI), 2018

Dmitry Vetrov

649

1,890

14 Mar 2018

Variance Networks: When Expectation Does Not Meet Your Expectations

International Conference on Learning Representations (ICLR), 2018

Dmitry Vetrov

366

10 Mar 2018

Essentially No Barriers in Neural Network Energy Landscape

International Conference on Machine Learning (ICML), 2018

579

487

02 Mar 2018

Generating Neural Networks with Neural Networks

Lior Deutsch

311

06 Jan 2018