Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1802.10026
Cited By
v1
v2
v3
v4 (latest)
Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs
Neural Information Processing Systems (NeurIPS), 2018
27 February 2018
T. Garipov
Pavel Izmailov
Dmitrii Podoprikhin
Dmitry Vetrov
A. Wilson
UQCV
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs"
50 / 548 papers shown
Deep learning, stochastic gradient descent and diffusion maps
Journal of Computational Mathematics and Data Science (JCMDS), 2022
Carmina Fjellström
Kaj Nyström
DiffM
223
18
0
04 Apr 2022
On Uncertainty, Tempering, and Data Augmentation in Bayesian Classification
Neural Information Processing Systems (NeurIPS), 2022
Sanyam Kapoor
Wesley J. Maddox
Pavel Izmailov
A. Wilson
BDL
UD
240
58
0
30 Mar 2022
Improving Generalization in Federated Learning by Seeking Flat Minima
European Conference on Computer Vision (ECCV), 2022
Debora Caldarola
Barbara Caputo
Marco Ciccone
FedML
377
139
0
22 Mar 2022
A Local Convergence Theory for the Stochastic Gradient Descent Method in Non-Convex Optimization With Non-isolated Local Minima
Tae-Eon Ko
Xiantao Li
212
2
0
21 Mar 2022
Self-Ensemble Adversarial Training for Improved Robustness
International Conference on Learning Representations (ICLR), 2022
Hongjun Wang
Yisen Wang
OOD
AAML
241
57
0
18 Mar 2022
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
International Conference on Machine Learning (ICML), 2022
Mitchell Wortsman
Gabriel Ilharco
S. Gadre
Rebecca Roelofs
Raphael Gontijo-Lopes
...
Hongseok Namkoong
Ali Farhadi
Y. Carmon
Simon Kornblith
Ludwig Schmidt
MoMe
728
1,290
1
10 Mar 2022
Low-Loss Subspace Compression for Clean Gains against Multi-Agent Backdoor Attacks
Siddhartha Datta
N. Shadbolt
AAML
218
7
0
07 Mar 2022
Embedded Ensembles: Infinite Width Limit and Operating Regimes
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Maksim Velikanov
Roma Kail
Ivan Anokhin
Roman Vashurin
Maxim Panov
Alexey Zaytsev
Dmitry Yarotsky
138
1
0
24 Feb 2022
Prune and Tune Ensembles: Low-Cost Ensemble Learning With Sparse Independent Subnetworks
AAAI Conference on Artificial Intelligence (AAAI), 2022
Tim Whitaker
L. D. Whitley
UQCV
167
28
0
23 Feb 2022
Continual Learning Beyond a Single Model
T. Doan
Seyed Iman Mirzadeh
Mehrdad Farajtabar
CLL
361
22
0
20 Feb 2022
Ensemble Learning techniques for object detection in high-resolution satellite images
A. Vilhelm
Matthieu Limbert
Clement Audebert
Tugdual Ceillier
ObjD
71
1
0
16 Feb 2022
PFGE: Parsimonious Fast Geometric Ensembling of DNNs
International Conference on Intelligent Computing (ICIC), 2022
Hao Guo
Jiyong Jin
B. Liu
FedML
437
1
0
14 Feb 2022
Deep Networks on Toroids: Removing Symmetries Reveals the Structure of Flat Regions in the Landscape Geometry
International Conference on Machine Learning (ICML), 2022
Fabrizio Pittorino
Antonio Ferraro
Gabriele Perugini
Christoph Feinauer
Carlo Baldassi
R. Zecchina
501
29
0
07 Feb 2022
Evaluating natural language processing models with generalization metrics that do not need access to any training or testing data
Yaoqing Yang
Ryan Theisen
Liam Hodgkinson
Joseph E. Gonzalez
Kannan Ramchandran
Charles H. Martin
Michael W. Mahoney
327
20
0
06 Feb 2022
When Do Flat Minima Optimizers Work?
Neural Information Processing Systems (NeurIPS), 2022
Jean Kaddour
Linqing Liu
Ricardo M. A. Silva
Matt J. Kusner
ODL
526
86
0
01 Feb 2022
Learning Proximal Operators to Discover Multiple Optima
International Conference on Learning Representations (ICLR), 2022
Lingxiao Li
Noam Aigerman
Vladimir G. Kim
Jiajin Li
Kristjan Greenewald
Mikhail Yurochkin
Justin Solomon
333
3
0
28 Jan 2022
Improving robustness and calibration in ensembles with diversity regularization
German Conference on Pattern Recognition (GCPR), 2022
H. A. Mehrtens
Camila González
Anirban Mukhopadhyay
UQCV
124
9
0
26 Jan 2022
Generalization in Supervised Learning Through Riemannian Contraction
L. Kozachkov
Patrick M. Wensing
Jean-Jacques E. Slotine
MLT
252
10
0
17 Jan 2022
Complexity from Adaptive-Symmetries Breaking: Global Minima in the Statistical Mechanics of Deep Neural Networks
Shaun Li
AI4CE
229
1
0
03 Jan 2022
Stochastic Weight Averaging Revisited
Applied Sciences (Appl. Sci.), 2022
Hao Guo
Jiyong Jin
B. Liu
376
35
0
03 Jan 2022
Representation Topology Divergence: A Method for Comparing Neural Network Representations
International Conference on Machine Learning (ICML), 2021
S. Barannikov
I. Trofimov
Nikita Balabin
Evgeny Burnaev
3DPC
281
61
0
31 Dec 2021
SAE: Sequential Anchored Ensembles
Arnaud Delaunoy
Gilles Louppe
UQCV
BDL
176
0
0
30 Dec 2021
MVDG: A Unified Multi-view Framework for Domain Generalization
European Conference on Computer Vision (ECCV), 2021
Jian Zhang
Lei Qi
Yinghuan Shi
Yang Gao
262
42
0
23 Dec 2021
Hypernet-Ensemble Learning of Segmentation Probability for Medical Image Segmentation with Ambiguous Labels
Sun-Beom Hong
A. Bonkhoff
Andrew Hoopes
Martin Bretzner
M. Schirmer
A. Giese
Adrian Dalca
Polina Golland
N. Rost
UQCV
195
8
0
13 Dec 2021
Efficient Self-Ensemble for Semantic Segmentation
British Machine Vision Conference (BMVC), 2021
Walid Bousselham
Guillaume Thibault
Lucas Pagano
Archana Machireddy
Joe W. Gray
Y. Chang
Xubo B. Song
ViT
291
32
0
26 Nov 2021
Backdoor Attack through Frequency Domain
Tong Wang
Xingtai Lv
Feng Xu
Shengwei An
Hanghang Tong
Ting Wang
AAML
261
41
0
22 Nov 2021
MS-nowcasting: Operational Precipitation Nowcasting with Convolutional LSTMs at Microsoft Weather
Sylwester Klocek
Haiyu Dong
M. Dixon
Panashe Kanengoni
Najeeb Kazmi
Pete Luferenko
Zhongjian Lv
Shikhar Sharma
Jonathan A. Weyn
Siqi Xiang
AI4Cl
198
27
0
18 Nov 2021
Data Augmentation Can Improve Robustness
Neural Information Processing Systems (NeurIPS), 2021
Sylvestre-Alvise Rebuffi
Sven Gowal
D. A. Calian
Florian Stimberg
Olivia Wiles
Timothy A. Mann
AAML
249
362
0
09 Nov 2021
Mode connectivity in the loss landscape of parameterized quantum circuits
Quantum Machine Intelligence (QMI), 2021
Kathleen E. Hamilton
E. Lynn
R. Pooser
204
3
0
09 Nov 2021
Exponential escape efficiency of SGD from sharp minima in non-stationary regime
Hikaru Ibayashi
Masaaki Imaizumi
291
5
0
07 Nov 2021
Mean-field Analysis of Piecewise Linear Solutions for Wide ReLU Networks
Journal of machine learning research (JMLR), 2021
Aleksandr Shevchenko
Vyacheslav Kungurtsev
Marco Mondelli
MLT
288
15
0
03 Nov 2021
Deep learning via message passing algorithms based on belief propagation
Carlo Lucibello
Fabrizio Pittorino
Gabriele Perugini
R. Zecchina
437
19
0
27 Oct 2021
Towards Better Plasticity-Stability Trade-off in Incremental Learning: A Simple Linear Connector
Guoliang Lin
Hanlu Chu
Hanjiang Lai
MoMe
CLL
225
64
0
15 Oct 2021
What Happens after SGD Reaches Zero Loss? --A Mathematical Framework
Zhiyuan Li
Tianhao Wang
Sanjeev Arora
MLT
353
114
0
13 Oct 2021
The Role of Permutation Invariance in Linear Mode Connectivity of Neural Networks
International Conference on Learning Representations (ICLR), 2021
R. Entezari
Hanie Sedghi
O. Saukh
Behnam Neyshabur
MoMe
601
273
0
12 Oct 2021
Learning a subspace of policies for online adaptation in Reinforcement Learning
International Conference on Learning Representations (ICLR), 2021
Jean-Baptiste Gaya
Laure Soulier
Ludovic Denoyer
OffRL
311
16
0
11 Oct 2021
Tighter Sparse Approximation Bounds for ReLU Neural Networks
Carles Domingo-Enrich
Youssef Mroueh
309
4
0
07 Oct 2021
Which Shortcut Cues Will DNNs Choose? A Study from the Parameter-Space Perspective
Luca Scimeca
Seong Joon Oh
Sanghyuk Chun
Michael Poli
Sangdoo Yun
OOD
1.6K
61
0
06 Oct 2021
Prior and Posterior Networks: A Survey on Evidential Deep Learning Methods For Uncertainty Estimation
Dennis Ulmer
Christian Hardmeier
J. Frellsen
BDL
UQCV
UD
EDL
PER
335
77
0
06 Oct 2021
Boost Neural Networks by Checkpoints
Feng Wang
Gu-Yeon Wei
Qiao Liu
Jinxiang Ou
Xian Wei
Hairong Lv
FedML
UQCV
155
12
0
03 Oct 2021
A Physics inspired Functional Operator for Model Uncertainty Quantification in the RKHS
Rishabh Singh
José C. Príncipe
222
4
0
22 Sep 2021
Connecting Low-Loss Subspace for Personalized Federated Learning
S. Hahn
Minwoo Jeong
Junghye Lee
FedML
216
25
0
16 Sep 2021
SanitAIs: Unsupervised Data Augmentation to Sanitize Trojaned Neural Networks
Kiran Karra
C. Ashcraft
Cash Costello
AAML
217
0
0
09 Sep 2021
Expressive Power and Loss Surfaces of Deep Learning Models
S. Dube
134
0
0
08 Aug 2021
Quantum Continual Learning Overcoming Catastrophic Forgetting
Chinese Physics Letters (CPL), 2021
Wenjie Jiang
Zhide Lu
D. Deng
227
12
0
05 Aug 2021
AdvRush: Searching for Adversarially Robust Neural Architectures
J. Mok
Byunggook Na
Hyeokjun Choe
Sungroh Yoon
OOD
AAML
215
52
0
03 Aug 2021
Taxonomizing local versus global structure in neural network loss landscapes
Neural Information Processing Systems (NeurIPS), 2021
Yaoqing Yang
Liam Hodgkinson
Ryan Theisen
Joe Zou
Joseph E. Gonzalez
Kannan Ramchandran
Michael W. Mahoney
367
43
0
23 Jul 2021
Fed-ensemble: Improving Generalization through Model Ensembling in Federated Learning
IEEE Transactions on Automation Science and Engineering (T-ASE), 2021
Naichen Shi
Fan Lai
Raed Al Kontar
Mosharaf Chowdhury
FedML
169
41
0
21 Jul 2021
The Limiting Dynamics of SGD: Modified Loss, Phase Space Oscillations, and Anomalous Diffusion
Neural Computation (Neural Comput.), 2021
D. Kunin
Javier Sagastuy-Breña
Lauren Gillespie
Eshed Margalit
Hidenori Tanaka
Surya Ganguli
Daniel L. K. Yamins
510
19
0
19 Jul 2021
Structured Directional Pruning via Perturbation Orthogonal Projection
YinchuanLi
XiaofengLiu
YunfengShao
QingWang
YanhuiGeng
194
2
0
12 Jul 2021
Previous
1
2
3
...
10
11
7
8
9
Next