Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1802.10026
Cited By
v1
v2
v3
v4 (latest)
Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs
Neural Information Processing Systems (NeurIPS), 2018
27 February 2018
T. Garipov
Pavel Izmailov
Dmitrii Podoprikhin
Dmitry Vetrov
A. Wilson
UQCV
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs"
50 / 548 papers shown
Dormant Neural Trojans
International Conference on Machine Learning and Applications (ICMLA), 2022
Feisi Fu
Panagiota Kiourti
Wenchao Li
AAML
227
0
0
02 Nov 2022
Symmetries, flat minima, and the conserved quantities of gradient flow
International Conference on Learning Representations (ICLR), 2022
Bo Zhao
I. Ganev
Robin Walters
Rose Yu
Nima Dehmamy
374
28
0
31 Oct 2022
A picture of the space of typical learnable tasks
International Conference on Machine Learning (ICML), 2022
Rahul Ramesh
Jialin Mao
Itay Griniasty
Rubing Yang
H. Teoh
Mark K. Transtrum
James P. Sethna
Pratik Chaudhari
SSL
DRL
422
6
0
31 Oct 2022
Flatter, faster: scaling momentum for optimal speedup of SGD
Aditya Cowsik
T. Can
Paolo Glorioso
361
6
0
28 Oct 2022
Exploring Mode Connectivity for Pre-trained Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yujia Qin
Cheng Qian
Jing Yi
Weize Chen
Yankai Lin
Xu Han
Zhiyuan Liu
Maosong Sun
Jie Zhou
226
26
0
25 Oct 2022
Augmentation by Counterfactual Explanation -- Fixing an Overconfident Classifier
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Sumedha Singla
Nihal Murali
Forough Arabshahi
Sofia Triantafyllou
Kayhan Batmanghelich
CML
229
6
0
21 Oct 2022
lo-fi: distributed fine-tuning without communication
Mitchell Wortsman
Suchin Gururangan
Shen Li
Ali Farhadi
Ludwig Schmidt
Michael G. Rabbat
Ari S. Morcos
349
24
0
19 Oct 2022
Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task models
International Conference on Machine Learning (ICML), 2022
Nikolaos Dimitriadis
P. Frossard
Franccois Fleuret
304
29
0
18 Oct 2022
Packed-Ensembles for Efficient Uncertainty Estimation
International Conference on Learning Representations (ICLR), 2022
Olivier Laurent
Adrien Lafage
Enzo Tartaglione
Geoffrey Daniel
Jean-Marc Martinez
Andrei Bursuc
Gianni Franchi
OODD
469
43
0
17 Oct 2022
RoS-KD: A Robust Stochastic Knowledge Distillation Approach for Noisy Medical Imaging
Industrial Conference on Data Mining (IDM), 2022
A. Jaiswal
Kumar Ashutosh
Justin F. Rousseau
Yifan Peng
Zinan Lin
Ying Ding
163
11
0
15 Oct 2022
Mean-field analysis for heavy ball methods: Dropout-stability, connectivity, and global convergence
Diyuan Wu
Vyacheslav Kungurtsev
Marco Mondelli
197
3
0
13 Oct 2022
Wasserstein Barycenter-based Model Fusion and Linear Mode Connectivity of Neural Networks
A. K. Akash
Sixu Li
Nicolas García Trillos
207
15
0
13 Oct 2022
Deep Combinatorial Aggregation
Neural Information Processing Systems (NeurIPS), 2022
Yuesong Shen
Zorah Lähner
OOD
UQCV
155
6
0
12 Oct 2022
Stable and Efficient Adversarial Training through Local Linearization
Zhuorong Li
Daiwei Yu
AAML
113
0
0
11 Oct 2022
On the Importance of Calibration in Semi-supervised Learning
Charlotte Loh
Rumen Dangovski
Shivchander Sudalairaj
Seung-Jun Han
Ligong Han
Leonid Karlinsky
Marin Soljacic
Akash Srivastava
189
7
0
10 Oct 2022
Plateau in Monotonic Linear Interpolation -- A "Biased" View of Loss Landscape for Deep Networks
International Conference on Learning Representations (ICLR), 2022
Xiang Wang
Annie Wang
Mo Zhou
Rong Ge
MoMe
520
10
0
03 Oct 2022
Multiple Modes for Continual Learning
Siddhartha Datta
N. Shadbolt
CLL
MoMe
214
2
0
29 Sep 2022
Learning Gradient-based Mixup towards Flatter Minima for Domain Generalization
Danni Peng
Sinno Jialin Pan
192
5
0
29 Sep 2022
On Quantum Speedups for Nonconvex Optimization via Quantum Tunneling Walks
Quantum (Quantum), 2022
Yizhou Liu
Weijie J. Su
Tongyang Li
288
23
0
29 Sep 2022
Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective
International Conference on Learning Representations (ICLR), 2022
Raj Ghugare
Homanga Bharadhwaj
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
OffRL
340
28
0
18 Sep 2022
Random initialisations performing above chance and how to find them
Frederik Benzing
Simon Schug
Robert Meier
J. Oswald
Yassir Akram
Nicolas Zucchet
Laurence Aitchison
Angelika Steger
ODL
424
28
0
15 Sep 2022
Git Re-Basin: Merging Models modulo Permutation Symmetries
International Conference on Learning Representations (ICLR), 2022
Samuel K. Ainsworth
J. Hayase
S. Srinivasa
MoMe
958
422
0
11 Sep 2022
Lottery Pools: Winning More by Interpolating Tickets without Increasing Training or Inference Cost
AAAI Conference on Artificial Intelligence (AAAI), 2022
Lu Yin
Shiwei Liu
Fang Meng
Tianjin Huang
Vlado Menkovski
Mykola Pechenizkiy
133
14
0
23 Aug 2022
A Unified Analysis of Mixed Sample Data Augmentation: A Loss Function Perspective
Neural Information Processing Systems (NeurIPS), 2022
Chanwoo Park
Sangdoo Yun
Sanghyuk Chun
AAML
222
39
0
21 Aug 2022
Quantifying the Knowledge in a DNN to Explain Knowledge Distillation for Classification
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Quanshi Zhang
Feng He
Yilan Chen
Zhefan Rao
189
45
0
18 Aug 2022
On the generalization of learning algorithms that do not converge
Neural Information Processing Systems (NeurIPS), 2022
N. Chandramoorthy
Andreas Loukas
Khashayar Gatmiry
Stefanie Jegelka
MLT
379
12
0
16 Aug 2022
Uncertainty Quantification for Traffic Forecasting: A Unified Approach
IEEE International Conference on Data Engineering (ICDE), 2022
Weizhu Qian
Dalin Zhang
Yan Zhao
Kai Zheng
James Jianqiao Yu
BDL
AI4TS
181
33
0
11 Aug 2022
Improving Predictive Performance and Calibration by Weight Fusion in Semantic Segmentation
Timo Sämann
A. Hammam
Andrei Bursuc
Christoph Stiller
H. Groß
FedML
182
1
0
22 Jul 2022
On the Subspace Structure of Gradient-Based Meta-Learning
Gustaf Tegnér
Alfredo Reichlin
Hang Yin
Mårten Björkman
Danica Kragic
255
0
0
08 Jul 2022
YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
Computer Vision and Pattern Recognition (CVPR), 2022
Chien-Yao Wang
Alexey Bochkovskiy
H. Liao
ObjD
478
9,312
0
06 Jul 2022
Effective training-time stacking for ensembling of deep neural networks
International Conference on Artificial Intelligence and Pattern Recognition (AIPR), 2022
P. Proskura
Alexey Zaytsev
96
11
0
27 Jun 2022
Transfer learning for ensembles: reducing computation time and keeping the diversity
International Conference on Artificial Intelligence and Pattern Recognition (AIPR), 2022
Ilya Shashkov
Nikita Balabin
Evgeny Burnaev
Alexey Zaytsev
234
2
0
27 Jun 2022
A Geometric Method for Improved Uncertainty Estimation in Real-time
Conference on Uncertainty in Artificial Intelligence (UAI), 2022
Gabriella Chouraqui
L. Cohen
Gil Einziger
Liel Leman
143
0
0
23 Jun 2022
Disentangling Model Multiplicity in Deep Learning
Ari Heljakka
Martin Trapp
Arno Solin
Arno Solin
181
6
0
17 Jun 2022
Geometrically Guided Integrated Gradients
Md. Mahfuzur Rahman
N. Lewis
Sergey Plis
FAtt
AAML
141
0
0
13 Jun 2022
Lottery Tickets on a Data Diet: Finding Initializations with Sparse Trainable Networks
Neural Information Processing Systems (NeurIPS), 2022
Mansheej Paul
Brett W. Larsen
Surya Ganguli
Jonathan Frankle
Gintare Karolina Dziugaite
148
24
0
02 Jun 2022
Star algorithm for NN ensembling
Sergey Zinchenko
Dmitry Lishudi
FedML
110
0
0
01 Jun 2022
Superposing Many Tickets into One: A Performance Booster for Sparse Neural Network Training
Conference on Uncertainty in Artificial Intelligence (UAI), 2022
Lu Yin
Vlado Menkovski
Meng Fang
Tianjin Huang
Yulong Pei
Mykola Pechenizkiy
Decebal Constantin Mocanu
Shiwei Liu
279
9
0
30 May 2022
The Missing Invariance Principle Found -- the Reciprocal Twin of Invariant Risk Minimization
Neural Information Processing Systems (NeurIPS), 2022
Dongsung Huh
A. Baidya
OOD
144
9
0
29 May 2022
Laplace HypoPINN: Physics-Informed Neural Network for hypocenter localization and its predictive uncertainty
M. Izzatullah
I. Yildirim
U. Waheed
T. Alkhalifah
211
20
0
28 May 2022
How Tempering Fixes Data Augmentation in Bayesian Neural Networks
International Conference on Machine Learning (ICML), 2022
Gregor Bachmann
Lorenzo Noci
Thomas Hofmann
BDL
AAML
285
11
0
27 May 2022
Linear Connectivity Reveals Generalization Strategies
International Conference on Learning Representations (ICLR), 2022
Jeevesh Juneja
Rachit Bansal
Kyunghyun Cho
João Sedoc
Naomi Saphra
856
55
0
24 May 2022
Quarantine: Sparsity Can Uncover the Trojan Attack Trigger for Free
Computer Vision and Pattern Recognition (CVPR), 2022
Tianlong Chen
Zhenyu Zhang
Yihua Zhang
Shiyu Chang
Sijia Liu
Zinan Lin
AAML
201
28
0
24 May 2022
The Unreasonable Effectiveness of Deep Evidential Regression
AAAI Conference on Artificial Intelligence (AAAI), 2022
N. Meinert
J. Gawlikowski
Alexander Lavin
UQCV
EDL
640
49
0
20 May 2022
Interpolating Compressed Parameter Subspaces
Siddhartha Datta
N. Shadbolt
234
5
0
19 May 2022
Diverse Weight Averaging for Out-of-Distribution Generalization
Neural Information Processing Systems (NeurIPS), 2022
Alexandre Ramé
Matthieu Kirchmeyer
Thibaud Rahier
A. Rakotomamonjy
Patrick Gallinari
Matthieu Cord
OOD
566
160
0
19 May 2022
ELODI: Ensemble Logit Difference Inhibition for Positive-Congruent Training
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yue Zhao
Yantao Shen
Yuanjun Xiong
Shuo Yang
Wei Xia
Zhuowen Tu
Bernt Shiele
Stefano Soatto
BDL
229
8
0
12 May 2022
One-shot Federated Learning without Server-side Training
Neural Networks (NN), 2022
Shangchao Su
Bin Li
Xiangyang Xue
FedML
161
38
0
26 Apr 2022
Federated Geometric Monte Carlo Clustering to Counter Non-IID Datasets
Federico Lucchetti
Jérémie Decouchant
Maria Fernandes
L. Chen
Marcus Volp
FedML
134
1
0
23 Apr 2022
A Simple Approach to Adversarial Robustness in Few-shot Image Classification
Akshayvarun Subramanya
Hamed Pirsiavash
VLM
146
6
0
11 Apr 2022
Previous
1
2
3
...
10
11
6
7
8
9
Next
Page 7 of 11
Page
of 11
Go