arXiv:1802.10026
Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs
Neural Information Processing Systems (NeurIPS), 2018
27 February 2018
T. Garipov
Pavel Izmailov
Dmitrii Podoprikhin
Dmitry Vetrov
A. Wilson
UQCV
Papers citing
"Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs"
50 / 548 papers shown
Domain Aligned Prefix Averaging for Domain Generalization in Abstractive Summarization
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Pranav Ajit Nair
Sukomal Pal
Pradeepika Verma
MoMe
235
2
0
26 May 2023
How to escape sharp minima with random perturbations
International Conference on Machine Learning (ICML), 2023
Kwangjun Ahn
Ali Jadbabaie
S. Sra
ODL
418
13
0
25 May 2023
Sparse Weight Averaging with Multiple Particles for Iterative Magnitude Pruning
International Conference on Learning Representations (ICLR), 2023
Moonseok Choi
Hyungi Lee
G. Nam
Juho Lee
267
4
0
24 May 2023
Transferring Learning Trajectories of Neural Networks
International Conference on Learning Representations (ICLR), 2023
Daiki Chijiwa
265
4
0
23 May 2023
Neural Functional Transformers
Neural Information Processing Systems (NeurIPS), 2023
Allan Zhou
Kaien Yang
Yiding Jiang
Kaylee Burns
Winnie Xu
Samuel Sokota
J. Zico Kolter
Chelsea Finn
252
43
0
22 May 2023
Annealing Self-Distillation Rectification Improves Adversarial Training
International Conference on Learning Representations (ICLR), 2023
Yuehua Wu
Hung-Jui Wang
Shang-Tse Chen
AAML
270
6
0
20 May 2023
Mode Connectivity in Auction Design
Neural Information Processing Systems (NeurIPS), 2023
Christoph Hertrich
Yixin Tao
László A. Végh
289
3
0
18 May 2023
Recyclable Tuning for Continual Pre-training
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yujia Qin
Cheng Qian
Xu Han
Yankai Lin
Huadong Wang
Ruobing Xie
Zhiyuan Liu
Maosong Sun
Jie Zhou
CLL
170
16
0
15 May 2023
Understanding and Improving Model Averaging in Federated Learning on Heterogeneous Data
IEEE Transactions on Mobile Computing (IEEE TMC), 2023
Tailin Zhou
Zehong Lin
Jinchao Zhang
Danny H. K. Tsang
MoMe
FedML
388
20
0
13 May 2023
Functional Equivalence and Path Connectivity of Reducible Hyperbolic Tangent Networks
Neural Information Processing Systems (NeurIPS), 2023
Matthew Farrugia-Roberts
216
6
0
08 May 2023
Adaptive loose optimization for robust question answering
Jie Ma
Pinghui Wang
Ze-you Wang
Dechen Kong
Min Hu
Tingxu Han
Jun Liu
OOD
409
4
0
06 May 2023
ZipIt! Merging Models from Different Tasks without Training
International Conference on Learning Representations (ICLR), 2023
George Stoica
Daniel Bolya
J. Bjorner
Pratik Ramesh
Taylor N. Hearn
Judy Hoffman
VLM
MoMe
465
163
0
04 May 2023
π-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation
International Conference on Machine Learning (ICML), 2023
Chengyue Wu
Teng Wang
Yixiao Ge
Zeyu Lu
Rui-Zhi Zhou
Ying Shan
Ping Luo
MoMe
214
43
0
27 Apr 2023
PopulAtion Parameter Averaging (PAPA)
Alexia Jolicoeur-Martineau
Emy Gervais
Kilian Fatras
Yan Zhang
Damien Scieur
MoMe
483
25
0
06 Apr 2023
Inductive biases in deep learning models for weather prediction
Jannik Thümmel
Matthias Karlbauer
S. Otte
C. Zarfl
Georg Martius
...
Thomas Scholten
Ulrich Friedrich
V. Wulfmeyer
B. Goswami
Martin Volker Butz
AI4CE
299
8
0
06 Apr 2023
Towards Efficient MCMC Sampling in Bayesian Neural Networks by Exploiting Symmetry
J. G. Wiese
Lisa Wimmer
Theodore Papamarkou
B. Bischl
Stephan Günnemann
David Rügamer
195
17
0
06 Apr 2023
On the Variance of Neural Network Training with respect to Test Sets and Distributions
International Conference on Learning Representations (ICLR), 2023
Keller Jordan
OOD
366
20
0
04 Apr 2023
A Survey of Historical Learning: Learning Models with Learning History
Xiang Li
Ge Wu
Lingfeng Yang
Wenzhe Wang
Renjie Song
Jian Yang
MU
AI4TS
248
2
0
23 Mar 2023
Sharpness-Aware Gradient Matching for Domain Generalization
Computer Vision and Pattern Recognition (CVPR), 2023
Pengfei Wang
Zhaoxiang Zhang
Zhen Lei
Lei Zhang
285
143
0
18 Mar 2023
Bridging Models to Defend: A Population-Based Strategy for Robust Adversarial Defense
Ren Wang
Yuxuan Li
Sijia Liu
Dakuo Wang
Jinjun Xiong
Pin-Yu Chen
Sijia Liu
Mohammad Shahidehpour
Alfred Hero
AAML
184
0
0
17 Mar 2023
Achieving a Better Stability-Plasticity Trade-off via Auxiliary Networks in Continual Learning
Computer Vision and Pattern Recognition (CVPR), 2023
Sang-Ho Kim
Lorenzo Noci
Antonio Orvieto
Thomas Hofmann
CLL
171
57
0
16 Mar 2023
To Stay or Not to Stay in the Pre-train Basin: Insights on Ensembling in Transfer Learning
Neural Information Processing Systems (NeurIPS), 2023
Ildus Sadrtdinov
Dmitrii Pozdeev
Dmitry Vetrov
E. Lobacheva
236
7
0
06 Mar 2023
Average of Pruning: Improving Performance and Stability of Out-of-Distribution Detection
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Zhen Cheng
Fei Zhu
Xu-Yao Zhang
Cheng-Lin Liu
MoMe
OODD
210
15
0
02 Mar 2023
DART: Diversify-Aggregate-Repeat Training Improves Generalization of Neural Networks
Computer Vision and Pattern Recognition (CVPR), 2023
Samyak Jain
Sravanti Addepalli
P. Sahu
Priyam Dey
R. Venkatesh Babu
MoMe
OOD
321
27
0
28 Feb 2023
Permutation Equivariant Neural Functionals
Neural Information Processing Systems (NeurIPS), 2023
Allan Zhou
Kaien Yang
Kaylee Burns
Adriano Cardace
Yiding Jiang
Samuel Sokota
J. Zico Kolter
Chelsea Finn
300
65
0
27 Feb 2023
Random Teachers are Good Teachers
International Conference on Machine Learning (ICML), 2023
Felix Sarnthein
Gregor Bachmann
Sotiris Anagnostidis
Thomas Hofmann
336
7
0
23 Feb 2023
Modular Deep Learning
Jonas Pfeiffer
Sebastian Ruder
Ivan Vulić
Edoardo Ponti
MoMe
OOD
437
103
0
22 Feb 2023
Revisiting Weighted Aggregation in Federated Learning with Neural Networks
International Conference on Machine Learning (ICML), 2023
Zexi Li
Tao Lin
Xinyi Shang
Chao-Xiang Wu
FedML
327
101
0
14 Feb 2023
Autoselection of the Ensemble of Convolutional Neural Networks with Second-Order Cone Programming
Social Science Research Network (SSRN), 2023
Buse Çisil Güldoğuş
Abdullah Nazhat Abdullah
Muhammad Ammar Ali
Süreyya Özögür-Akyüz
132
1
0
12 Feb 2023
Interpretable Diversity Analysis: Visualizing Feature Representations In Low-Cost Ensembles
IEEE International Joint Conference on Neural Network (IJCNN), 2023
Tim Whitaker
L. D. Whitley
81
1
0
12 Feb 2023
Knowledge is a Region in Weight Space for Fine-tuned Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Almog Gueta
Elad Venezian
Colin Raffel
Noam Slonim
Yoav Katz
Leshem Choshen
311
58
0
09 Feb 2023
Generalized Uncertainty of Deep Neural Networks: Taxonomy and Applications
Chengyu Dong
OOD
UQCV
BDL
AI4CE
333
2
0
02 Feb 2023
A Comprehensive Survey of Continual Learning: Theory, Method and Application
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Liyuan Wang
Xingxing Zhang
Hang Su
Jun Zhu
KELM
CLL
782
1,081
0
31 Jan 2023
Towards Inference Efficient Deep Ensemble Learning
AAAI Conference on Artificial Intelligence (AAAI), 2023
Ziyue Li
Kan Ren
Yifan Yang
Xinyang Jiang
Yuqing Yang
Dongsheng Li
BDL
143
17
0
29 Jan 2023
On the Lipschitz Constant of Deep Networks and Double Descent
British Machine Vision Conference (BMVC), 2023
Matteo Gamba
Hossein Azizpour
Mårten Björkman
542
11
0
28 Jan 2023
Uncertainty Estimation based on Geometric Separation
Gabriella Chouraqui
L. Cohen
Gil Einziger
Liel Leman
170
0
0
11 Jan 2023
Re-basin via implicit Sinkhorn differentiation
Computer Vision and Pattern Recognition (CVPR), 2022
F. Guerrero-Peña
H. R. Medeiros
Thomas Dubail
Masih Aminbeidokhti
Mohammadhadi Shateri
M. Pedersoli
MoMe
318
59
0
22 Dec 2022
Likelihood-based generalization of Markov parameter estimation and multiple shooting objectives in system identification
Nicholas Galioto
Alex Arkady Gorodetsky
341
1
0
20 Dec 2022
Neuroevolution of Physics-Informed Neural Nets: Benchmark Problems and Comparative Results
Nicholas Sung
Jian Cheng Wong
C. Ooi
Abhishek Gupta
P. Chiu
Yew-Soon Ong
PINN
184
10
0
15 Dec 2022
Editing Models with Task Arithmetic
International Conference on Learning Representations (ICLR), 2022
Gabriel Ilharco
Marco Tulio Ribeiro
Mitchell Wortsman
Suchin Gururangan
Ludwig Schmidt
Hannaneh Hajishirzi
Ali Farhadi
KELM
MoMe
MU
1.2K
740
0
08 Dec 2022
ColD Fusion: Collaborative Descent for Distributed Multitask Finetuning
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Shachar Don-Yehiya
Elad Venezian
Colin Raffel
Noam Slonim
Yoav Katz
Leshem Choshen
MoMe
279
60
0
02 Dec 2022
Context-Adaptive Deep Neural Networks via Bridge-Mode Connectivity
Nathan G. Drenkow
Alvin Tan
C. Ashcraft
Kiran Karra
178
0
0
28 Nov 2022
PAC-Bayes Compression Bounds So Tight That They Can Explain Generalization
Neural Information Processing Systems (NeurIPS), 2022
Sanae Lotfi
Marc Finzi
Sanyam Kapoor
Andres Potapczynski
Micah Goldblum
A. Wilson
BDL
MLT
AI4CE
205
75
0
24 Nov 2022
Building a Subspace of Policies for Scalable Continual Learning
International Conference on Learning Representations (ICLR), 2022
Jean-Baptiste Gaya
T. Doan
Lucas Caccia
Laure Soulier
Ludovic Denoyer
Roberta Raileanu
CLL
364
37
0
18 Nov 2022
Weighted Ensemble Self-Supervised Learning
International Conference on Learning Representations (ICLR), 2022
Yangjun Ruan
Saurabh Singh
Warren Morningstar
Alexander A. Alemi
Sergey Ioffe
Ian S. Fischer
Joshua V. Dillon
FedML
227
20
0
18 Nov 2022
Mechanistic Mode Connectivity
International Conference on Machine Learning (ICML), 2022
Ekdeep Singh Lubana
Eric J. Bigelow
Robert P. Dick
David M. Krueger
Hidenori Tanaka
299
56
0
15 Nov 2022
REPAIR: REnormalizing Permuted Activations for Interpolation Repair
International Conference on Learning Representations (ICLR), 2022
Keller Jordan
Hanie Sedghi
O. Saukh
R. Entezari
Behnam Neyshabur
MoMe
417
116
0
15 Nov 2022
On the Performance of Direct Loss Minimization for Bayesian Neural Networks
Yadi Wei
Roni Khardon
BDL
103
3
0
15 Nov 2022
Robust Federated Learning against both Data Heterogeneity and Poisoning Attack via Aggregation Optimization
Yueqi Xie
Weizhong Zhang
Renjie Pi
Fangzhao Wu
Qifeng Chen
Xing Xie
Sunghun Kim
FedML
193
9
0
10 Nov 2022
Quantifying Model Uncertainty for Semantic Segmentation using Operators in the RKHS
Rishabh Singh
José C. Príncipe
UQCV
180
3
0
03 Nov 2022