ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.10026
  4. Cited By
Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs
v1v2v3v4 (latest)

Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs

Neural Information Processing Systems (NeurIPS), 2018
27 February 2018
T. Garipov
Pavel Izmailov
Dmitrii Podoprikhin
Dmitry Vetrov
A. Wilson
    UQCV
ArXiv (abs)PDFHTML

Papers citing "Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs"

50 / 548 papers shown
Domain Aligned Prefix Averaging for Domain Generalization in Abstractive
  Summarization
Domain Aligned Prefix Averaging for Domain Generalization in Abstractive SummarizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Pranav Ajit Nair
Sukomal Pal
Pradeepika Verm
MoMe
235
2
0
26 May 2023
How to escape sharp minima with random perturbations
How to escape sharp minima with random perturbationsInternational Conference on Machine Learning (ICML), 2023
Kwangjun Ahn
Ali Jadbabaie
S. Sra
ODL
418
13
0
25 May 2023
Sparse Weight Averaging with Multiple Particles for Iterative Magnitude
  Pruning
Sparse Weight Averaging with Multiple Particles for Iterative Magnitude PruningInternational Conference on Learning Representations (ICLR), 2023
Moonseok Choi
Hyungi Lee
G. Nam
Juho Lee
267
4
0
24 May 2023
Transferring Learning Trajectories of Neural Networks
Transferring Learning Trajectories of Neural NetworksInternational Conference on Learning Representations (ICLR), 2023
Daiki Chijiwa
265
4
0
23 May 2023
Neural Functional Transformers
Neural Functional TransformersNeural Information Processing Systems (NeurIPS), 2023
Allan Zhou
Kaien Yang
Yiding Jiang
Kaylee Burns
Winnie Xu
Samuel Sokota
J. Zico Kolter
Chelsea Finn
252
43
0
22 May 2023
Annealing Self-Distillation Rectification Improves Adversarial Training
Annealing Self-Distillation Rectification Improves Adversarial TrainingInternational Conference on Learning Representations (ICLR), 2023
Yuehua Wu
Hung-Jui Wang
Shang-Tse Chen
AAML
270
6
0
20 May 2023
Mode Connectivity in Auction Design
Mode Connectivity in Auction DesignNeural Information Processing Systems (NeurIPS), 2023
Christoph Hertrich
Yixin Tao
László A. Végh
289
3
0
18 May 2023
Recyclable Tuning for Continual Pre-training
Recyclable Tuning for Continual Pre-trainingAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Yujia Qin
Cheng Qian
Xu Han
Yankai Lin
Huadong Wang
Ruobing Xie
Zhiyuan Liu
Maosong Sun
Jie Zhou
CLL
170
16
0
15 May 2023
Understanding and Improving Model Averaging in Federated Learning on
  Heterogeneous Data
Understanding and Improving Model Averaging in Federated Learning on Heterogeneous DataIEEE Transactions on Mobile Computing (IEEE TMC), 2023
Tailin Zhou
Zehong Lin
Jinchao Zhang
Danny H. K. Tsang
MoMeFedML
388
20
0
13 May 2023
Functional Equivalence and Path Connectivity of Reducible Hyperbolic
  Tangent Networks
Functional Equivalence and Path Connectivity of Reducible Hyperbolic Tangent NetworksNeural Information Processing Systems (NeurIPS), 2023
Matthew Farrugia-Roberts
216
6
0
08 May 2023
Adaptive loose optimization for robust question answering
Adaptive loose optimization for robust question answering
Jie Ma
Pinghui Wang
Ze-you Wang
Dechen Kong
Min Hu
Tingxu Han
Jun Liu
OOD
409
4
0
06 May 2023
ZipIt! Merging Models from Different Tasks without Training
ZipIt! Merging Models from Different Tasks without TrainingInternational Conference on Learning Representations (ICLR), 2023
George Stoica
Daniel Bolya
J. Bjorner
Pratik Ramesh
Taylor N. Hearn
Judy Hoffman
VLMMoMe
465
163
0
04 May 2023
$π$-Tuning: Transferring Multimodal Foundation Models with Optimal
  Multi-task Interpolation
πππ-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task InterpolationInternational Conference on Machine Learning (ICML), 2023
Chengyue Wu
Teng Wang
Yixiao Ge
Zeyu Lu
Rui-Zhi Zhou
Ying Shan
Ping Luo
MoMe
214
43
0
27 Apr 2023
PopulAtion Parameter Averaging (PAPA)
PopulAtion Parameter Averaging (PAPA)
Alexia Jolicoeur-Martineau
Emy Gervais
Kilian Fatras
Yan Zhang
Damien Scieur
MoMe
483
25
0
06 Apr 2023
Inductive biases in deep learning models for weather prediction
Inductive biases in deep learning models for weather prediction
Jannik Thümmel
Matthias Karlbauer
S. Otte
C. Zarfl
Georg Martius
...
Thomas Scholten
Ulrich Friedrich
V. Wulfmeyer
B. Goswami
Martin Volker Butz
AI4CE
299
8
0
06 Apr 2023
Towards Efficient MCMC Sampling in Bayesian Neural Networks by
  Exploiting Symmetry
Towards Efficient MCMC Sampling in Bayesian Neural Networks by Exploiting Symmetry
J. G. Wiese
Lisa Wimmer
Theodore Papamarkou
B. Bischl
Stephan Günnemann
David Rügamer
195
17
0
06 Apr 2023
On the Variance of Neural Network Training with respect to Test Sets and
  Distributions
On the Variance of Neural Network Training with respect to Test Sets and DistributionsInternational Conference on Learning Representations (ICLR), 2023
Keller Jordan
OOD
366
20
0
04 Apr 2023
A Survey of Historical Learning: Learning Models with Learning History
A Survey of Historical Learning: Learning Models with Learning History
Xiang Li
Ge Wu
Lingfeng Yang
Wenzhe Wang
Renjie Song
Jian Yang
MUAI4TS
248
2
0
23 Mar 2023
Sharpness-Aware Gradient Matching for Domain Generalization
Sharpness-Aware Gradient Matching for Domain GeneralizationComputer Vision and Pattern Recognition (CVPR), 2023
Pengfei Wang
Zhaoxiang Zhang
Zhen Lei
Lei Zhang
285
143
0
18 Mar 2023
Bridging Models to Defend: A Population-Based Strategy for Robust Adversarial Defense
Bridging Models to Defend: A Population-Based Strategy for Robust Adversarial Defense
Ren Wang
Yuxuan Li
Sijia Liu
Dakuo Wang
Jinjun Xiong
Pin-Yu Chen
Sijia Liu
Mohammad Shahidehpour
Alfred Hero
AAML
184
0
0
17 Mar 2023
Achieving a Better Stability-Plasticity Trade-off via Auxiliary Networks
  in Continual Learning
Achieving a Better Stability-Plasticity Trade-off via Auxiliary Networks in Continual LearningComputer Vision and Pattern Recognition (CVPR), 2023
Sang-Ho Kim
Lorenzo Noci
Antonio Orvieto
Thomas Hofmann
CLL
171
57
0
16 Mar 2023
To Stay or Not to Stay in the Pre-train Basin: Insights on Ensembling in
  Transfer Learning
To Stay or Not to Stay in the Pre-train Basin: Insights on Ensembling in Transfer LearningNeural Information Processing Systems (NeurIPS), 2023
Ildus Sadrtdinov
Dmitrii Pozdeev
Dmitry Vetrov
E. Lobacheva
236
7
0
06 Mar 2023
Average of Pruning: Improving Performance and Stability of
  Out-of-Distribution Detection
Average of Pruning: Improving Performance and Stability of Out-of-Distribution DetectionIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Zhen Cheng
Fei Zhu
Xu-Yao Zhang
Cheng-Lin Liu
MoMeOODD
210
15
0
02 Mar 2023
DART: Diversify-Aggregate-Repeat Training Improves Generalization of
  Neural Networks
DART: Diversify-Aggregate-Repeat Training Improves Generalization of Neural NetworksComputer Vision and Pattern Recognition (CVPR), 2023
Samyak Jain
Sravanti Addepalli
P. Sahu
Priyam Dey
R. Venkatesh Babu
MoMeOOD
321
27
0
28 Feb 2023
Permutation Equivariant Neural Functionals
Permutation Equivariant Neural FunctionalsNeural Information Processing Systems (NeurIPS), 2023
Allan Zhou
Kaien Yang
Kaylee Burns
Adriano Cardace
Yiding Jiang
Samuel Sokota
J. Zico Kolter
Chelsea Finn
300
65
0
27 Feb 2023
Random Teachers are Good Teachers
Random Teachers are Good TeachersInternational Conference on Machine Learning (ICML), 2023
Felix Sarnthein
Gregor Bachmann
Sotiris Anagnostidis
Thomas Hofmann
336
7
0
23 Feb 2023
Modular Deep Learning
Modular Deep Learning
Jonas Pfeiffer
Sebastian Ruder
Ivan Vulić
Edoardo Ponti
MoMeOOD
437
103
0
22 Feb 2023
Revisiting Weighted Aggregation in Federated Learning with Neural
  Networks
Revisiting Weighted Aggregation in Federated Learning with Neural NetworksInternational Conference on Machine Learning (ICML), 2023
Zexi Li
Tao Lin
Xinyi Shang
Chao-Xiang Wu
FedML
327
101
0
14 Feb 2023
Autoselection of the Ensemble of Convolutional Neural Networks with
  Second-Order Cone Programming
Autoselection of the Ensemble of Convolutional Neural Networks with Second-Order Cone ProgrammingSocial Science Research Network (SSRN), 2023
Buse Çisil Güldoğuş
Abdullah Nazhat Abdullah
Muhammad Ammar Ali
Süreyya Özögür-Akyüz
132
1
0
12 Feb 2023
Interpretable Diversity Analysis: Visualizing Feature Representations In
  Low-Cost Ensembles
Interpretable Diversity Analysis: Visualizing Feature Representations In Low-Cost EnsemblesIEEE International Joint Conference on Neural Network (IJCNN), 2023
Tim Whitaker
L. D. Whitley
81
1
0
12 Feb 2023
Knowledge is a Region in Weight Space for Fine-tuned Language Models
Knowledge is a Region in Weight Space for Fine-tuned Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Almog Gueta
Elad Venezian
Colin Raffel
Noam Slonim
Yoav Katz
Leshem Choshen
311
58
0
09 Feb 2023
Generalized Uncertainty of Deep Neural Networks: Taxonomy and
  Applications
Generalized Uncertainty of Deep Neural Networks: Taxonomy and Applications
Chengyu Dong
OODUQCVBDLAI4CE
333
2
0
02 Feb 2023
A Comprehensive Survey of Continual Learning: Theory, Method and
  Application
A Comprehensive Survey of Continual Learning: Theory, Method and ApplicationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Liyuan Wang
Xingxing Zhang
Hang Su
Jun Zhu
KELMCLL
782
1,081
0
31 Jan 2023
Towards Inference Efficient Deep Ensemble Learning
Towards Inference Efficient Deep Ensemble LearningAAAI Conference on Artificial Intelligence (AAAI), 2023
Ziyue Li
Kan Ren
Yifan Yang
Xinyang Jiang
Yuqing Yang
Dongsheng Li
BDL
143
17
0
29 Jan 2023
On the Lipschitz Constant of Deep Networks and Double Descent
On the Lipschitz Constant of Deep Networks and Double DescentBritish Machine Vision Conference (BMVC), 2023
Matteo Gamba
Hossein Azizpour
Mårten Björkman
542
11
0
28 Jan 2023
Uncertainty Estimation based on Geometric Separation
Uncertainty Estimation based on Geometric Separation
Gabriella Chouraqui
L. Cohen
Gil Einziger
Liel Leman
170
0
0
11 Jan 2023
Re-basin via implicit Sinkhorn differentiation
Re-basin via implicit Sinkhorn differentiationComputer Vision and Pattern Recognition (CVPR), 2022
F. Guerrero-Peña
H. R. Medeiros
Thomas Dubail
Masih Aminbeidokhti
Mohammadhadi Shateri
M. Pedersoli
MoMe
318
59
0
22 Dec 2022
Likelihood-based generalization of Markov parameter estimation and
  multiple shooting objectives in system identification
Likelihood-based generalization of Markov parameter estimation and multiple shooting objectives in system identification
Nicholas Galioto
Alex Arkady Gorodetsky
341
1
0
20 Dec 2022
Neuroevolution of Physics-Informed Neural Nets: Benchmark Problems and
  Comparative Results
Neuroevolution of Physics-Informed Neural Nets: Benchmark Problems and Comparative Results
Nicholas Sung
Jian Cheng Wong
C. Ooi
Abhishek Gupta
P. Chiu
Yew-Soon Ong
PINN
184
10
0
15 Dec 2022
Editing Models with Task Arithmetic
Editing Models with Task ArithmeticInternational Conference on Learning Representations (ICLR), 2022
Gabriel Ilharco
Marco Tulio Ribeiro
Mitchell Wortsman
Suchin Gururangan
Ludwig Schmidt
Hannaneh Hajishirzi
Ali Farhadi
KELMMoMeMU
1.2K
740
0
08 Dec 2022
ColD Fusion: Collaborative Descent for Distributed Multitask Finetuning
ColD Fusion: Collaborative Descent for Distributed Multitask FinetuningAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Shachar Don-Yehiya
Elad Venezian
Colin Raffel
Noam Slonim
Yoav Katz
Leshem Choshen
MoMe
279
60
0
02 Dec 2022
Context-Adaptive Deep Neural Networks via Bridge-Mode Connectivity
Context-Adaptive Deep Neural Networks via Bridge-Mode Connectivity
Nathan G. Drenkow
Alvin Tan
C. Ashcraft
Kiran Karra
178
0
0
28 Nov 2022
PAC-Bayes Compression Bounds So Tight That They Can Explain
  Generalization
PAC-Bayes Compression Bounds So Tight That They Can Explain GeneralizationNeural Information Processing Systems (NeurIPS), 2022
Sanae Lotfi
Marc Finzi
Sanyam Kapoor
Andres Potapczynski
Micah Goldblum
A. Wilson
BDLMLTAI4CE
205
75
0
24 Nov 2022
Building a Subspace of Policies for Scalable Continual Learning
Building a Subspace of Policies for Scalable Continual LearningInternational Conference on Learning Representations (ICLR), 2022
Jean-Baptiste Gaya
T. Doan
Lucas Caccia
Laure Soulier
Ludovic Denoyer
Roberta Raileanu
CLL
364
37
0
18 Nov 2022
Weighted Ensemble Self-Supervised Learning
Weighted Ensemble Self-Supervised LearningInternational Conference on Learning Representations (ICLR), 2022
Yangjun Ruan
Saurabh Singh
Warren Morningstar
Alexander A. Alemi
Sergey Ioffe
Ian S. Fischer
Joshua V. Dillon
FedML
227
20
0
18 Nov 2022
Mechanistic Mode Connectivity
Mechanistic Mode ConnectivityInternational Conference on Machine Learning (ICML), 2022
Ekdeep Singh Lubana
Eric J. Bigelow
Robert P. Dick
David M. Krueger
Hidenori Tanaka
299
56
0
15 Nov 2022
REPAIR: REnormalizing Permuted Activations for Interpolation Repair
REPAIR: REnormalizing Permuted Activations for Interpolation RepairInternational Conference on Learning Representations (ICLR), 2022
Keller Jordan
Hanie Sedghi
O. Saukh
R. Entezari
Behnam Neyshabur
MoMe
417
116
0
15 Nov 2022
On the Performance of Direct Loss Minimization for Bayesian Neural
  Networks
On the Performance of Direct Loss Minimization for Bayesian Neural Networks
Yadi Wei
Roni Khardon
BDL
103
3
0
15 Nov 2022
Robust Federated Learning against both Data Heterogeneity and Poisoning
  Attack via Aggregation Optimization
Robust Federated Learning against both Data Heterogeneity and Poisoning Attack via Aggregation Optimization
Yueqi Xie
Weizhong Zhang
Renjie Pi
Fangzhao Wu
Qifeng Chen
Xing Xie
Sunghun Kim
FedML
193
9
0
10 Nov 2022
Quantifying Model Uncertainty for Semantic Segmentation using Operators
  in the RKHS
Quantifying Model Uncertainty for Semantic Segmentation using Operators in the RKHS
Rishabh Singh
José C. Príncipe
UQCV
180
3
0
03 Nov 2022
Previous
123...567...91011
Next