v1v2v3v4 (latest)

Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs

Neural Information Processing Systems (NeurIPS), 2018

27 February 2018

Dmitry Vetrov

Papers citing "Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs"

50 / 548 papers shown

Update Your Transformer to the Latest Release: Re-Basin of Task Vectors

247

28 May 2025

Exploring the Hidden Capacity of LLMs for One-Step Text Generation

Gleb Mezentsev

Ivan Oseledets

293

27 May 2025

Unveiling the Basin-Like Loss Landscape in Large Language Models

472

23 May 2025

The Unreasonable Effectiveness of Model Merging for Cross-Lingual Transfer in LLMs

Lucas Bandarkar

Nanyun Peng

MoMe LRM

319

23 May 2025

Inference-Time Decomposition of Activations (ITDA): A Scalable Approach to Interpreting Large Language Models

Patrick Leask

Neel Nanda

Noura Al Moubayed

308

23 May 2025

Semantic Aware Linear Transfer by Recycling Pre-trained Language Models for Cross-lingual TransferAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

331

16 May 2025

Connecting Independently Trained Modes via Layer-Wise Connectivity

453

05 May 2025

The effect of the number of parameters and the number of local feature patches on loss landscapes in distributed quantum neural networks

Yoshiaki Kawase

265

27 Apr 2025

Dynamic Fisher-weighted Model Merging via Bayesian OptimizationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

948

26 Apr 2025

Seeking Flat Minima over Diverse Surrogates for Improved Adversarial Transferability: A Theoretical Framework and Algorithmic Instantiation

252

23 Apr 2025

A Combinatorial Theory of Dropout: Subnetworks, Graph Geometry, and Generalization

Sahil Rajesh Dhayalkar

326

20 Apr 2025

Boosting-inspired online learning with transfer for railway maintenance

Diogo Risca

Afonso Lourenço

Goreti Marreiros

205

11 Apr 2025

Understanding Machine Unlearning Through the Lens of Mode Connectivity

Jiali Cheng

Hadi Amiri

952

08 Apr 2025

MASS: MoErging through Adaptive Subspace Selection

Donato Crisostomi

Alessandro Zirilli

Antonio Andrea Gargiulo

Maria Sofia Bucarelli

293

06 Apr 2025

Uncertainty-Aware Decomposed Hybrid Networks

282

24 Mar 2025

Finding Stable Subnetworks at Initialization with Dataset Distillation

Luke McDermott

Rahul Parhi

359

23 Mar 2025

Adiabatic Fine-Tuning of Neural Quantum States Enables Detection of Phase Transitions in Weight Space

399

21 Mar 2025

Aggregation on Learnable Manifolds for Asynchronous Federated Optimization

293

18 Mar 2025

On Local Posterior Structure in Deep EnsemblesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2025

Mikkel Jordahn

Jonas Vestergaard Jensen

Mikkel N. Schmidt

Michael Riis Andersen

UQCV BDL OOD

370

17 Mar 2025

Understanding Flatness in Generative Models: Its Role and Benefits

421

14 Mar 2025

Make Optimization Once and for All with Fine-grained Guidance

...

316

14 Mar 2025

From Task-Specific Models to Unified Systems: A Review of Model Merging Approaches

411

13 Mar 2025

Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis

1.3K

13 Mar 2025

Analyzing the Role of Permutation Invariance in Linear Mode ConnectivityInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2025

316

13 Mar 2025

Task Vector Quantization for Memory-Efficient Model Merging

260

10 Mar 2025

You Only Debias Once: Towards Flexible Accuracy-Fairness Trade-offs at Inference Time

839

10 Mar 2025

SplatPose: Geometry-Aware 6-DoF Pose Estimation from Single RGB Image via 3D Gaussian Splatting

345

07 Mar 2025

Paths and Ambient Spaces in Neural Loss LandscapesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2025

412

05 Mar 2025

Deep Learning is Not So Mysterious or Different

Andrew Gordon Wilson

374

03 Mar 2025

Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You ThinkComputer Vision and Pattern Recognition (CVPR), 2025

216

02 Mar 2025

Rethinking Spiking Neural Networks from an Ensemble Learning PerspectiveInternational Conference on Learning Representations (ICLR), 2025

276

20 Feb 2025

High-dimensional manifold of solutions in neural networks: insights from statistical physics

Enrico M. Malatesta

297

20 Feb 2025

Daily Land Surface Temperature Reconstruction in Landsat Cross-Track Areas Using Deep Ensemble Learning With Uncertainty Quantification

Shengjie Liu

Siqin Wang

Lu Zhang

123

20 Feb 2025

Unveiling Mode Connectivity in Graph Neural Networks

263

18 Feb 2025

SuperMerge: An Approach For Gradient-Based Model Merging

387

17 Feb 2025

LoRE-Merging: Exploring Low-Rank Estimation For Large Language Model Merging

363

15 Feb 2025

Ensembles of Low-Rank Expert AdaptersInternational Conference on Learning Representations (ICLR), 2025

443

31 Jan 2025

CENSOR: Defense Against Gradient Inversion via Orthogonal Subspace Bayesian SamplingNetwork and Distributed System Security Symposium (NDSS), 2025

739

28 Jan 2025

FedDAG: Federated Domain Adversarial Generation Towards Generalizable Medical Image AnalysisIEEE Transactions on Medical Imaging (IEEE TMI), 2025

180

28 Jan 2025

Evolutionary Optimization of Physics-Informed Neural Networks: Evo-PINN Frontiers and Opportunities

334

11 Jan 2025

Parameter-Efficient Interventions for Enhanced Model Merging

380

22 Dec 2024

Non-Uniform Parameter-Wise Model MergingBigData Congress [Services Society] (BSS), 2024

Albert Manuel Orozco Camacho

429

20 Dec 2024

LossLens: Diagnostics for Machine Learning through Loss Landscape Visual AnalyticsIEEE Computer Graphics and Applications (IEEE CG&A), 2024

...

297

17 Dec 2024

Meta Curvature-Aware Minimization for Domain Generalization

1.1K

16 Dec 2024

Implicit Neural Compression of Point Clouds

401

11 Dec 2024

How to Merge Your Multimodal Models Over Time?Computer Vision and Pattern Recognition (CVPR), 2024

Sebastian Dziadzio

Vishaal Udandarao

Karsten Roth

Christian Schroeder de Witt

406

09 Dec 2024

Task Arithmetic Through The Lens Of One-Shot Federated Learning

524

27 Nov 2024

FREE-Merging: Fourier Transform for Efficient Model Merging

Shenghe Zheng

Hongzhi Wang

MoMe

358

25 Nov 2024

Ex Uno Pluria: Insights on Ensembling in Low Precision Number SystemsNeural Information Processing Systems (NeurIPS), 2024

G. Nam

Juho Lee

343

22 Nov 2024

Stein Variational Newton Neural Network Ensembles

319

04 Nov 2024