Optimizing Mode Connectivity via Neuron Alignment

Neural Information Processing Systems (NeurIPS), 2020

5 September 2020

Papers citing "Optimizing Mode Connectivity via Neuron Alignment"

50 / 75 papers shown

A Systematic Study of In-the-Wild Model Merging for Large Language Models

369

26 Nov 2025

Can MLLMs Absorb Math Reasoning Abilities from LLMs as Free Lunch?

156

16 Oct 2025

Rethinking Layer-wise Model Merging through Chain of Merges

234

29 Aug 2025

Generalized Linear Mode Connectivity for Transformers

444

28 Jun 2025

Circumventing Backdoor Space via Weight Symmetry

302

09 Jun 2025

Decom-Renorm-Merge: Model Merging on the Right Space Improves Multitasking

Yuatyong Chaichana

Thanapat Trachu

Peerat Limkonchotiwat

Konpat Preechakul

Tirasan Khandhawit

Ekapol Chuangsuwanich

MoMe

666

29 May 2025

Understanding Mode Connectivity via Parameter Space Symmetry

680

29 May 2025

Sparse Training from Random Initialization: Aligning Lottery Ticket Masks using Weight Symmetry

377

08 May 2025

Aggregation on Learnable Manifolds for Asynchronous Federated Optimization

400

18 Mar 2025

From Task-Specific Models to Unified Systems: A Review of Model Merging Approaches

462

13 Mar 2025

Paths and Ambient Spaces in Neural Loss Landscapes

International Conference on Artificial Intelligence and Statistics (AISTATS), 2025

512

05 Mar 2025

Low-Rank and Sparse Model Merging for Multi-Lingual Speech Recognition and Translation

1.1K

24 Feb 2025

Sens-Merging: Sensitivity-Guided Parameter Balancing for Merging Large Language Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

420

20 Feb 2025

Unveiling Mode Connectivity in Graph Neural Networks

294

18 Feb 2025

Beyond the Permutation Symmetry of Transformers: The Role of Rotation for Model Fusion

769

01 Feb 2025

Merging Feed-Forward Sublayers for Compressed Transformers

418

10 Jan 2025

Training-free Heterogeneous Model Merging

548

03 Jan 2025

Non-Uniform Parameter-Wise Model Merging

BigData Congress [Services Society] (BSS), 2024

Albert Manuel Orozco Camacho

458

20 Dec 2024

MoD: A Distribution-Based Approach for Merging Large Language Models

Quy-Anh Dang

Chris Ngo

MoMe VLM

313

01 Nov 2024

Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging

Lefei Zhang

Dacheng Tao

232

29 Oct 2024

Deep Model Merging: The Sister of Neural Network Interpretability -- A Survey

Kyle Chard

292

16 Oct 2024

Exploring Model Kinship for Merging Large Language Models

504

16 Oct 2024

Revisiting Multi-Permutation Equivariance through the Lens of Irreducible Representations

International Conference on Learning Representations (ICLR), 2024

Yonatan Sverdlov

Ido Springer

Nadav Dym

496

09 Oct 2024

What Matters for Model Merging at Scale?

Prateek Yadav

Tu Vu

Jonathan Lai

Alexandra Chronopoulou

Manaal Faruqui

Joey Tianyi Zhou

Tsendsuren Munkhdalai

MoMe

296

04 Oct 2024

Parameter Competition Balancing for Model Merging

Neural Information Processing Systems (NeurIPS), 2024

Jing Li

...

Min Zhang

277

03 Oct 2024

Foldable SuperNets: Scalable Merging of Transformers with Different Initializations and Tasks

408

02 Oct 2024

Weight Scope Alignment: A Frustratingly Easy Method for Model Merging

European Conference on Artificial Intelligence (ECAI), 2024

364

22 Aug 2024

SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models

Anke Tang

Li Shen

Yong Luo

Shuai Xie

Han Hu

Lefei Zhang

Di Lin

Dacheng Tao

MoMe

396

19 Aug 2024

Computer Audition: From Task-Specific Machine Learning to Foundation Models

Andreas Triantafyllopoulos

456

22 Jul 2024

Training-Free Model Merging for Multi-target Domain Adaptation

Hao Zhao

274

18 Jul 2024

Harmony in Diversity: Merging Neural Networks with Canonical Correlation Analysis

Stefan Horoi

Albert Manuel Orozco Camacho

Eugene Belilovsky

Guy Wolf

FedML MoMe

275

07 Jul 2024

Investigating the Pre-Training Dynamics of In-Context Learning: Task Recognition vs. Task Learning

Xiaolei Wang

Xinyu Tang

Wayne Xin Zhao

Ji-Rong Wen

306

20 Jun 2024

Towards Efficient Pareto Set Approximation via Mixture of Experts Based Model Fusion

Li Shen

Han Hu

240

14 Jun 2024

The Empirical Impact of Neural Parameter Symmetries, or Lack Thereof

588

30 May 2024

Navigating the Safety Landscape: Measuring Risks in Finetuning Large Language Models

359

27 May 2024

Visualizing, Rethinking, and Mining the Loss Landscape of Deep Neural Networks

408

21 May 2024

Simultaneous linear connectivity of neural networks modulo permutation

Gintare Karolina Dziugaite

494

09 Apr 2024

Continual Learning with Weight Interpolation

528

05 Apr 2024

Out-of-Distribution Detection via Deep Multi-Comprehension Ensemble

Zirui Xu

308

24 Mar 2024

Arcee's MergeKit: A Toolkit for Merging Large Language Models

804

187

20 Mar 2024

Fisher Mask Nodes for Language Model Merging

International Conference on Language Resources and Evaluation (LREC), 2024

474

14 Mar 2024

Training-Free Pretrained Model Merging

442

04 Mar 2024

Merging Text Transformer Models from Different Initializations

Neha Verma

Maha Elbayad

MoMe

405

01 Mar 2024

Training Neural Networks from Scratch with Parallel Low-Rank Adapters

359

26 Feb 2024

Improving Model Fusion by Training-time Neuron Alignment with Fixed Neuron Anchors

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

464

02 Feb 2024

Merging Multi-Task Models via Weight-Ensembling Mixture of Experts

Li Shen

Nan Yin

347

01 Feb 2024

Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion

Li Shen

Liang Ding

Bo Du

371

11 Dec 2023

Train 'n Trade: Foundations of Parameter Markets

Neural Information Processing Systems (NeurIPS), 2023

225

07 Dec 2023

Merging by Matching Models in Task Parameter Subspaces

Derek Tam

Mohit Bansal

Colin Raffel

MoMe

366

07 Dec 2023

Proving Linear Mode Connectivity of Neural Networks via Optimal Transport

International Conference on Artificial Intelligence and Statistics (AISTATS), 2023

449

29 Oct 2023