ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.06296
  4. Cited By
The Role of Permutation Invariance in Linear Mode Connectivity of Neural
  Networks
v1v2 (latest)

The Role of Permutation Invariance in Linear Mode Connectivity of Neural Networks

International Conference on Learning Representations (ICLR), 2021
12 October 2021
R. Entezari
Hanie Sedghi
O. Saukh
Behnam Neyshabur
    MoMe
ArXiv (abs)PDFHTML

Papers citing "The Role of Permutation Invariance in Linear Mode Connectivity of Neural Networks"

50 / 212 papers shown
Title
A Systematic Study of Model Merging Techniques in Large Language Models
A Systematic Study of Model Merging Techniques in Large Language Models
Oğuz Kağan Hitit
Leander Girrbach
Zeynep Akata
MoMe
161
0
0
26 Nov 2025
Model Merging Improves Zero-Shot Generalization in Bioacoustic Foundation Models
Model Merging Improves Zero-Shot Generalization in Bioacoustic Foundation Models
Davide Marincione
Donato Crisostomi
Roberto Dessi
Emanuele Rodolà
Emanuele Rossi
MoMeAI4CEVLM
191
0
0
07 Nov 2025
Linear Mode Connectivity under Data Shifts for Deep Ensembles of Image Classifiers
Linear Mode Connectivity under Data Shifts for Deep Ensembles of Image Classifiers
C. Hepburn
T. Zielke
A.P. Raulf
61
0
0
06 Nov 2025
Keys in the Weights: Transformer Authentication Using Model-Bound Latent Representations
Keys in the Weights: Transformer Authentication Using Model-Bound Latent Representations
Ayşe S. Okatan
Mustafa İlhan Akbaş
Laxima Niure Kandel
Berker Peköz
52
0
0
02 Nov 2025
Weight Weaving: Parameter Pooling for Data-Free Model Merging
Weight Weaving: Parameter Pooling for Data-Free Model Merging
Levy G. Chaves
Eduardo Valle
Sandra Avila
MoMe
123
0
0
15 Oct 2025
One Token Embedding Is Enough to Deadlock Your Large Reasoning Model
One Token Embedding Is Enough to Deadlock Your Large Reasoning Model
Mohan Zhang
Yihua Zhang
Jinghan Jia
Zhangyang Wang
Sijia Liu
Tianlong Chen
SILMLRM
162
0
0
12 Oct 2025
Robustness and Regularization in Hierarchical Re-Basin
Robustness and Regularization in Hierarchical Re-BasinThe European Symposium on Artificial Neural Networks (ESANN), 2025
Benedikt Franke
Florian Heinrich
Markus Lange
Arne P. Raulf
MoMeAAML
187
1
0
10 Oct 2025
Do We Really Need Permutations? Impact of Width Expansion on Linear Mode Connectivity
Do We Really Need Permutations? Impact of Width Expansion on Linear Mode Connectivity
Akira Ito
Masanori Yamada
Daiki Chijiwa
Atsutoshi Kumagai
MoMe
125
0
0
09 Oct 2025
Improving Chain-of-Thought Efficiency for Autoregressive Image Generation
Improving Chain-of-Thought Efficiency for Autoregressive Image Generation
Zeqi Gu
Markos Georgopoulos
Xiaoliang Dai
Marjan Ghazvininejad
Chu Wang
...
Zecheng He
Zijian He
Jiawei Zhou
Abe Davis
Jialiang Wang
LRM
68
0
0
07 Oct 2025
How does the optimizer implicitly bias the model merging loss landscape?
How does the optimizer implicitly bias the model merging loss landscape?
Chenxiang Zhang
Alexander Theus
Damien Teney
Antonio Orvieto
Jun Pang
S. Mauw
MoMe
122
0
0
06 Oct 2025
Topological Invariance and Breakdown in Learning
Topological Invariance and Breakdown in Learning
Yongyi Yang
Tomaso Poggio
Isaac Chuang
Liu Ziyin
56
0
0
03 Oct 2025
Null-Space Filtering for Data-Free Continual Model Merging: Preserving Transparency, Promoting Fidelity
Null-Space Filtering for Data-Free Continual Model Merging: Preserving Transparency, Promoting Fidelity
Zihuan Qiu
Lei Wang
Yang Cao
Runtong Zhang
Bing Su
Yi Xu
Fanman Meng
Linfeng XU
Qingbo Wu
Hongliang Li
61
0
0
25 Sep 2025
Model Unmerging: Making Your Models Unmergeable for Secure Model Sharing
Model Unmerging: Making Your Models Unmergeable for Secure Model Sharing
Zihao Wang
Enneng Yang
L. Yin
Shiwei Liu
Li Shen
FedMLMoMe
124
0
0
01 Sep 2025
On Task Vectors and Gradients
On Task Vectors and Gradients
Luca Zhou
Daniele Solombrino
Donato Crisostomi
Maria Sofia Bucarelli
Giuseppe Alessio D’Inverno
Fabrizio Silvestri
Emanuele Rodolà
MoMe
289
1
0
22 Aug 2025
On the Surprising Effectiveness of a Single Global Merging in Decentralized Learning
On the Surprising Effectiveness of a Single Global Merging in Decentralized Learning
Tongtian Zhu
Tianyu Zhang
Mingze Wang
Zhanpeng Zhou
Can Wang
FedMLMoMe
192
0
0
09 Jul 2025
Generalized Linear Mode Connectivity for Transformers
Generalized Linear Mode Connectivity for Transformers
Alexander Theus
Alessandro Cabodi
Sotiris Anagnostidis
Antonio Orvieto
Sidak Pal Singh
Valentina Boeva
85
1
0
28 Jun 2025
Subspace-Boosted Model Merging
Subspace-Boosted Model Merging
Ronald Skorobogat
Karsten Roth
Mariana-Iuliana Georgescu
MoMe
311
2
0
19 Jun 2025
Symmetry in Neural Network Parameter Spaces
Symmetry in Neural Network Parameter Spaces
Bo Zhao
Robin Walters
Rose Yu
221
6
0
16 Jun 2025
The Butterfly Effect: Neural Network Training Trajectories Are Highly Sensitive to Initial Conditions
The Butterfly Effect: Neural Network Training Trajectories Are Highly Sensitive to Initial Conditions
Devin Kwok
Gül Sena Altıntaş
Colin Raffel
David Rolnick
294
2
0
16 Jun 2025
A correlation-permutation approach for speech-music encoders model merging
A correlation-permutation approach for speech-music encoders model merging
Fabian Ritter-Gutierrez
Yi-Cheng Lin
Jeremy H.M Wong
Hung-yi Lee
Eng Siong Chng
Nancy F. Chen
MoMe
175
2
0
13 Jun 2025
Sharper Convergence Rates for Nonconvex Optimisation via Reduction Mappings
Evan Markou
Thalaiyasingam Ajanthan
Stephen Gould
213
0
0
10 Jun 2025
Circumventing Backdoor Space via Weight Symmetry
Circumventing Backdoor Space via Weight Symmetry
Jie Peng
Hongwei Yang
Jing Zhao
Hengji Dong
Hui He
Weizhe Zhang
Haoyu He
AAML
148
0
0
09 Jun 2025
Come Together, But Not Right Now: A Progressive Strategy to Boost Low-Rank Adaptation
Come Together, But Not Right Now: A Progressive Strategy to Boost Low-Rank Adaptation
Zhan Zhuang
Xiequn Wang
Wei Li
Yulong Zhang
Qiushi Huang
...
Yanbin Wei
Yuhe Nie
Kede Ma
Yu Zhang
Ying Wei
209
0
0
06 Jun 2025
Smooth Model Compression without Fine-Tuning
Smooth Model Compression without Fine-Tuning
Christina Runkel
Natacha Kuete Meli
Jovita Lukasik
A. Biguri
Carola-Bibiane Schönlieb
Michael Moeller
165
0
0
30 May 2025
Decom-Renorm-Merge: Model Merging on the Right Space Improves Multitasking
Decom-Renorm-Merge: Model Merging on the Right Space Improves Multitasking
Yuatyong Chaichana
Thanapat Trachu
Peerat Limkonchotiwat
Konpat Preechakul
Tirasan Khandhawit
Ekapol Chuangsuwanich
MoMe
485
0
0
29 May 2025
Understanding Mode Connectivity via Parameter Space Symmetry
Understanding Mode Connectivity via Parameter Space Symmetry
B. Zhao
Nima Dehmamy
Robin Walters
Rose Yu
417
10
0
29 May 2025
Update Your Transformer to the Latest Release: Re-Basin of Task Vectors
Update Your Transformer to the Latest Release: Re-Basin of Task Vectors
Filippo Rinaldi
Giacomo Capitani
Lorenzo Bonicelli
Donato Crisostomi
Federico Bolelli
E. Ficarra
Emanuele Rodolà
Simone Calderara
Angelo Porrello
166
8
0
28 May 2025
Robust fine-tuning of speech recognition models via model merging: application to disordered speech
Robust fine-tuning of speech recognition models via model merging: application to disordered speech
Alexandre Ducorroy
Rachid Riad
MoMe
174
1
0
26 May 2025
Leveraging Per-Instance Privacy for Machine Unlearning
Leveraging Per-Instance Privacy for Machine Unlearning
N. Sepahvand
Anvith Thudi
Berivan Isik
Ashmita Bhattacharyya
Nicolas Papernot
Eleni Triantafillou
Daniel M. Roy
Gintare Karolina Dziugaite
MUFedML
235
1
0
24 May 2025
Distilling a speech and music encoder with task arithmetic
Distilling a speech and music encoder with task arithmetic
Fabian Ritter-Gutierrez
Yi-Cheng Lin
Jui-Chiang Wei
Jeremy H.M Wong
Eng Siong Chng
Nancy F. Chen
Hung-yi Lee
176
6
0
19 May 2025
MINGLE: Mixture of Null-Space Gated Low-Rank Experts for Test-Time Continual Model Merging
MINGLE: Mixture of Null-Space Gated Low-Rank Experts for Test-Time Continual Model Merging
Zihuan Qiu
Yi Xu
Chiyuan He
Fanman Meng
Linfeng Xu
Qi Wu
Hongliang Li
CLLMoMe
256
3
0
17 May 2025
Sparse Training from Random Initialization: Aligning Lottery Ticket Masks using Weight Symmetry
Sparse Training from Random Initialization: Aligning Lottery Ticket Masks using Weight Symmetry
Mohammed Adnan
Rohan Jain
Ekansh Sharma
Rahul Krishnan
Yani Andrew Ioannou
251
1
0
08 May 2025
Dynamic Fisher-weighted Model Merging via Bayesian Optimization
Dynamic Fisher-weighted Model Merging via Bayesian OptimizationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025
Sanwoo Lee
Jiahao Liu
Qifan Wang
Jiadong Wang
Xunliang Cai
Yunfang Wu
MoMe
851
4
0
26 Apr 2025
Domain-Specific Pruning of Large Mixture-of-Experts Models with Few-shot Demonstrations
Domain-Specific Pruning of Large Mixture-of-Experts Models with Few-shot Demonstrations
Zican Dong
Han Peng
Peiyu Liu
Wayne Xin Zhao
Dong Wu
Feng Xiao
Liang Luo
MoE
238
3
0
09 Apr 2025
MASS: MoErging through Adaptive Subspace Selection
MASS: MoErging through Adaptive Subspace Selection
Donato Crisostomi
Alessandro Zirilli
Antonio Andrea Gargiulo
Maria Sofia Bucarelli
Simone Scardapane
Fabrizio Silvestri
Iacopo Masi
Emanuele Rodolà
MoMe
235
0
0
06 Apr 2025
Model Assembly Learning with Heterogeneous Layer Weight Merging
Model Assembly Learning with Heterogeneous Layer Weight Merging
Yi-Kai Zhang
Jin Wang
Xu-Xiang Zhong
De-Chuan Zhan
Han-Jia Ye
MoMe
230
1
0
27 Mar 2025
Finding Stable Subnetworks at Initialization with Dataset Distillation
Finding Stable Subnetworks at Initialization with Dataset Distillation
Luke McDermott
Rahul Parhi
DD
257
0
0
23 Mar 2025
Structure Is Not Enough: Leveraging Behavior for Neural Network Weight Reconstruction
Structure Is Not Enough: Leveraging Behavior for Neural Network Weight Reconstruction
Léo Meynent
Ivan Melev
Konstantin Schurholt
Göran Kauermann
Damian Borth
277
5
0
21 Mar 2025
Adiabatic Fine-Tuning of Neural Quantum States Enables Detection of Phase Transitions in Weight Space
Adiabatic Fine-Tuning of Neural Quantum States Enables Detection of Phase Transitions in Weight Space
Vinicius Hernandes
Thomas Spriggs
Saqar Khaleefah
E. Greplova
261
5
0
21 Mar 2025
From Task-Specific Models to Unified Systems: A Review of Model Merging Approaches
Wei Ruan
Tianze Yang
Yimiao Zhou
Tianming Liu
Jin Lu
MoMe
297
6
0
13 Mar 2025
Analyzing the Role of Permutation Invariance in Linear Mode ConnectivityInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Keyao Zhan
Puheng Li
Lei Wu
MoMe
245
1
0
13 Mar 2025
Machine Learning meets Algebraic Combinatorics: A Suite of Datasets Capturing Research-level Conjecturing Ability in Pure Mathematics
Herman Chau
Helen Jenne
Davis Brown
Jesse He
Mark Raugas
Sara Billey
Henry Kvinge
178
1
0
09 Mar 2025
SplatPose: Geometry-Aware 6-DoF Pose Estimation from Single RGB Image via 3D Gaussian Splatting
Linqi Yang
Xiongwei Zhao
Qihao Sun
Ke Wang
Ao Chen
Peng Kang
3DGS
235
9
0
07 Mar 2025
GNNMerge: Merging of GNN Models Without Accessing Training Data
GNNMerge: Merging of GNN Models Without Accessing Training Data
Vipul Garg
Ishita Thakre
Sayan Ranu
MoMe
474
0
0
05 Mar 2025
Paths and Ambient Spaces in Neural Loss LandscapesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Daniel Dold
Julius Kobialka
Nicolai Palm
Emanuel Sommer
David Rügamer
Oliver Durr
AI4CE
323
1
0
05 Mar 2025
In-Model Merging for Enhancing the Robustness of Medical Imaging Classification Models
In-Model Merging for Enhancing the Robustness of Medical Imaging Classification Models
Hu Wang
Ibrahim Almakky
Congbo Ma
Numan Saeed
Mohammad Yaqub
MoMe
338
2
0
27 Feb 2025
Low-Rank and Sparse Model Merging for Multi-Lingual Speech Recognition and Translation
Low-Rank and Sparse Model Merging for Multi-Lingual Speech Recognition and Translation
Qiuming Zhao
Guangzhi Sun
Chao Zhang
MoMeVLM
904
3
0
24 Feb 2025
Sens-Merging: Sensitivity-Guided Parameter Balancing for Merging Large Language Models
Sens-Merging: Sensitivity-Guided Parameter Balancing for Merging Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Shuqi Liu
Han Wu
Bowei He
Xiongwei Han
Mingxuan Yuan
Linqi Song
MoMe
256
4
0
20 Feb 2025
High-dimensional manifold of solutions in neural networks: insights from statistical physics
High-dimensional manifold of solutions in neural networks: insights from statistical physics
Enrico M. Malatesta
246
5
0
20 Feb 2025
Unveiling Mode Connectivity in Graph Neural Networks
Unveiling Mode Connectivity in Graph Neural Networks
Bingheng Li
Z. Chen
Haoyu Han
Shenglai Zeng
J. Liu
Shucheng Zhou
192
1
0
18 Feb 2025
12345
Next