Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2306.01708
Cited By
v1
v2 (latest)
TIES-Merging: Resolving Interference When Merging Models
Neural Information Processing Systems (NeurIPS), 2023
2 June 2023
Prateek Yadav
Derek Tam
Leshem Choshen
Colin Raffel
Joey Tianyi Zhou
MoMe
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (14 upvotes)
Github (179★)
Papers citing
"TIES-Merging: Resolving Interference When Merging Models"
50 / 356 papers shown
TRINITY: An Evolved LLM Coordinator
Jinglue Xu
Qi Sun
Peter Schwendeman
Stefan Nielsen
Edoardo Cetin
Yujin Tang
LLMAG
288
0
0
04 Dec 2025
Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates
Atsuki Yamaguchi
Terufumi Morishita
Aline Villavicencio
Nikolaos Aletras
CLL
276
0
0
04 Dec 2025
An Empirical Survey of Model Merging Algorithms for Social Bias Mitigation
Daiki Shirafuji
Tatsuhiko Saito
Yasutomo Kimura
MoMe
KELM
151
0
0
02 Dec 2025
Basis-Oriented Low-rank Transfer for Few-Shot and Test-Time Adaptation
Junghwan Park
Woojin Cho
J. Heo
Darongsae Kwon
Kookjin Lee
102
0
0
02 Dec 2025
Stay Unique, Stay Efficient: Preserving Model Personality in Multi-Task Merging
Kuangpu Guo
Yuhe Ding
Jian Liang
Zilei Wang
Ran He
MoMe
142
0
0
01 Dec 2025
From Coefficients to Directions: Rethinking Model Merging with Directional Alignment
Zhikang Chen
Sen Cui
Deheng Ye
Min Zhang
Gang Niu
Yu Zhang
Masashi Sugiyama
Tingting Zhu
MoMe
207
0
0
29 Nov 2025
A Systematic Study of Model Merging Techniques in Large Language Models
Oğuz Kağan Hitit
Leander Girrbach
Zeynep Akata
MoMe
349
2
0
26 Nov 2025
Towards Benign Memory Forgetting for Selective Multimodal Large Language Model Unlearning
Zhen Zeng
Leijiang Gu
Zhangling Duan
Feng-Qiang Li
Zenglin Shi
Cees G. M. Snoek
Meng Wang
KELM
MU
CLL
295
1
0
25 Nov 2025
MergeVLA: Cross-Skill Model Merging Toward a Generalist Vision-Language-Action Agent
Yuxia Fu
Zhizhen Zhang
Y. Zhang
Zijian Wang
Zi-Rui Huang
Yadan Luo
MoMe
357
3
0
24 Nov 2025
Escaping Optimization Stagnation: Taking Steps Beyond Task Arithmetic via Difference Vectors
Jinping Wang
Zhiqiang Gao
Dinggen Zhang
Zhiwu Xie
MoMe
320
0
0
22 Nov 2025
MergeSlide: Continual Model Merging and Task-to-Class Prompt-Aligned Inference for Lifelong Learning on Whole Slide Images
Doanh C. Bui
Ba-Hung Ngo
H. Pham
Khang Phuoc-Quy Nguyen
Maï K. Nguyen
Y. Nakashima
CLL
MoMe
VLM
344
0
0
17 Nov 2025
A Novel Hierarchical Integration Method for Efficient Model Merging in Medical LLMs
Prakrit Timilsina
Anuj Nepal
Rajan Kadel
Robin Doss
MoMe
127
0
0
17 Nov 2025
Defending Unauthorized Model Merging via Dual-Stage Weight Protection
Wei-Jia Chen
Min-Yen Tsai
Cheng-Yi Lee
Chia-Mu Yu
MoMe
AAML
426
0
0
14 Nov 2025
Continual Unlearning for Text-to-Image Diffusion Models: A Regularization Perspective
Justin Lee
Zheda Mai
Jinsu Yoo
Chongyu Fan
Cheng Zhang
Wei-Lun Chao
DiffM
VLM
245
0
0
11 Nov 2025
Ghost in the Transformer: Detecting Model Reuse with Invariant Spectral Signatures
Suqing Wang
Ziyang Ma
Xinyi Li
Zuchao Li
179
0
0
09 Nov 2025
Model Merging Improves Zero-Shot Generalization in Bioacoustic Foundation Models
Davide Marincione
Donato Crisostomi
Roberto Dessi
Emanuele Rodolà
Emanuele Rossi
MoMe
AI4CE
VLM
366
1
0
07 Nov 2025
Steering Language Models with Weight Arithmetic
Constanza Fierro
Fabien Roger
MoMe
LLMSV
587
3
0
07 Nov 2025
Merging Continual Pretraining Models for Domain-Specialized LLMs: A Case Study in Finance
Kentaro Ueda
François Portet
H. Suwa
Keiichi Yasumoto
CLL
MoMe
424
0
0
04 Nov 2025
T3: Test-Time Model Merging in VLMs for Zero-Shot Medical Imaging Analysis
Raza Imam
Hu Wang
Dwarikanath Mahapatra
Mohammad Yaqub
MoMe
314
0
0
31 Oct 2025
Parameterized Prompt for Incremental Object Detection
Zijia An
Boyu Diao
R. Liu
Libo Huang
Chuanguang Yang
Fei Wang
Zhulin An
Yongjun Xu
CLL
VLM
256
0
0
31 Oct 2025
WeaveRec: An LLM-Based Cross-Domain Sequential Recommendation Framework with Model Merging
Min Hou
Xin Liu
Le Wu
Chenyi He
Hao Liu
Z. Li
Xin Li
Si Wei
MoMe
340
0
0
30 Oct 2025
World Simulation with Video Foundation Models for Physical AI
Nvidia
A. M. Ali
Junjie Bai
Maciej Bala
Yogesh Balaji
...
Jing Zhang
Qinsheng Zhang
Kaiwen Zheng
Andrew Zhu
Yuke Zhu
VGen
PINN
602
41
0
28 Oct 2025
Eigen-Value: Efficient Domain-Robust Data Valuation via Eigenvalue-Based Approach
Youngjun Choi
Joonseong Kang
Sungjun Lim
Kyungwoo Song
265
0
0
27 Oct 2025
Multi-Task Vehicle Routing Solver via Mixture of Specialized Experts under State-Decomposable MDP
Yuxin Pan
Zhiguang Cao
Chengyang Gu
Liu Liu
Peilin Zhao
Yize Chen
Fangzhen Lin
207
0
0
24 Oct 2025
Model Merging with Functional Dual Anchors
Kexuan Shi
Yandong Wen
Weiyang Liu
MoMe
309
2
0
24 Oct 2025
Mapping Post-Training Forgetting in Language Models at Scale
Jackson Harmon
Andreas Hochlehnert
Matthias Bethge
Ameya Prabhu
CLL
KELM
187
1
0
20 Oct 2025
Hierarchical Federated Unlearning for Large Language Models
Yisheng Zhong
Zhengbang Yang
Zhuangdi Zhu
MU
246
0
0
19 Oct 2025
MIN-Merging: Merge the Important Neurons for Model Merging
Yunfei Liang
MoMe
591
0
0
18 Oct 2025
DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning
Shih-yang Liu
Xin Dong
Ximing Lu
Shizhe Diao
Mingjie Liu
...
Yu Wang
K.-T. Cheng
Yejin Choi
Jan Kautz
Pavlo Molchanov
143
10
0
16 Oct 2025
Backdoor Unlearning by Linear Task Decomposition
Amel Abdelraheem
Alessandro Favero
Gérôme Bovet
Pascal Frossard
AAML
MU
246
0
0
16 Oct 2025
Directional Reasoning Injection for Fine-Tuning MLLMs
Chao Huang
Zeliang Zhang
Jiang Liu
Ximeng Sun
Jialian Wu
X. Yu
Ze Wang
Chenliang Xu
Emad Barsoum
Zicheng Liu
MoMe
LRM
269
2
0
16 Oct 2025
Harmonizing Diverse Models: A Layer-wise Merging Strategy for Consistent Generation
Xujun Peng
Anoop Kumar
Jingyu Wu
Parker Glenn
Daben Liu
MoMe
192
0
0
16 Oct 2025
Purifying Task Vectors in Knowledge-Aware Subspace for Model Merging
Bang An
Yibo Yang
Philip Torr
Bernard Ghanem
MoMe
191
1
0
16 Oct 2025
Weight Weaving: Parameter Pooling for Data-Free Model Merging
Levy G. Chaves
Eduardo Valle
Sandra Avila
MoMe
254
1
0
15 Oct 2025
Towards Reversible Model Merging For Low-rank Weights
Mohammadsajad Alipour
Mohammad Mohammadi Amiri
MoMe
175
0
0
15 Oct 2025
REAP the Experts: Why Pruning Prevails for One-Shot MoE compression
Mike Lasby
Ivan Lazarevich
Nish Sinnadurai
Sean Lie
Yani Andrew Ioannou
Vithursan Thangarasa
149
5
0
15 Oct 2025
Exploring and Leveraging Class Vectors for Classifier Editing
Jaeik Kim
Jaeyoung Do
VLM
212
0
0
13 Oct 2025
On-device System of Compositional Multi-tasking in Large Language Models
Ondrej Bohdal
Konstantinos Theodosiadis
Asterios Mpatziakas
Dimitris Filippidis
Iro Spyrou
...
Kyeng-Hun Lee
J. Moon
Hyeonmok Ko
Mete Ozay
Umberto Michieli
138
1
0
11 Oct 2025
Towards Efficient Multimodal Unified Reasoning Model via Model Merging
Qixiang Yin
Huanjin Yao
Jianghao Chen
Jiaxing Huang
Z. Zhao
Fei Su
LRM
MoMe
335
1
0
10 Oct 2025
Don't Throw Away Your Pretrained Model
Shangbin Feng
Wenhao Yu
Yike Wang
Hongming Zhang
Yulia Tsvetkov
Dong Yu
MoMe
254
4
0
10 Oct 2025
Diagnosing and Mitigating System Bias in Self-Rewarding RL
Chuyi Tan
Peiwen Yuan
Xinglin Wang
Yiwei Li
Shaoxiong Feng
...
Jiayi Shi
Ji Zhang
Boyuan Pan
Yao Hu
Kan Li
132
0
0
10 Oct 2025
Backdoor Vectors: a Task Arithmetic View on Backdoor Attacks and Defenses
Stanisław Pawlak
Jan Dubiñski
Daniel Marczak
Bartłomiej Twardowski
AAML
MoMe
250
0
0
09 Oct 2025
Do We Really Need Permutations? Impact of Model Width on Linear Mode Connectivity
Akira Ito
Masanori Yamada
Daiki Chijiwa
Atsutoshi Kumagai
MoMe
213
0
0
09 Oct 2025
FedQS: Optimizing Gradient and Model Aggregation for Semi-Asynchronous Federated Learning
Yunbo Li
Jiaping Gui
Zhihang Deng
Fanchao Meng
Yue Wu
FedML
381
9
0
09 Oct 2025
Gradient-Sign Masking for Task Vector Transport Across Pre-Trained Models
Filippo Rinaldi
Aniello Panariello
Giacomo Salici
Fengyuan Liu
Marco Ciccone
Angelo Porrello
Simone Calderara
205
1
0
07 Oct 2025
BaldWhisper: Faster Whisper with Head Shearing and Layer Merging
Yaya Sy
Christophe Cerisara
Irina Illina
106
0
0
06 Oct 2025
FedSRD: Sparsify-Reconstruct-Decompose for Communication-Efficient Federated Large Language Models Fine-Tuning
Guochen Yan
Luyuan Xie
Qingni Shen
Yuejian Fang
Zhonghai Wu
FedML
222
1
0
06 Oct 2025
Boomerang Distillation Enables Zero-Shot Model Size Interpolation
Sara Kangaslahti
Nihal V. Nayak
Jonathan Geuter
Marco Fumero
Francesco Locatello
David Alvarez-Melis
198
1
0
06 Oct 2025
REPAIR: Robust Editing via Progressive Adaptive Intervention and Reintegration
Yisu Wang
Ming Wang
Haoyuan Song
Wenjie Huang
Chaozheng Wang
Yi Xie
Xuming Ran
KELM
MoMe
CLL
166
1
0
02 Oct 2025
Expert Merging: Model Merging with Unsupervised Expert Alignment and Importance-Guided Layer Chunking
Dengming Zhang
Xiaowen Ma
Zhenliang Ni
Zhenkai Wu
Han Shu
Xin Jiang
Xinghao Chen
MoMe
173
3
0
30 Sep 2025
1
2
3
4
5
6
7
8
Next
Page 1 of 8