Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging

17 June 2024
Zhenyi Lu, Chenghao Fan, Wei Wei, Xiaoye Qu, Dangyang Chen, Yu Cheng
MoMe

Papers citing "Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging"

43 citing papers shown.

Stay Unique, Stay Efficient: Preserving Model Personality in Multi-Task Merging
Kuangpu Guo, Yuhe Ding, Jian Liang, Zilei Wang, Ran He
MoMe
01 Dec 2025

A Systematic Study of Model Merging Techniques in Large Language Models
Oğuz Kağan Hitit, Leander Girrbach, Zeynep Akata
MoMe
26 Nov 2025

Defending Unauthorized Model Merging via Dual-Stage Weight Protection
Wei-Jia Chen, Min-Yen Tsai, Cheng-Yi Lee, Chia-Mu Yu
MoMe, AAML
14 Nov 2025

T3: Test-Time Model Merging in VLMs for Zero-Shot Medical Imaging Analysis
Raza Imam, Hu Wang, Dwarikanath Mahapatra, Mohammad Yaqub
MoMe
31 Oct 2025

The Thinking Spectrum: An Empirical Study of Tunable Reasoning in LLMs through Model Merging
Xiaochong Lan, Yu Zheng, Shiteng Cao, Yong Li
MoMe, LRM
26 Sep 2025

Black-box Model Merging for Language-Model-as-a-Service with Massive Model Repositories
Shilian Chen, Jie Zhou, Tianyu Huai, Y. Lu, Junsong Li, ..., Y. Yang, Xin Li, Qin Chen, Hang Yan, Liang He
MoMe
16 Sep 2025

On Task Vectors and Gradients
Luca Zhou, Daniele Solombrino, Donato Crisostomi, Maria Sofia Bucarelli, Giuseppe Alessio D’Inverno, Fabrizio Silvestri, Emanuele Rodolà
MoMe
22 Aug 2025

Tensorized Clustered LoRA Merging for Multi-Task Interference
Zhan Su, Fengran Mo, G. Liang, Jinghan Zhang, Bingbing Wen, Prayag Tiwari, Jian-Yun Nie
MoMe
06 Aug 2025

Don't Overthink It: A Survey of Efficient R1-style Large Reasoning Models
Linan Yue, Yichao Du, Yizhi Wang, W. Gao, Fangzhou Yao, ..., Ye Liu, Ziyu Xu, Qi Liu, Shimin Di, Xiaoshi Zhong
LRM
04 Aug 2025

STORM-BORN: A Challenging Mathematical Derivations Dataset Curated via a Human-in-the-Loop Multi-Agent Framework
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Wenhao Liu, Zhenyi Lu, Xinyu Hu, Jierui Zhang, Dailin Li, ..., Pei Zhang, Chengbo Zhang, Yuxiang Ren, Xiaohong Huang, Yan Ma
OffRL
02 Jun 2025

Navigating the Accuracy-Size Trade-Off with Flexible Model Merging
Akash Dhasade, Divyansh Jhunjhunwala, Milos Vujasinovic, Gauri Joshi, Anne-Marie Kermarrec
MoMe
29 May 2025

Towards Minimizing Feature Drift in Model Merging: Layer-wise Task Vector Fusion for Adaptive Knowledge Integration
Wenju Sun, Qingyong Li, Wen Wang, Yang Liu, Yangli-ao Geng, Boyang Li
MoMe
29 May 2025

Unraveling LoRA Interference: Orthogonal Subspaces for Robust Model Merging
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Haobo Zhang, Jiayu Zhou
MoMe
28 May 2025

CarboFormer: A Lightweight Semantic Segmentation Architecture for Efficient Carbon Dioxide Detection Using Optical Gas Imaging
Taminul Islam, Toqi Tahamid Sarker, M. Embaby, Khaled R Ahmed, A. AbuGhazaleh
23 May 2025

Activation-Guided Consensus Merging for Large Language Models
Yuxuan Yao, Shuqi Liu, Zehua Liu, Qintong Li, Mingyang Liu, Xiongwei Han, Zhijiang Guo, Han Wu, Linqi Song
MoMe
20 May 2025

CAT Merging: A Training-Free Approach for Resolving Conflicts in Model Merging
Wenju Sun, Qingyong Li, Yangli-ao Geng, Boyang Li
MoMe
11 May 2025

FedMerge: Federated Personalization via Model Merging
Shutong Chen, Tianyi Zhou, Guodong Long, Jing Jiang, Chengqi Zhang
FedML, MoMe
09 Apr 2025

MASS: MoErging through Adaptive Subspace Selection
Donato Crisostomi, Alessandro Zirilli, Antonio Andrea Gargiulo, Maria Sofia Bucarelli, Simone Scardapane, Fabrizio Silvestri, Iacopo Masi, Emanuele Rodolà
MoMe
06 Apr 2025

AdaRank: Adaptive Rank Pruning for Enhanced Model Merging
Chanhyuk Lee, Jiho Choi, Chanryeol Lee, Donggyun Kim, Seunghoon Hong
MoMe
28 Mar 2025

Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging
Han Wu, Yuxuan Yao, Shuqi Liu, Zehua Liu, Mingwen Liu, Xiongwei Han, Xianrui Li, Hui-Ling Zhen, Tao Zhong, Mingxuan Yuan
MoMe, LRM
26 Mar 2025

FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization
Hao Mark Chen, S. Hu, Wayne Luk, Timothy M. Hospedales, Hongxiang Fan
MoMe
16 Mar 2025

From Task-Specific Models to Unified Systems: A Review of Model Merging Approaches
Wei Ruan, Tianze Yang, Yimiao Zhou, Tianming Liu, Jin Lu
MoMe
13 Mar 2025

Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors
Runxi Cheng, Feng Xiong, Yongxian Wei, Wanyun Zhu, Chun Yuan
MoMe
11 Mar 2025

Task Vector Quantization for Memory-Efficient Model Merging
Youngeun Kim, Seunghwan Lee, Aecheon Jung, Bogon Ryu, Sungeun Hong
MQ, MoMe
10 Mar 2025

Seeing Delta Parameters as JPEG Images: Data-Free Delta Compression with Discrete Cosine Transform
Chenyu Huang, Peng Ye, Xinyu Wang, Shenghe Zheng, Biqing Qi, Wenlong Zhang, Wanli Ouyang, Tao Chen
09 Mar 2025

Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts
Weigao Sun, Disen Lan, Tong Zhu, Xiaoye Qu, Yu Cheng
MoE
07 Mar 2025

GNNMerge: Merging of GNN Models Without Accessing Training Data
Vipul Garg, Ishita Thakre, Sayan Ranu
MoMe
05 Mar 2025

CAMEx: Curvature-aware Merging of Experts
International Conference on Learning Representations (ICLR), 2025
Dung V. Nguyen, Minh H. Nguyen, Luc Q. Nguyen, R. Teo, T. Nguyen, Linh Duy Tran
MoMe
26 Feb 2025

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment
Chenghao Fan, Zhenyi Lu, Sichen Liu, Xiaoye Qu, Wei Wei, Yu Cheng
MoE
24 Feb 2025

Scalable Model Merging with Progressive Layer-wise Distillation
Jing Xu, Jiazheng Li, J.N. Zhang
MoMe, FedML
18 Feb 2025

1bit-Merging: Dynamic Quantized Merging for Large Language Models
Shuqi Liu, Yuxuan Yao, Bowei He, Zehua Liu, Xiongwei Han, Mingxuan Yuan, Han Wu, Linqi Song
MoMe, MQ
15 Feb 2025

LoRE-Merging: Exploring Low-Rank Estimation For Large Language Model Merging
Zehua Liu, Han Wu, Yuxuan Yao, Ruifeng She, Xiongwei Han, Tao Zhong, Mingxuan Yuan
MoMe
15 Feb 2025

Multi-Task Model Merging via Adaptive Weight Disentanglement
Feng Xiong, Runxi Cheng, Wang Chen, Zhanqiu Zhang, Yiwen Guo, Chun Yuan, Ruifeng Xu
MoMe
10 Jan 2025

PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Mingyang Song, Zhaochen Su, Xiaoye Qu, Jiawei Zhou, Yu Cheng
LRM
06 Jan 2025

Task Singular Vectors: Reducing Task Interference in Model Merging
Computer Vision and Pattern Recognition (CVPR), 2024
Antonio Andrea Gargiulo, Donato Crisostomi, Maria Sofia Bucarelli, Simone Scardapane, Fabrizio Silvestri, Emanuele Rodolà
MoMe
26 Nov 2024

ATM: Improving Model Merging by Alternating Tuning and Merging
Luca Zhou, Daniele Solombrino, Donato Crisostomi, Maria Sofia Bucarelli, Fabrizio Silvestri, Emanuele Rodolà
MoMe
05 Nov 2024

A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models
Qiaoyu Tang, Le Yu, Bowen Yu, Hongyu Lin, Keming Lu, Yaojie Lu, Jia Zheng, Le Sun
MoMe
17 Oct 2024

Glider: Global and Local Instruction-Driven Expert Router
Pingzhi Li, Prateek Yadav, Jaehong Yoon, Jie Peng, Yi-Lin Sung, Joey Tianyi Zhou, Tianlong Chen
MoMe, MoE
09 Oct 2024

DaWin: Training-free Dynamic Weight Interpolation for Robust Adaptation
International Conference on Learning Representations (ICLR), 2024
Changdae Oh, Yixuan Li, Kyungwoo Song, Sangdoo Yun, Dongyoon Han
OOD, MoMe
03 Oct 2024

CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling
Jihai Zhang, Xiaoye Qu, Tong Zhu, Yu Cheng
28 Sep 2024

A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning
Prateek Yadav, Colin Raffel, Mohammed Muqeeth, Lucas Caccia, Haokun Liu, Tianlong Chen, Joey Tianyi Zhou, Leshem Choshen, Alessandro Sordoni
MoMe
13 Aug 2024

Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts
Tong Zhu, Daize Dong, Xiaoye Qu, Jiacheng Ruan, Wenliang Chen, Yu Cheng
MoE
17 Jun 2024

On Giant's Shoulders: Effortless Weak to Strong by Dynamic Logits Fusion
Chenghao Fan, Zhenyi Lu, Wei Wei, Jie Tian, Xiaoye Qu, Dangyang Chen, Yu Cheng
MoMe
17 Jun 2024