Editing Models with Task Arithmetic

International Conference on Learning Representations (ICLR), 2023

8 December 2022

Gabriel Ilharco

Marco Tulio Ribeiro

Mitchell Wortsman

Suchin Gururangan

Hannaneh Hajishirzi

Papers citing "Editing Models with Task Arithmetic"

50 / 525 papers shown

Boomerang Distillation Enables Zero-Shot Model Size Interpolation

Sara Kangaslahti

Jonathan Geuter

Francesco Locatello

David Alvarez-Melis

158

0

0

06 Oct 2025

How does the optimizer implicitly bias the model merging loss landscape?

Chenxiang Zhang

Alexander Theus

Antonio Orvieto

189

1

0

06 Oct 2025

Learning to Interpret Weight Differences in Language Models

224

1

0

06 Oct 2025

MLLMEraser: Achieving Test-Time Unlearning in Multimodal Large Language Models through Activation Steering

248

0

0

05 Oct 2025

BiasFreeBench: a Benchmark for Mitigating Bias in Large Language Model Responses

102

1

0

30 Sep 2025

Expert Merging: Model Merging with Unsupervised Expert Alignment and Importance-Guided Layer Chunking

152

2

0

30 Sep 2025

Understanding the Dilemma of Unlearning for Large Language Models

259

1

0

29 Sep 2025

Model Merging Scaling Laws in Large Language Models

326

1

0

29 Sep 2025

TDHook: A Lightweight Framework for Interpretability

129

0

0

29 Sep 2025

Real-Aware Residual Model Merging for Deepfake Detection

152

0

0

29 Sep 2025

Stable Forgetting: Bounded Parameter-Efficient Unlearning in LLMs

Hemanth Saratchandran

121

1

0

29 Sep 2025

Merge Now, Regret Later: The Hidden Cost of Model Merging is Adversarial Transferability

Aaryan Ajay Sharma

189

1

0

28 Sep 2025

Bridging the Knowledge-Prediction Gap in LLMs on Multiple-Choice Questions

374

0

0

28 Sep 2025

Toward a Holistic Approach to Continual Model Merging

190

1

0

28 Sep 2025

Dual-Space Smoothness for Robust and Balanced LLM Unlearning

116

0

0

27 Sep 2025

Guard Vector: Beyond English LLM Guardrails with Task-Vector Composition and Streaming-Aware Prefix SFT

Dongyoung Jeong

143

0

0

27 Sep 2025

Temporal Generalization: A Reality Check

132

0

0

27 Sep 2025

Context Parametrization with Compositional Adapters

124

0

0

26 Sep 2025

Closing the Oracle Gap: Increment Vector Transformation for Class Incremental Learning

143

0

0

26 Sep 2025

The Thinking Spectrum: An Empirical Study of Tunable Reasoning in LLMs through Model Merging

224

2

0

26 Sep 2025

SFT Doesn't Always Hurt General Capabilities: Revisiting Domain-Specific Fine-Tuning in LLMs

Arvind Srinivasan

...

355

1

0

25 Sep 2025

Null-Space Filtering for Data-Free Continual Model Merging: Preserving Transparency, Promoting Fidelity

128

0

0

25 Sep 2025

Faster, Smaller, and Smarter: Task-Aware Expert Merging for Online MoE Inference

155

0

0

24 Sep 2025

LLM-based Agents Suffer from Hallucinations: A Survey of Taxonomy, Methods, and Directions

...

344

5

0

23 Sep 2025

SEQR: Secure and Efficient QR-based LoRA Routing

William Fleshman

Benjamin Van Durme

161

0

0

22 Sep 2025

Accurate and Efficient Low-Rank Model Merging in Core Space

Aniello Panariello

Simone Magistri

Angelo Porrello

Bartłomiej Twardowski

Andrew D. Bagdanov

Simone Calderara

Joost van de Weijer

272

2

0

22 Sep 2025

Variational Task Vector Composition

194

0

0

21 Sep 2025

Local Mechanisms of Compositional Generalization in Conditional Diffusion

244

1

0

19 Sep 2025

HAM: Hierarchical Adapter Merging for Scalable Continual Learning

Eric Nuertey Coleman

Luigi Quarantiello

Samrat Mukherjee

Vincenzo Lomonaco

306

1

0

16 Sep 2025

Programmable Cognitive Bias in Social Agents

185

2

0

16 Sep 2025

Harnessing Optimization Dynamics for Curvature-Informed Model Merging

Pouria Mahdavinia

Niloofar Mireshghallah

183

1

0

14 Sep 2025

Continually Adding New Languages to Multilingual Language Models

203

2

0

14 Sep 2025

Delta Activations: A Representation for Finetuned Large Language Models

164

0

0

04 Sep 2025

Unlearning That Lasts: Utility-Preserving, Robust, and Almost Irreversible Forgetting in LLMs

Maximilian Müller

Francesco Croce

192

4

0

02 Sep 2025

Surrogate Benchmarks for Model Merging Optimization

Nozomu Yoshinari

Toshiyuki Nishimoto

Shinichi Shirakawa

151

0

0

02 Sep 2025

Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic

184

3

0

01 Sep 2025

Model Unmerging: Making Your Models Unmergeable for Secure Model Sharing

161

0

0

01 Sep 2025

Improving Fisher Information Estimation and Efficiency for LoRA-based LLM Unlearning

115

1

0

29 Aug 2025

Rethinking Layer-wise Model Merging through Chain of Merges

Riccardo Salami

Angelo Porrello

Simone Calderara

199

0

0

29 Aug 2025

Lethe: Purifying Backdoored Large Language Models with Knowledge Dilution

160

0

0

28 Aug 2025

UNIFORM: Unifying Knowledge from Large-scale and Diverse Pre-trained Models

139

1

0

27 Aug 2025

PSO-Merging: Merging Models Based on Particle Swarm Optimization

130

0

0

27 Aug 2025

AMELIA: A Family of Multi-task End-to-end Language Models for Argumentation

86

0

0

25 Aug 2025

Modular Embedding Recomposition for Incremental Learning

Aniello Panariello

Emanuele Frascaroli

Lorenzo Bonicelli

Angelo Porrello

Simone Calderara

201

2

0

22 Aug 2025

On Task Vectors and Gradients

Daniele Solombrino

Donato Crisostomi

Maria Sofia Bucarelli

Giuseppe Alessio D’Inverno

Fabrizio Silvestri

Emanuele Rodolà

412

1

0

22 Aug 2025

Think in Blocks: Adaptive Reasoning from Direct Response to Deep Reasoning

OffRL LRM AI4CE

86

0

0

21 Aug 2025

Learn Faster and Remember More: Balancing Exploration and Exploitation for Continual Test-time Adaptation

250

0

0

18 Aug 2025

Cost-Aware Contrastive Routing for LLMs

Reza Shirkavand

Heng-Chiao Huang

313

1

0

17 Aug 2025

Rethinking Safety in LLM Fine-tuning: An Optimization Perspective

David M. Krueger

141

3

0

17 Aug 2025

MedSAMix: A Training-Free Model Merging Approach for Medical Image Segmentation

Francesco Sammarco

123

2

0

14 Aug 2025
