Editing Models with Task Arithmetic

International Conference on Learning Representations (ICLR), 2023

8 December 2022

Gabriel Ilharco

Marco Tulio Ribeiro

Mitchell Wortsman

Suchin Gururangan

Hannaneh Hajishirzi

Papers citing "Editing Models with Task Arithmetic"

50 / 525 papers shown

On the Limitations and Prospects of Machine Unlearning for Generative AI

295

14

0

01 Aug 2024

Efficient Pareto Manifold Learning with Low-Rank Structure

199

9

0

30 Jul 2024

Can LLMs be Fooled? Investigating Vulnerabilities in LLMs

297

9

0

30 Jul 2024

MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning

382

7

0

30 Jul 2024

Diffusion Models for Multi-Task Generative Modeling

Bunyamin Sisman

Benjamin Z. Yao

228

9

0

24 Jul 2024

Model editing for distribution shifts in uranium oxide morphological analysis

Madelyn Shapiro

221

1

0

22 Jul 2024

Recent Advances in Generative AI and Large Language Models: Current Status, Challenges, and Perspectives

Danda B. Rawat

494

88

0

20 Jul 2024

Pareto Low-Rank Adapters: Efficient Multi-Task Learning with Preferences

Nikolaos Dimitriadis

Pascal Frossard

François Fleuret

562

9

0

10 Jul 2024

Scaling Up Personalized Aesthetic Assessment via Task Vector Customization

208

5

0

09 Jul 2024

MagMax: Leveraging Model Merging for Seamless Continual Learning

Bartłomiej Twardowski

Tomasz Trzciński

Sebastian Cygert

206

44

0

08 Jul 2024

Harmony in Diversity: Merging Neural Networks with Canonical Correlation Analysis

Albert Manuel Orozco Camacho

Eugene Belilovsky

234

12

0

07 Jul 2024

Learning Scalable Model Soup on a Single GPU: An Efficient Subspace Training Strategy

337

2

0

04 Jul 2024

Knowledge Composition using Task Vectors with Learned Anisotropic Scaling

Frederic Z. Zhang

Cristian Rodriguez-Opazo

Anton van den Hengel

Ehsan Abbasnejad

275

27

0

03 Jul 2024

PLeaS -- Merging Models with Permutations and Least Squares

Pang Wei Koh

302

9

0

02 Jul 2024

It's Morphing Time: Unleashing the Potential of Multiple LLMs via Multi-objective Optimization

Ke Tang

362

13

0

29 Jun 2024

Knowledge-Aware Parsimony Learning: A Perspective from Relational Graphs

James Kwok

246

0

0

29 Jun 2024

Enhancing Accuracy and Parameter-Efficiency of Neural Representations for Network Parameterization

Jayaraman J. Thiagarajan

339

3

0

29 Jun 2024

Evaluating Copyright Takedown Methods for Language Models

Weijia Shi

Yangsibo Huang

Luke Zettlemoyer

Kai Li

Peter Henderson

459

39

0

26 Jun 2024

Sequential Editing for Lifelong Training of Speech Recognition Models

Devang Kulshreshtha

Saket Dingliwal

Brady C. Houston

Nikolaos Pappas

149

1

0

25 Jun 2024

PAFT: A Parallel Training Paradigm for Effective LLM Fine-Tuning

Shiva K. Pentyala

Regunathan Radhakrishnan

Cheng

252

12

0

25 Jun 2024

Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs

Tsachy Weissman

444

27

0

24 Jun 2024

WARP: On the Benefits of Weight Averaged Rewarded Policies

Alexandre Ramé

Léonard Hussenot

Pierre-Louis Cedoz

Pier Giuseppe Sessa

Arthur Douillard

320

33

0

24 Jun 2024

Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging

...

329

10

0

24 Jun 2024

Label Words as Local Task Vectors in In-Context Learning

252

4

0

23 Jun 2024

MU-Bench: A Multitask Multimodal Benchmark for Machine Unlearning

326

10

0

21 Jun 2024

RE-AdaptIR: Improving Information Retrieval through Reverse Engineered Adaptation

William Fleshman

Benjamin Van Durme

194

1

0

20 Jun 2024

Towards Minimal Targeted Updates of Language Models with Targeted Negative Training

Lily H. Zhang

Rajesh Ranganath

330

1

0

19 Jun 2024

Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts

Leonid Karlinsky

Hongyin Luo

Jacob A. Hansen

James Glass

Alan Ritter

240

19

0

17 Jun 2024

Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs

Praneet Adusumilli

Dennis Wei

Nathalie Baracaldo

233

17

0

17 Jun 2024

MetaGPT: Merging Large Language Models Using Model Exclusive Task Arithmetic

Bingning Wang

Weipeng Chen

389

40

0

17 Jun 2024

On Giant's Shoulders: Effortless Weak to Strong by Dynamic Logits Fusion

Wei Wei

Xiaoye Qu

336

10

0

17 Jun 2024

Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging

Wei Wei

Xiaoye Qu

280

90

0

17 Jun 2024

In-Context Editing: Learning Knowledge from Self-Induced Distributions

582

15

0

17 Jun 2024

RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models

Kang Liu

Jun Zhao

345

51

0

16 Jun 2024

Towards Efficient Pareto Set Approximation via Mixture of Experts Based Model Fusion

Li Shen

Han Hu

200

13

0

14 Jun 2024

Interpreting the Weight Space of Customized Diffusion Models

Yossi Gandelsman

Kuan-Chieh Wang

Gordon Wetzstein

Alexei A. Efros

428

20

0

13 Jun 2024

A More Practical Approach to Machine Unlearning

David Zagardo

97

5

0

13 Jun 2024

MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation

629

10

0

11 Jun 2024

Improving Alignment and Robustness with Circuit Breakers

Neural Information Processing Systems (NeurIPS), 2024

Maksym Andriushchenko

Matt Fredrikson

624

214

0

06 Jun 2024

FusionBench: A Unified Library and Comprehensive Benchmark for Deep Model Fusion

454

38

0

05 Jun 2024

Operational Latent Spaces

Scott H. Hawley

Austin R. Tackett

149

1

0

04 Jun 2024

Pretrained Hybrids with MAD Skills

Nicholas Roberts

Satya Sai Srinath Namburi

376

0

0

02 Jun 2024

An Empirical Analysis of Forgetting in Pre-trained Models with Incremental Low-Rank Updates

Albin Soutif--Cormerais

Simone Magistri

Joost van de Weijer

Andrew D. Bagdanov

429

8

0

28 May 2024

Navigating the Safety Landscape: Measuring Risks in Finetuning Large Language Models

Sheng-Hsuan Peng

Duen Horng Chau

336

51

0

27 May 2024

Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models

Chun-ying Huang

461

97

0

27 May 2024

Ensembling Diffusion Models via Adaptive Feature Aggregation

Zhiwei Jiang

350

15

0

27 May 2024

WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models

Neural Information Processing Systems (NeurIPS), 2024

Peng Wang

Ningyu Zhang

Fei Huang

Huajun Chen

315

61

0

23 May 2024

EMR-Merging: Tuning-Free High-Performance Model Merging

Neural Information Processing Systems (NeurIPS), 2024

Peng Ye

Tao Chen

Wanli Ouyang

294

73

0

23 May 2024

Model Editing as a Robust and Denoised variant of DPO: A Case Study on Toxicity

593

7

0

22 May 2024

MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models

255

13

0

19 May 2024
