Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2212.04089
Cited By
v1
v2
v3 (latest)
Editing Models with Task Arithmetic
International Conference on Learning Representations (ICLR), 2022
8 December 2022
Gabriel Ilharco
Marco Tulio Ribeiro
Mitchell Wortsman
Suchin Gururangan
Ludwig Schmidt
Hannaneh Hajishirzi
Ali Farhadi
KELM
MoMe
MU
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (7 upvotes)
Papers citing
"Editing Models with Task Arithmetic"
50 / 525 papers shown
On the Emergence of Linear Analogies in Word Embeddings
Daniel J. Korchinski
Dhruva Karkada
Yasaman Bahri
Matthieu Wyart
259
1
0
24 May 2025
How Can I Publish My LLM Benchmark Without Giving the True Answers Away?
Takashi Ishida
Thanawat Lodkaew
Ikko Yamane
708
2
0
23 May 2025
When Are Concepts Erased From Diffusion Models?
Kevin Lu
Nicky Kriplani
Rohit Gandikota
Minh Pham
David Bau
Chinmay Hegde
Niv Cohen
585
5
0
22 May 2025
Training-Free Reasoning and Reflection in MLLMs
Hongchen Wei
Zhenzhong Chen
OffRL
VLM
LRM
254
1
0
22 May 2025
Model Merging is Secretly Certifiable: Non-Vacuous Generalisation Bounds for Low-Shot Learning
Taehoon Kim
Henry Gouk
Minyoung Kim
Timothy M. Hospedales
351
0
0
21 May 2025
Covert Attacks on Machine Learning Training in Passively Secure MPC
IACR Cryptology ePrint Archive (IACR ePrint), 2025
Matthew Jagielski
Daniel Escudero
Rahul Rachuri
Peter Scholl
306
0
0
21 May 2025
Context-Free Synthetic Data Mitigates Forgetting
Parikshit Bansal
Sujay Sanghavi
CLL
350
1
0
20 May 2025
SAFEPATH: Preventing Harmful Reasoning in Chain-of-Thought via Early Alignment
Wonje Jeung
Sangyeon Yoon
Minsuk Kahng
Albert No
LRM
LLMSV
731
8
0
20 May 2025
Activation-Guided Consensus Merging for Large Language Models
Yuxuan Yao
Shuqi Liu
Zehua Liu
Qintong Li
Mingyang Liu
Xiongwei Han
Zhijiang Guo
Han Wu
Linqi Song
MoMe
452
0
0
20 May 2025
Text Generation Beyond Discrete Token Sampling
Yufan Zhuang
Liyuan Liu
Chandan Singh
Jingbo Shang
Jianfeng Gao
OOD
513
9
0
20 May 2025
Shadow-FT: Tuning Instruct Model via Training on Paired Base Model
Taiqiang Wu
Runming Yang
Jiayi Li
Pengfei Hu
Ngai Wong
Ngai Wong
Yujiu Yang
699
1
0
19 May 2025
Distilling a speech and music encoder with task arithmetic
Fabian Ritter-Gutierrez
Yi-Cheng Lin
Jui-Chiang Wei
Jeremy H.M Wong
Eng Siong Chng
Nancy F. Chen
Hung-yi Lee
264
8
0
19 May 2025
Scalable Strategies for Continual Learning with Replay
Truman Hickok
CLL
387
1
0
18 May 2025
Cross-Model Transfer of Task Vectors via Few-Shot Orthogonal Alignment
Kazuhiko Kawamoto
Atsuhiro Endo
Hiroshi Kera
319
0
0
17 May 2025
MINGLE: Mixture of Null-Space Gated Low-Rank Experts for Test-Time Continual Model Merging
Zihuan Qiu
Yi Xu
Chiyuan He
Fanman Meng
Linfeng Xu
Qi Wu
Hongliang Li
CLL
MoMe
423
3
0
17 May 2025
Exploring Criteria of Loss Reweighting to Enhance LLM Unlearning
Puning Yang
Qizhou Wang
Zhuo Huang
Tongliang Liu
Chengqi Zhang
Bo Han
MU
399
14
0
17 May 2025
Do different prompting methods yield a common task representation in language models?
Guy Davidson
Todd M. Gureckis
Brenden M. Lake
Adina Williams
380
4
0
17 May 2025
Semantic Aware Linear Transfer by Recycling Pre-trained Language Models for Cross-lingual Transfer
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Seungyoon Lee
Seongtae Hong
Hyeonseok Moon
Heuiseok Lim
KELM
324
0
0
16 May 2025
MergeBench: A Benchmark for Merging Domain-Specialized LLMs
Yifei He
Siqi Zeng
Yuzheng Hu
Rui Yang
Tong Zhang
Han Zhao
MoMe
ALM
684
8
0
16 May 2025
A Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Merging, and Clinical-Tasks Alignment
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jean-Philippe Corbeil
Amin Dada
Jean-Michel Attendu
Asma Ben Abacha
Alessandro Sordoni
Lucas Caccia
François Beaulieu
Thomas Lin
Jens Kleesiek
Paul Vozila
LM&MA
375
11
0
15 May 2025
Layered Unlearning for Adversarial Relearning
Timothy Qian
Vinith Suriyakumar
Ashia Wilson
Dylan Hadfield-Menell
MU
345
1
0
14 May 2025
CAT Merging: A Training-Free Approach for Resolving Conflicts in Model Merging
Wenju Sun
Qingyong Li
Yangli-ao Geng
Boyang Li
MoMe
329
6
0
11 May 2025
Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation
Computer Vision and Pattern Recognition (CVPR), 2025
Kunpeng Qiu
Zhiqiang Gao
Zhiying Zhou
Mingjie Sun
Yongxin Guo
MedIm
480
17
0
09 May 2025
WaterDrum: Watermarking for Data-centric Unlearning Metric
Xinyang Lu
Xinyuan Niu
Gregory Kang Ruey Lau
Bui Thi Cam Nhung
Rachael Hwee Ling Sim
Fanyu Wen
Chuan-Sheng Foo
Szu Hui Ng
Bryan Kian Hsiang Low
MU
271
4
0
08 May 2025
Unlearning Sensitive Information in Multimodal LLMs: Benchmark and Attack-Defense Evaluation
Vaidehi Patil
Yi-Lin Sung
Peter Hase
Jie Peng
Jen-tse Huang
Joey Tianyi Zhou
AAML
MU
527
8
0
01 May 2025
GDI-Bench: A Benchmark for General Document Intelligence with Vision and Reasoning Decoupling
Siqi Li
Yufan Shen
Xiangnan Chen
Jiayi Chen
Hengwei Ju
...
Botian Shi
Y. Liu
Xinyu Cai
Yu Qiao
Yu Qiao
VLM
ELM
584
2
0
30 Apr 2025
Adaptive Helpfulness-Harmlessness Alignment with Preference Vectors
Ren-Wei Liang
Chin-Ting Hsu
Chan-Hung Yu
Saransh Agrawal
Shih-Cheng Huang
Shang-Tse Chen
Kuan-Hao Huang
Shao-Hua Sun
329
0
0
27 Apr 2025
Param
Δ
Δ
Δ
for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost
Sheng Cao
Mingrui Wu
Karthik Prasad
Yuandong Tian
Zechun Liu
MoMe
339
0
0
23 Apr 2025
Parameter-Efficient Checkpoint Merging via Metrics-Weighted Averaging
Shi Jie Yu
Sehyun Choi
MoMe
285
0
0
23 Apr 2025
Advancing AI-assisted Hardware Design with Hierarchical Decentralized Training and Personalized Inference-Time Optimization
Hao Mark Chen
Zehuan Zhang
Wanru Zhao
Nicholas D. Lane
Hongxiang Fan
258
0
0
21 Apr 2025
Mitigating Parameter Interference in Model Merging via Sharpness-Aware Fine-Tuning
International Conference on Learning Representations (ICLR), 2025
Yeoreum Lee
Jinwook Jung
Sungyong Baik
MoMe
422
7
0
20 Apr 2025
TrustLoRA: Low-Rank Adaptation for Failure Detection under Out-of-distribution Data
Fei Zhu
Zhaoxiang Zhang
OODD
UQCV
377
1
0
20 Apr 2025
DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging
Tianhui Song
Weixin Feng
Shuai Wang
Guojian Pang
Bangyu Xiang
Bo Zheng
Limin Wang
MoMe
343
4
0
16 Apr 2025
Leveraging Submodule Linearity Enhances Task Arithmetic Performance in LLMs
International Conference on Learning Representations (ICLR), 2025
Rui Dai
Sile Hu
Xu Shen
Yonggang Zhang
Xinmei Tian
Jieping Ye
MoMe
320
6
0
15 Apr 2025
When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers
International Conference on Learning Representations (ICLR), 2025
Hongkang Li
Yihua Zhang
Shuai Zhang
Ming Wang
Sijia Liu
Pin-Yu Chen
MoMe
801
21
0
15 Apr 2025
Alleviating the Fear of Losing Alignment in LLM Fine-tuning
IEEE Symposium on Security and Privacy (S&P), 2025
Kang Yang
Guanhong Tao
X. Chen
Jun Xu
280
11
0
13 Apr 2025
LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation
Juzheng Zhang
Jiacheng You
Ashwinee Panda
Tom Goldstein
MoMe
357
10
0
10 Apr 2025
FedMerge: Federated Personalization via Model Merging
Shutong Chen
Tianyi Zhou
Guodong Long
Jing Jiang
Chengqi Zhang
FedML
MoMe
379
1
0
09 Apr 2025
SEA-LION: Southeast Asian Languages in One Network
Raymond Ng
Thanh Ngan Nguyen
Yuli Huang
Ngee Chia Tai
Wai Yi Leong
...
David Ong Tat-Wee
B. Liu
William-Chandra Tjhi
Xiaoshi Zhong
Leslie Teo
430
25
0
08 Apr 2025
Not All Data Are Unlearned Equally
Aravind Krishnan
Siva Reddy
Marius Mosbach
MU
896
7
0
07 Apr 2025
Exact Unlearning of Finetuning Data via Model Merging at Scale
Kevin Kuo
Amrith Rajagopal Setlur
Kartik Srinivas
Aditi Raghunathan
Virginia Smith
MoMe
CLL
MU
289
12
0
06 Apr 2025
MASS: MoErging through Adaptive Subspace Selection
Donato Crisostomi
Alessandro Zirilli
Antonio Andrea Gargiulo
Maria Sofia Bucarelli
Simone Scardapane
Fabrizio Silvestri
Iacopo Masi
Emanuele Rodolà
MoMe
292
0
0
06 Apr 2025
Efficient Model Editing with Task-Localized Sparse Fine-tuning
International Conference on Learning Representations (ICLR), 2025
Leonardo Iurada
Marco Ciccone
Tatiana Tommasi
KELM
MoMe
350
10
0
03 Apr 2025
BECAME: BayEsian Continual Learning with Adaptive Model MErging
Mei Li
Yuxiang Lu
Qinyan Dai
Wei Ji
Yue Ding
Hongtao Lu
CLL
MoMe
349
1
0
03 Apr 2025
Enhancing Image Resolution of Solar Magnetograms: A Latent Diffusion Model Approach
Francesco P. Ramunno
Paolo Massa
Vitaliy Kinakh
Brandon Panos
A. Csillaghy
Slava Voloshynovskiy
DiffM
272
3
0
31 Mar 2025
AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization
Computer Vision and Pattern Recognition (CVPR), 2025
Yiyang Du
Xiaochen Wang
Cai Chen
Jiabo Ye
Yiru Wang
...
J.N. Zhang
Fei Huang
Zhifang Sui
Maosong Sun
Yi Liu
MoMe
230
4
0
31 Mar 2025
SUV: Scalable Large Language Model Copyright Compliance with Regularized Selective Unlearning
Tianyang Xu
Xiaoze Liu
Feijie Wu
Xiaoqian Wang
Jing Gao
MU
551
5
0
29 Mar 2025
AdaRank: Adaptive Rank Pruning for Enhanced Model Merging
Chanhyuk Lee
Jiho Choi
Chanryeol Lee
Donggyun Kim
Seunghoon Hong
MoMe
295
5
0
28 Mar 2025
Reinforced Model Merging
J. N. Han
Jingwen Ye
Shunyu Liu
Haofei Zhang
Jie Song
Zunlei Feng
Weilong Dai
MoMe
274
0
0
27 Mar 2025
Guided Model Merging for Hybrid Data Learning: Leveraging Centralized Data to Refine Decentralized Models
Junyi Zhu
Ruicong Yao
Taha Ceritli
Savas Ozkan
Matthew B. Blaschko
Eunchung Noh
Jeongwon Min
Cho Jung Min
Mete Ozay
FedML
486
0
0
26 Mar 2025
Previous
1
2
3
4
5
...
9
10
11
Next