Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2212.04089
Cited By
v1
v2
v3 (latest)
Editing Models with Task Arithmetic
International Conference on Learning Representations (ICLR), 2022
8 December 2022
Gabriel Ilharco
Marco Tulio Ribeiro
Mitchell Wortsman
Suchin Gururangan
Ludwig Schmidt
Hannaneh Hajishirzi
Ali Farhadi
KELM
MoMe
MU
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (7 upvotes)
Papers citing
"Editing Models with Task Arithmetic"
50 / 525 papers shown
VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models
Lingjie Jiang
Shaohan Huang
Xun Wu
Yixia Li
Dongdong Zhang
Furu Wei
MLLM
VLM
183
3
0
13 Aug 2025
Hierarchical Adaptive networks with Task vectors for Test-Time Adaptation
Sameer Ambekar
Daniel M. Lang
Julia A. Schnabel
119
0
0
11 Aug 2025
Low-Rank Expert Merging for Multi-Source Domain Adaptation in Person Re-Identification
Taha Mustapha Nehdi
Nairouz Mrabah
Atif Belal
M. Pedersoli
Mohammadhadi Shateri
MoMe
282
0
0
09 Aug 2025
Learning from Oblivion: Predicting Knowledge Overflowed Weights via Retrodiction of Forgetting
Jinhyeok Jang
Jaehong Kim
Jung Uk Kim
113
0
0
07 Aug 2025
Sculpting Margin Penalty: Intra-Task Adapter Merging and Classifier Calibration for Few-Shot Class-Incremental Learning
Liang Bai
Hong Song
Jinfu Li
Yucong Lin
Jingfan Fan
Tianyu Fu
Danni Ai
Deqiang Xiao
Jian Yang
CLL
142
0
0
07 Aug 2025
Industrial LLM-based Code Optimization under Regulation: A Mixture-of-Agents Approach
Mari Ashiga
Vardan K. Voskanyan
Fateme Dinmohammadi
Jingzhi Gong
P. Brookes
Matthew Truscott
Rafail Giavrimis
Mike Basios
Leslie Kanthan
Wei Jie
148
1
0
05 Aug 2025
RegMean++: Enhancing Effectiveness and Generalization of Regression Mean for Model Merging
The-Hai Nguyen
Dang Huu-Tien
Takeshi Suzuki
Le-Minh Nguyen
MoMe
286
2
0
05 Aug 2025
Welcome New Doctor: Continual Learning with Expert Consultation and Autoregressive Inference for Whole Slide Image Analysis
Doanh C. Bui
Jin Tae Kwak
CLL
MedIm
135
0
0
04 Aug 2025
DisTaC: Conditioning Task Vectors via Distillation for Robust Model Merging
Kotaro Yoshida
Yuji Naraki
Takafumi Horie
Ryotaro Shimizu
Hiroki Naganuma
MoMe
VLM
177
0
0
02 Aug 2025
Watch the Weights: Unsupervised monitoring and control of fine-tuned LLMs
Ziqian Zhong
Aditi Raghunathan
208
3
0
31 Jul 2025
Forgetting of task-specific knowledge in model merging-based continual learning
Timm Hess
Gido M. van de Ven
Tinne Tuytelaars
CLL
FedML
MoMe
KELM
VLM
184
0
0
31 Jul 2025
Modular Delta Merging with Orthogonal Constraints: A Scalable Framework for Continual and Reversible Model Composition
Haris Khan
Shumaila Asif
Sadia Asif
MoMe
CLL
204
0
0
28 Jul 2025
Uncertainty-driven Embedding Convolution
Sungjun Lim
Kangjun Noh
Youngjun Choi
Heeyoung Lee
Kyungwoo Song
BDL
295
0
0
28 Jul 2025
CLoRA: Parameter-Efficient Continual Learning with Low-Rank Adaptation
Shishir Muralidhara
D. Stricker
René Schuster
CLL
175
0
0
26 Jul 2025
Fishers for Free? Approximating the Fisher Information Matrix by Recycling the Squared Gradient Accumulator
YuXin Li
Felix Dangel
Derek Tam
Colin Raffel
212
2
0
24 Jul 2025
Look the Other Way: Designing 'Positive' Molecules with Negative Data via Task Arithmetic
Rıza Özçelik
Sarah de Ruiter
F. Grisoni
213
0
0
23 Jul 2025
FlexOlmo: Open Language Models for Flexible Data Use
Weijia Shi
Akshita Bhagia
Kevin Farhat
Niklas Muennighoff
Pete Walsh
...
Luke Zettlemoyer
Pang Wei Koh
Hannaneh Hajishirzi
Ali Farhadi
Sewon Min
MoE
398
4
0
09 Jul 2025
On the Surprising Effectiveness of a Single Global Merging in Decentralized Learning
Tongtian Zhu
Tianyu Zhang
Mingze Wang
Zhanpeng Zhou
Can Wang
FedML
MoMe
342
0
0
09 Jul 2025
Interaction-Merged Motion Planning: Effectively Leveraging Diverse Motion Datasets for Robust Planning
Giwon Lee
Wooseong Jeong
Daehee Park
Jaewoo Jeong
Kuk-Jin Yoon
368
0
0
07 Jul 2025
Addressing The Devastating Effects Of Single-Task Data Poisoning In Exemplar-Free Continual Learning
Stanisław Pawlak
Bartłomiej Twardowski
Tomasz Trzciñski
Joost van de Weijer
AAML
CLL
174
0
0
05 Jul 2025
DuET: Dual Incremental Object Detection via Exemplar-Free Task Arithmetic
Munish Monga
Vishal M. Chudasama
Pankaj Wasnik
Biplab Banerjee
534
0
0
26 Jun 2025
Multiple Streams of Knowledge Retrieval: Enriching and Recalling in Transformers
Todd Nief
David Reber
Sean Richardson
Ari Holtzman
KELM
193
0
0
25 Jun 2025
Latent Concept Disentanglement in Transformer-based Language Models
Guan Zhe Hong
Bhavya Vasudeva
Willie Neiswanger
Cyrus Rashtchian
Prabhakar Raghavan
Rina Panigrahy
ReLM
LRM
340
2
0
20 Jun 2025
Subspace-Boosted Model Merging
Ronald Skorobogat
Karsten Roth
Mariana-Iuliana Georgescu
MoMe
401
2
0
19 Jun 2025
Learning-Time Encoding Shapes Unlearning in LLMs
Ruihan Wu
Konstantin Garov
Kamalika Chaudhuri
MU
221
0
0
18 Jun 2025
Knowledge Adaptation as Posterior Correction
Mohammad Emtiyaz Khan
252
1
0
17 Jun 2025
Don't Make It Up: Preserving Ignorance Awareness in LLM Fine-Tuning
William F. Shen
Xinchi Qiu
Nicola Cancedda
Nicholas D. Lane
CLL
280
2
0
17 Jun 2025
The Butterfly Effect: Neural Network Training Trajectories Are Highly Sensitive to Initial Conditions
Devin Kwok
Gül Sena Altıntaş
Colin Raffel
David Rolnick
424
2
0
16 Jun 2025
Reasoning Model Unlearning: Forgetting Traces, Not Just Answers, While Preserving Reasoning Skills
Changsheng Wang
Chongyu Fan
Yihua Zhang
Jinghan Jia
Dennis Wei
Parikshit Ram
Nathalie Baracaldo
Sijia Liu
MU
KELM
LRM
346
7
0
15 Jun 2025
A correlation-permutation approach for speech-music encoders model merging
Fabian Ritter-Gutierrez
Yi-Cheng Lin
Jeremy H.M Wong
Hung-yi Lee
Eng Siong Chng
Nancy F. Chen
MoMe
265
2
0
13 Jun 2025
Model Organisms for Emergent Misalignment
Edward Turner
Anna Soligo
Mia Taylor
Senthooran Rajamanoharan
Neel Nanda
190
23
0
13 Jun 2025
Lifting Data-Tracing Machine Unlearning to Knowledge-Tracing for Foundation Models
Yuwen Tan
Boqing Gong
MU
276
1
0
12 Jun 2025
SoK: Machine Unlearning for Large Language Models
Jie Ren
Yue Xing
Yingqian Cui
Charu C. Aggarwal
Hui Liu
MU
182
2
0
10 Jun 2025
Merging Smarter, Generalizing Better: Enhancing Model Merging on OOD Data
Bingjie Zhang
Hongkang Li
Changlong Shi
Guowei Rong
He Zhao
Dongsheng Wang
Dandan Guo
Meng Wang
MoMe
305
0
0
10 Jun 2025
Do Concept Replacement Techniques Really Erase Unacceptable Concepts?
Anudeep Das
Gurjot Singh
Prach Chantasantitam
Nirmal Asokan
DiffM
224
0
0
10 Jun 2025
Transferring Linear Features Across Language Models With Model Stitching
Alan Chen
Jack Merullo
Alessandro Stolfo
Ellie Pavlick
243
1
0
07 Jun 2025
Do LLMs Really Forget? Evaluating Unlearning with Knowledge Correlation and Confidence Awareness
Rongzhe Wei
Peizhi Niu
Hans Hao-Hsun Hsu
Ruihan Wu
Haoteng Yin
...
Vamsi K. Potluru
Eli Chien
Kamalika Chaudhuri
S. Rasoul Etesami
P. Li
MU
KELM
521
6
0
06 Jun 2025
Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning
Yuheng Lei
Sitong Mao
Shunbo Zhou
Hongyuan Zhang
Xuelong Li
Ping Luo
CLL
247
0
0
06 Jun 2025
StatsMerging: Statistics-Guided Model Merging via Task-Specific Teacher Distillation
Ranjith Merugu
Bryan Bo Cao
Shubham Jain
FedML
MoMe
387
1
0
05 Jun 2025
Out-of-Distribution Graph Models Merging
Yidi Wang
Jiawei Gu
pei Xiaobing
Xubin Zheng
Xiao Luo
Pengyang Wang
Ziyue Qiao
MoMe
OODD
367
0
0
04 Jun 2025
Adaptive Task Vectors for Large Language Models
Joonseong Kang
Soojeong Lee
Subeen Park
Sumin Park
Taero Kim
Jihee Kim
Ryunyi Lee
Kyungwoo Song
262
0
0
03 Jun 2025
FedRPCA: Enhancing Federated LoRA Aggregation Using Robust PCA
Divyansh Jhunjhunwala
Arian Raje
Madan Ravi Ganesh
Chaithanya Kumar Mummadi
Chaoqun Dong
Jiawei Zhou
Wan-Yi Lin
Gauri Joshi
Zhenzhen Li
300
0
0
01 Jun 2025
Assembly of Experts: Linear-time construction of the Chimera LLM variants with emergent and adaptable behaviors
Henrik Klagges
Robert Dahlke
Fabian Klemm
Benjamin Merkel
Daniel Klingmann
David A. Reiss
Dan Zecha
MoMe
MoE
236
2
0
31 May 2025
Decom-Renorm-Merge: Model Merging on the Right Space Improves Multitasking
Yuatyong Chaichana
Thanapat Trachu
Peerat Limkonchotiwat
Konpat Preechakul
Tirasan Khandhawit
Ekapol Chuangsuwanich
MoMe
585
1
0
29 May 2025
Towards Minimizing Feature Drift in Model Merging: Layer-wise Task Vector Fusion for Adaptive Knowledge Integration
Wenju Sun
Qingyong Li
Wen Wang
Yang Liu
Yangli-ao Geng
Boyang Li
MoMe
310
2
0
29 May 2025
Two Is Better Than One: Rotations Scale LoRAs
Hongcan Guo
Guoshun Nan
Yuan Yang
Diyang Zhang
Haotian Li
...
Yuhan Ran
Xinye Cao
Sicong Leng
Xiaofeng Tao
Xudong Jiang
246
0
0
29 May 2025
Navigating the Accuracy-Size Trade-Off with Flexible Model Merging
Akash Dhasade
Divyansh Jhunjhunwala
Milos Vujasinovic
Gauri Joshi
Anne-Marie Kermarrec
MoMe
308
0
0
29 May 2025
Update Your Transformer to the Latest Release: Re-Basin of Task Vectors
Filippo Rinaldi
Giacomo Capitani
Lorenzo Bonicelli
Donato Crisostomi
Federico Bolelli
E. Ficarra
Emanuele Rodolà
Simone Calderara
Angelo Porrello
240
9
0
28 May 2025
Multi-objective Large Language Model Alignment with Hierarchical Experts
Zhuo Li
Guodong DU
Weiyang Guo
Yigeng Zhou
Xiucheng Li
...
Fangming Liu
Yequan Wang
Deheng Ye
Min Zhang
Jing Li
ALM
MoE
326
1
0
27 May 2025
Concept Reachability in Diffusion Models: Beyond Dataset Constraints
Marta Aparicio Rodriguez
Xenia Miscouridou
Anastasia Borovykh
259
0
0
25 May 2025
Previous
1
2
3
4
5
6
...
9
10
11
Next
Page 3 of 11
Page
of 11
Go