Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2212.04089
Cited By
v1
v2
v3 (latest)
Editing Models with Task Arithmetic
International Conference on Learning Representations (ICLR), 2022
8 December 2022
Gabriel Ilharco
Marco Tulio Ribeiro
Mitchell Wortsman
Suchin Gururangan
Ludwig Schmidt
Hannaneh Hajishirzi
Ali Farhadi
KELM
MoMe
MU
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (7 upvotes)
Papers citing
"Editing Models with Task Arithmetic"
50 / 523 papers shown
Title
DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models
International Conference on Learning Representations (ICLR), 2024
Wenlong Deng
Yize Zhao
V. Vakilian
Minghui Chen
Xiaoxiao Li
Christos Thrampoulidis
427
8
0
12 Oct 2024
CollabEdit: Towards Non-destructive Collaborative Knowledge Editing
International Conference on Learning Representations (ICLR), 2024
Jiamu Zheng
Jinghuai Zhang
Xuhong Zhang
Xuhong Zhang
Jianwei Yin
Tao Lin
KELM
490
0
0
12 Oct 2024
ELICIT: LLM Augmentation via External In-Context Capability
International Conference on Learning Representations (ICLR), 2024
Futing Wang
Jianhao Yan
Yue Zhang
Tao Lin
310
5
0
12 Oct 2024
MergePrint: Merge-Resistant Fingerprints for Robust Black-box Ownership Verification of Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Shojiro Yamabe
Futa Waseda
Tsubasa Takahashi
Koki Wataoka
MoMe
385
1
0
11 Oct 2024
How Does Vision-Language Adaptation Impact the Safety of Vision Language Models?
International Conference on Learning Representations (ICLR), 2024
Seongyun Lee
Geewook Kim
Jiyeon Kim
Hyunji Lee
Hoyeon Chang
Sue Hyun Park
Minjoon Seo
230
4
0
10 Oct 2024
Extracting and Combining Abilities For Building Multi-lingual Ability-enhanced Large Language Models
Zhipeng Chen
Liang Song
K. Zhou
Wayne Xin Zhao
Binghai Wang
Weipeng Chen
Ji-Rong Wen
314
0
0
10 Oct 2024
WAPITI: A Watermark for Finetuned Open-Source LLMs
Lingjie Chen
Ruizhong Qiu
Siyu Yuan
Zhining Liu
Tianxin Wei
Hyunsik Yoo
Zhichen Zeng
Deqing Yang
Hanghang Tong
WaLM
286
12
0
09 Oct 2024
Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning
Chongyu Fan
Jiancheng Liu
Licong Lin
Jinghan Jia
Ruiqi Zhang
Song Mei
Sijia Liu
MU
566
65
0
09 Oct 2024
Glider: Global and Local Instruction-Driven Expert Router
Pingzhi Li
Prateek Yadav
Jaehong Yoon
Jie Peng
Yi-Lin Sung
Joey Tianyi Zhou
Tianlong Chen
MoMe
MoE
307
3
0
09 Oct 2024
Decouple-Then-Merge: Finetune Diffusion Models as Multi-Task Learning
Computer Vision and Pattern Recognition (CVPR), 2024
Qianli Ma
Xuefei Ning
Dongrui Liu
Li Niu
Linfeng Zhang
MoMe
263
2
0
09 Oct 2024
Diversity-Rewarded CFG Distillation
International Conference on Learning Representations (ICLR), 2024
Geoffrey Cideron
A. Agostinelli
Johan Ferret
Sertan Girgin
Romuald Elie
Olivier Bachem
Sarah Perrin
Alexandre Ramé
223
5
0
08 Oct 2024
Everything Everywhere All at Once: LLMs can In-Context Learn Multiple Tasks in Superposition
Zheyang Xiong
Ziyang Cai
John Cooper
Albert Ge
Vasilis Papageorgiou
...
Saurabh Agarwal
Grigorios G Chrysos
Samet Oymak
Kangwook Lee
Dimitris Papailiopoulos
LRM
227
8
0
08 Oct 2024
NegMerge: Sign-Consensual Weight Merging for Machine Unlearning
Hyoseo Kim
Dongyoon Han
Junsuk Choe
MU
MoMe
285
3
0
08 Oct 2024
Low-Rank Continual Personalization of Diffusion Models
Łukasz Staniszewski
Katarzyna Zaleska
Kamil Deja
DiffM
357
1
0
07 Oct 2024
Exploring the Personality Traits of LLMs through Latent Features Steering
Shu Yang
Shenzhe Zhu
Ruoxuan Bao
Liu Liu
Yu Cheng
Di Wang
150
1
0
07 Oct 2024
MECFormer: Multi-task Whole Slide Image Classification with Expert Consultation Network
Asian Conference on Computer Vision (ACCV), 2024
Doanh C. Bui
Jin Tae Kwak
DiffM
MedIm
126
1
0
06 Oct 2024
OD-Stega: LLM-Based Near-Imperceptible Steganography via Optimized Distributions
Yu-Shin Huang
Peter Just
Krishna Narayanan
Chao Tian
258
15
0
06 Oct 2024
Learning on LoRAs: GL-Equivariant Processing of Low-Rank Weight Spaces for Large Finetuned Models
Theo Putterman
Derek Lim
Yoav Gelberg
Stefanie Jegelka
Haggai Maron
AI4CE
268
12
0
05 Oct 2024
What Matters for Model Merging at Scale?
Prateek Yadav
Tu Vu
Jonathan Lai
Alexandra Chronopoulou
Manaal Faruqui
Joey Tianyi Zhou
Tsendsuren Munkhdalai
MoMe
246
41
0
04 Oct 2024
HiddenGuard: Fine-Grained Safe Generation with Specialized Representation Router
Lingrui Mei
Shenghua Liu
Yiwei Wang
Baolong Bi
Ruibin Yuan
Xueqi Cheng
235
8
0
03 Oct 2024
Parameter Competition Balancing for Model Merging
Neural Information Processing Systems (NeurIPS), 2024
Guodong DU
Junlin Lee
Jing Li
Runhua Jiang
Yifei Guo
...
Hanting Liu
Sim Kuan Goh
Jing Li
Daojing He
Min Zhang
MoMe
199
40
0
03 Oct 2024
DaWin: Training-free Dynamic Weight Interpolation for Robust Adaptation
International Conference on Learning Representations (ICLR), 2024
Changdae Oh
Yixuan Li
Kyungwoo Song
Sangdoo Yun
Dongyoon Han
OOD
MoMe
436
15
0
03 Oct 2024
Upcycling Instruction Tuning from Dense to Mixture-of-Experts via Parameter Merging
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Tingfeng Hui
Zhenyu Zhang
Shuohuan Wang
Yu Sun
Hua Wu
Sen Su
MoE
208
2
0
02 Oct 2024
Towards Inference-time Category-wise Safety Steering for Large Language Models
Amrita Bhattacharjee
Shaona Ghosh
Traian Rebedea
Christopher Parisien
LLMSV
172
14
0
02 Oct 2024
Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models
International Conference on Learning Representations (ICLR), 2024
Lucas Bandarkar
Benjamin Muller
Pritish Yuvraj
Rui Hou
Nayan Singhal
Hongjiang Lv
Bing-Quan Liu
KELM
LRM
MoMe
363
12
0
02 Oct 2024
Foldable SuperNets: Scalable Merging of Transformers with Different Initializations and Tasks
Edan Kinderman
Itay Hubara
Haggai Maron
Daniel Soudry
MoMe
328
3
0
02 Oct 2024
Disentangling Latent Shifts of In-Context Learning with Weak Supervision
Josip Jukić
Jan Snajder
244
1
0
02 Oct 2024
Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models
International Conference on Learning Representations (ICLR), 2024
Saurav Jha
Shiqi Yang
Masato Ishii
Mengjie Zhao
Christian Simon
Muhammad Jehanzeb Mirza
Dong Gong
Lina Yao
Shusuke Takahashi
Yuki Mitsufuji
DiffM
425
2
0
01 Oct 2024
Dual Consolidation for Pre-Trained Model-Based Domain-Incremental Learning
Computer Vision and Pattern Recognition (CVPR), 2024
Da-Wei Zhou
Zi-Wen Cai
Han-Jia Ye
Lijun Zhang
De-Chuan Zhan
CLL
AI4CE
387
9
0
01 Oct 2024
RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models
Neural Information Processing Systems (NeurIPS), 2024
Shuhao Chen
Weisen Jiang
Xiaoyuan Zhang
James T. Kwok
Yu Zhang
RALM
MQ
223
39
0
30 Sep 2024
The Construction of Instruction-tuned LLMs for Finance without Instruction Data Using Continual Pretraining and Model Merging
Masanori Hirano
Kentaro Imajo
MoMe
133
3
0
30 Sep 2024
Realistic Evaluation of Model Merging for Compositional Generalization
Derek Tam
Yash Kant
Brian Lester
Igor Gilitschenski
Colin Raffel
MoMe
237
10
0
26 Sep 2024
Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering
International Conference on Learning Representations (ICLR), 2024
Ziyu Zhao
Tao Shen
Didi Zhu
Zexi Li
Jing Su
Xuwu Wang
Kun Kuang
Fei Wu
MoMe
400
31
0
24 Sep 2024
Layer-wise Model Merging for Unsupervised Domain Adaptation in Segmentation Tasks
The Visual Computer (VC), 2024
Roberto Alcover-Couso
Juan C. Sanmiguel
Marcos Escudero-Viñolo
Jose M. Martínez
FedML
MoMe
149
3
0
24 Sep 2024
Towards understanding evolution of science through language model series
Junjie Dong
Zhuoqi Lyu
Qing Ke
AI4TS
354
0
0
15 Sep 2024
Fingerprint Vector: Enabling Scalable and Efficient Model Fingerprint Transfer via Vector Addition
Zhenhua Xu
Wenpeng Xing
Zhebo Wang
Wenpeng Xing
Chen Jie
Mohan Li
Meng Han
276
2
0
13 Sep 2024
Erasure Coded Neural Network Inference via Fisher Averaging
International Symposium on Information Theory (ISIT), 2024
Divyansh Jhunjhunwala
Neharika Jali
Gauri Joshi
Shiqiang Wang
MoMe
FedML
144
3
0
02 Sep 2024
Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models
Yuncheng Yang
Yulei Qin
Tong Wu
Zihan Xu
Gang Li
...
Yuchen Shi
Ke Li
Xing Sun
Jie Yang
Yun Gu
ALM
OffRL
MoE
311
1
0
28 Aug 2024
Improving the Classification Effect of Clinical Images of Diseases for Multi-Source Privacy Protection
Tian Bowen
Xu Zhengyang
Yin Zhihao
Wang Jingying
Yue Yutao
FedML
154
1
0
23 Aug 2024
SQL-GEN: Bridging the Dialect Gap for Text-to-SQL Via Synthetic Data And Model Merging
Mohammadreza Pourreza
Ruoxi Sun
Hailong Li
Lesly Miculicich
Tomas Pfister
Sercan O. Arik
MoMe
192
14
0
22 Aug 2024
Approaching Deep Learning through the Spectral Dynamics of Weights
David Yunis
Kumar Kshitij Patel
Samuel Wheeler
Pedro H. P. Savarese
Gal Vardi
Karen Livescu
Michael Maire
Matthew R. Walter
274
12
0
21 Aug 2024
SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models
Anke Tang
Li Shen
Yong Luo
Shuai Xie
Han Hu
Lefei Zhang
Di Lin
Dacheng Tao
MoMe
268
9
0
19 Aug 2024
FuseChat: Knowledge Fusion of Chat Models
Fanqi Wan
Longguang Zhong
Ziyi Yang
Ruijun Chen
Xiaojun Quan
ALM
KELM
MoMe
321
35
0
15 Aug 2024
Token Compensator: Altering Inference Cost of Vision Transformer without Re-Tuning
European Conference on Computer Vision (ECCV), 2024
Shibo Jie
Yehui Tang
Jianyuan Guo
Zhi-Hong Deng
Kai Han
Yunhe Wang
VLM
175
6
0
13 Aug 2024
A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning
Prateek Yadav
Colin Raffel
Mohammed Muqeeth
Lucas Caccia
Haokun Liu
Tianlong Chen
Joey Tianyi Zhou
Leshem Choshen
Alessandro Sordoni
MoMe
364
44
0
13 Aug 2024
Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement
Le Yu
Bowen Yu
Haiyang Yu
Fei Huang
Yongbin Li
MoMe
197
9
0
06 Aug 2024
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Fanqing Meng
Jun Wang
Chuanhao Li
Quanfeng Lu
Hao Tian
...
Jifeng Dai
Ping Luo
Ping Luo
Kaipeng Zhang
Wenqi Shao
VLM
194
44
0
05 Aug 2024
Task Prompt Vectors: Effective Initialization through Multi-Task Soft-Prompt Transfer
Wei Chen
Long Chen
Ivan Srba
Yu Wu
MoMe
VLM
267
9
0
02 Aug 2024
On the Limitations and Prospects of Machine Unlearning for Generative AI
Shiji Zhou
Lianzhe Wang
Jiangnan Ye
Yongliang Wu
Heng Chang
MU
243
12
0
01 Aug 2024
Efficient Pareto Manifold Learning with Low-Rank Structure
Weiyu Chen
James T. Kwok
157
9
0
30 Jul 2024
Previous
1
2
3
...
10
11
6
7
8
9
Next