v1v2v3 (latest)

Editing Models with Task Arithmetic

International Conference on Learning Representations (ICLR), 2022

8 December 2022

ArXiv (abs)PDF HTML HuggingFace (7 upvotes)

Papers citing "Editing Models with Task Arithmetic"

50 / 525 papers shown

Towards Modular LLMs by Building and Reusing a Library of LoRAsInternational Conference on Machine Learning (ICML), 2024

Nicolas Le Roux

241

18 May 2024

A safety realignment framework via subspace-oriented model fusion for large language modelsKnowledge-Based Systems (KBS), 2024

217

15 May 2024

Localizing Task Information for Improved Model Merging and CompressionInternational Conference on Machine Learning (ICML), 2024

Ke Wang

Nikolaos Dimitriadis

Guillermo Ortiz-Jimenez

Franccois Fleuret

Pascal Frossard

MoMe

279

13 May 2024

Zero-Shot Tokenizer TransferNeural Information Processing Systems (NeurIPS), 2024

278

13 May 2024

Erasing Concepts from Text-to-Image Diffusion Models with Few-shot Unlearning

Masane Fuchi

Tomohiro Takagi

DiffM VLM

260

12 May 2024

To Each (Textual Sequence) Its Own: Improving Memorized-Data Unlearning in Large Language ModelsInternational Conference on Machine Learning (ICML), 2024

George-Octavian Barbulescu

Peter Triantafillou

341

06 May 2024

Random Masking Finds Winning Tickets for Parameter Efficient Fine-tuningInternational Conference on Machine Learning (ICML), 2024

Jing Xu

Jingzhao Zhang

222

04 May 2024

Creative Problem Solving in Large Language and Vision Models -- What Would it Take?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

314

02 May 2024

Espresso: Robust Concept Filtering in Text-to-Image Models

498

30 Apr 2024

HFT: Half Fine-Tuning for Large Language Models

Weiran Xu

288

29 Apr 2024

Model Extrapolation Expedites Alignment

413

25 Apr 2024

No Train but Gain: Language Arithmetic for training-free Language Adapters enhancement

273

24 Apr 2024

DynaMMo: Dynamic Model Merging for Efficient Class Incremental Learning for Medical Images

Mohammad Areeb Qazi

Ibrahim Almakky

Anees Ur Rehman Hashmi

Santosh Sanjeev

Mohammad Yaqub

MoMe

244

22 Apr 2024

Decomposing and Editing Predictions by Modeling Model Computation

Harshay Shah

Andrew Ilyas

Aleksander Madry

KELM

290

17 Apr 2024

In-Context Learning State Vector with Inner and Momentum Optimization

Baotian Hu

Min Zhang

252

17 Apr 2024

MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models

Nithin Gopalakrishnan Nair

Jeya Maria Jose Valanarasu

Vishal M. Patel

MoMe

285

15 Apr 2024

Learn Your Reference Model for Real Good Alignment

547

15 Apr 2024

DIMAT: Decentralized Iterative Merging-And-Training for Deep Learning Models

Aditya Balu

Soumik Sarkar

240

11 Apr 2024

Have You Merged My Model? On The Robustness of Large Language Model IP Protection Methods Against Model Merging

Qi Li

138

08 Apr 2024

Lossless and Near-Lossless Compression for Foundation Models

270

05 Apr 2024

Digital Forgetting in Large Language Models: A Survey of Unlearning MethodsArtificial Intelligence Review (Artif Intell Rev), 2024

Alberto Blanco-Justicia

N. Jebreel

Benet Manzanares-Salor

328

02 Apr 2024

Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models BetterInternational Conference on Learning Representations (ICLR), 2024

Shuaiqi Wang

...

Sergey Yekhanin

421

02 Apr 2024

Model Stock: All we need is just a few fine-tuned models

348

28 Mar 2024

A Unified Module for Accelerating STABLE-DIFFUSION: LCM-LORA

Ayush Thakur

Rashmi Vashisth

MoMe

24 Mar 2024

Emergent World Models and Latent Variable Estimation in Chess-Playing Language Models

Adam Karvonen

266

21 Mar 2024

FissionFusion: Fast Geometric Generation and Hierarchical Souping for Medical Image Analysis

Santosh Sanjeev

Nuren Zhaksylyk

Ibrahim Almakky

Anees Ur Rehman Hashmi

Mohammad Areeb Qazi

Mohammad Yaqub

320

20 Mar 2024

FedFisher: Leveraging Fisher Information for One-Shot Federated LearningInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024

Divyansh Jhunjhunwala

Shiqiang Wang

Gauri Joshi

FedML

209

19 Mar 2024

CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility

Lei Zhang

276

18 Mar 2024

DAM: Dynamic Adapter Merging for Continual Video QA LearningIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

Feng Cheng

Ziyang Wang

Yi-Lin Sung

Yan-Bo Lin

Mohit Bansal

Gedas Bertasius

CLL MoMe

360

13 Mar 2024

Unleashing the Power of Meta-tuning for Few-shot Generalization Through Sparse Interpolated ExpertsInternational Conference on Machine Learning (ICML), 2024

Jonathan Richard Schwarz

Ying Wei

MoE

475

13 Mar 2024

SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated DataNeural Information Processing Systems (NeurIPS), 2024

Mohit Bansal

249

11 Mar 2024

The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

...

746

298

05 Mar 2024

Training-Free Pretrained Model Merging

392

04 Mar 2024

Dissecting Language Models: Machine Unlearning via Selective Pruning

Nicholas Pochinkov

Nandi Schoots

MILM MU

215

02 Mar 2024

Eight Methods to Evaluate Robust Unlearning in LLMs

Dylan Hadfield-Menell

ELM MU

336

115

26 Feb 2024

Training Neural Networks from Scratch with Parallel Low-Rank Adapters

270

26 Feb 2024

InstructEdit: Instruction-based Knowledge Editing for Large Language Models

Ningyu Zhang

Huajun Chen

198

25 Feb 2024

Knowledge Fusion of Chat LLMs: A Preliminary Technical Report

Wei Bi

491

25 Feb 2024

Does Combining Parameter-efficient Modules Improve Few-shot Transfer Accuracy?

252

23 Feb 2024

Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking

310

22 Feb 2024

Q-Probe: A Lightweight Approach to Reward Maximization for Language Models

David Brandfonbrener

287

22 Feb 2024

Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization

Yannis Panagakis

274

19 Feb 2024

Rethinking Machine Unlearning for Large Language Models

...

Mohit Bansal

Yang Liu

426

196

13 Feb 2024

Learning to Route Among Specialized Experts for Zero-Shot Generalization

260

08 Feb 2024

On the Emergence of Cross-Task Linearity in the Pretraining-Finetuning ParadigmInternational Conference on Machine Learning (ICML), 2024

365

06 Feb 2024

Representation Surgery for Multi-Task Model Merging

Li Shen

307

05 Feb 2024

MixedNUTS: Training-Free Accuracy-Robustness Balance via Nonlinearly Mixed Classifiers

Somayeh Sojoudi

391

03 Feb 2024

Merging Multi-Task Models via Weight-Ensembling Mixture of Experts

Li Shen

Nan Yin

300

01 Feb 2024

How Useful is Continued Pre-Training for Generative Unsupervised Domain Adaptation?

Rheeya Uppaal

Yixuan Li

Junjie Hu

444

31 Jan 2024

WARM: On the Benefits of Weight Averaged Reward ModelsInternational Conference on Machine Learning (ICML), 2024

Nino Vieillard

Olivier Bachem

350

130

22 Jan 2024