Editing Models with Task Arithmetic

International Conference on Learning Representations (ICLR), 2022

8 December 2022

Papers citing "Editing Models with Task Arithmetic"

50 / 525 papers shown

On the Emergence of Linear Analogies in Word Embeddings

259

24 May 2025

How Can I Publish My LLM Benchmark Without Giving the True Answers Away?

Takashi Ishida

Thanawat Lodkaew

Ikko Yamane

708

23 May 2025

When Are Concepts Erased From Diffusion Models?

585

22 May 2025

Training-Free Reasoning and Reflection in MLLMs

Hongchen Wei

Zhenzhong Chen

OffRL VLM LRM

254

22 May 2025

Model Merging is Secretly Certifiable: Non-Vacuous Generalisation Bounds for Low-Shot Learning

Taehoon Kim

Henry Gouk

Minyoung Kim

Timothy M. Hospedales

351

21 May 2025

Covert Attacks on Machine Learning Training in Passively Secure MPC

IACR Cryptology ePrint Archive (IACR ePrint), 2025

306

21 May 2025

Context-Free Synthetic Data Mitigates Forgetting

Parikshit Bansal

Sujay Sanghavi

CLL

350

20 May 2025

SAFEPATH: Preventing Harmful Reasoning in Chain-of-Thought via Early Alignment

731

20 May 2025

Activation-Guided Consensus Merging for Large Language Models

452

20 May 2025

Text Generation Beyond Discrete Token Sampling

513

20 May 2025

Shadow-FT: Tuning Instruct Model via Training on Paired Base Model

699

19 May 2025

Distilling a speech and music encoder with task arithmetic

Fabian Ritter-Gutierrez

264

19 May 2025

Scalable Strategies for Continual Learning with Replay

Truman Hickok

CLL

387

18 May 2025

Cross-Model Transfer of Task Vectors via Few-Shot Orthogonal Alignment

Kazuhiko Kawamoto

Atsuhiro Endo

Hiroshi Kera

319

17 May 2025

MINGLE: Mixture of Null-Space Gated Low-Rank Experts for Test-Time Continual Model Merging

423

17 May 2025

Exploring Criteria of Loss Reweighting to Enhance LLM Unlearning

399

17 May 2025

Do different prompting methods yield a common task representation in language models?

380

17 May 2025

Semantic Aware Linear Transfer by Recycling Pre-trained Language Models for Cross-lingual Transfer

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

324

16 May 2025

MergeBench: A Benchmark for Merging Domain-Specialized LLMs

684

16 May 2025

A Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Merging, and Clinical-Tasks Alignment

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

Jean-Philippe Corbeil

375

15 May 2025

Layered Unlearning for Adversarial Relearning

Timothy Qian

Vinith Suriyakumar

Ashia Wilson

Dylan Hadfield-Menell

345

14 May 2025

CAT Merging: A Training-Free Approach for Resolving Conflicts in Model Merging

329

11 May 2025

Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation

Computer Vision and Pattern Recognition (CVPR), 2025

480

09 May 2025

WaterDrum: Watermarking for Data-centric Unlearning Metric

Xinyang Lu

Xinyuan Niu

Gregory Kang Ruey Lau

Bui Thi Cam Nhung

Rachael Hwee Ling Sim

Fanyu Wen

Chuan-Sheng Foo

Szu Hui Ng

Bryan Kian Hsiang Low

271

08 May 2025

Unlearning Sensitive Information in Multimodal LLMs: Benchmark and Attack-Defense Evaluation

527

01 May 2025

GDI-Bench: A Benchmark for General Document Intelligence with Vision and Reasoning Decoupling

...

584

30 Apr 2025

Adaptive Helpfulness-Harmlessness Alignment with Preference Vectors

329

27 Apr 2025

ParamΔ for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost

339

23 Apr 2025

Parameter-Efficient Checkpoint Merging via Metrics-Weighted Averaging

Shi Jie Yu

Sehyun Choi

MoMe

285

23 Apr 2025

Advancing AI-assisted Hardware Design with Hierarchical Decentralized Training and Personalized Inference-Time Optimization

258

21 Apr 2025

Mitigating Parameter Interference in Model Merging via Sharpness-Aware Fine-Tuning

International Conference on Learning Representations (ICLR), 2025

422

20 Apr 2025

TrustLoRA: Low-Rank Adaptation for Failure Detection under Out-of-distribution Data

Fei Zhu

Zhaoxiang Zhang

OODD UQCV

377

20 Apr 2025

DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging

343

16 Apr 2025

Leveraging Submodule Linearity Enhances Task Arithmetic Performance in LLMs

International Conference on Learning Representations (ICLR), 2025

320

15 Apr 2025

When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers

International Conference on Learning Representations (ICLR), 2025

801

15 Apr 2025

Alleviating the Fear of Losing Alignment in LLM Fine-tuning

IEEE Symposium on Security and Privacy (S&P), 2025

280

13 Apr 2025

LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation

357

10 Apr 2025

FedMerge: Federated Personalization via Model Merging

379

09 Apr 2025

SEA-LION: Southeast Asian Languages in One Network

...

430

08 Apr 2025

Not All Data Are Unlearned Equally

Aravind Krishnan

Siva Reddy

Marius Mosbach

896

07 Apr 2025

Exact Unlearning of Finetuning Data via Model Merging at Scale

Kevin Kuo

Amrith Rajagopal Setlur

289

06 Apr 2025

MASS: MoErging through Adaptive Subspace Selection

Donato Crisostomi

Alessandro Zirilli

Antonio Andrea Gargiulo

Maria Sofia Bucarelli

292

06 Apr 2025

Efficient Model Editing with Task-Localized Sparse Fine-tuning

International Conference on Learning Representations (ICLR), 2025

350

03 Apr 2025

BECAME: BayEsian Continual Learning with Adaptive Model MErging

349

03 Apr 2025

Enhancing Image Resolution of Solar Magnetograms: A Latent Diffusion Model Approach

272

31 Mar 2025

AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization

Computer Vision and Pattern Recognition (CVPR), 2025

...

230

31 Mar 2025

SUV: Scalable Large Language Model Copyright Compliance with Regularized Selective Unlearning

551

29 Mar 2025

AdaRank: Adaptive Rank Pruning for Enhanced Model Merging

295

28 Mar 2025

Reinforced Model Merging

274

27 Mar 2025

Guided Model Merging for Hybrid Data Learning: Leveraging Centralized Data to Refine Decentralized Models

486

26 Mar 2025