Knowledge is a Region in Weight Space for Fine-tuned Language Models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
9 February 2023
Almog Gueta, Elad Venezian, Colin Raffel, Noam Slonim, Yoav Katz, Leshem Choshen
arXiv:2302.04863 (abs · PDF · HTML · GitHub)

Papers citing "Knowledge is a Region in Weight Space for Fine-tuned Language Models"

38 papers shown

Learning to Interpret Weight Differences in Language Models
Avichal Goel, Yoon Kim, Nir Shavit, T. T. Wang
06 Oct 2025

SafeConstellations: Mitigating Over-Refusals in LLMs Through Task-Aware Representation Steering
Utsav Maskey, Sumit Yadav, Mark Dras, Usman Naseem
15 Aug 2025

Dynamic Weight Grafting: Localizing Finetuned Factual Knowledge in Transformers
Todd Nief, David Reber, Sean Richardson, Ari Holtzman
25 Jun 2025

TrustLoRA: Low-Rank Adaptation for Failure Detection under Out-of-distribution Data
Fei Zhu, Zhaoxiang Zhang
20 Apr 2025

Neuroplasticity and Corruption in Model Mechanisms: A Case Study Of Indirect Object Identification
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Vishnu Kabir Chhabra, Ding Zhu, Mohammad Mahdi Khalili
27 Feb 2025

Portable Reward Tuning: Towards Reusable Fine-Tuning across Different Pretrained Models
Daiki Chijiwa, Taku Hasegawa, Kyosuke Nishida, Kuniko Saito, Susumu Takeuchi
18 Feb 2025

Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic
Yifei He, Yuzheng Hu, Yong Lin, Tong Zhang, Han Zhao
08 Jan 2025

Reversed Attention: On The Gradient Descent Of Attention Layers In GPT
Shahar Katz, Lior Wolf
22 Dec 2024

Gradient Localization Improves Lifelong Pretraining of Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Jared Fernandez, Yonatan Bisk, Emma Strubell
07 Nov 2024

Local Contrastive Editing of Gender Stereotypes
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Marlene Lutz, Rochelle Choenni, M. Strohmaier, Anne Lauscher
23 Oct 2024

Deep Model Merging: The Sister of Neural Network Interpretability -- A Survey
A. Khan, Todd Nief, Nathaniel Hudson, Mansi Sakarvadia, Daniel Grzenda, Aswathy Ajith, Jordan Pettyjohn, Kyle Chard, Ian Foster
16 Oct 2024

What Matters for Model Merging at Scale?
Prateek Yadav, Tu Vu, Jonathan Lai, Alexandra Chronopoulou, Manaal Faruqui, Joey Tianyi Zhou, Tsendsuren Munkhdalai
04 Oct 2024

Realistic Evaluation of Model Merging for Compositional Generalization
Derek Tam, Yash Kant, Brian Lester, Igor Gilitschenski, Colin Raffel
26 Sep 2024

MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning
Yupeng Chen, Senmiao Wang, Yushun Zhang, Zhihang Lin, Haozhe Zhang, Tian Ding, Ruoyu Sun
30 Jul 2024

WARP: On the Benefits of Weight Averaged Rewarded Policies
Alexandre Ramé, Johan Ferret, Nino Vieillard, Robert Dadashi, Léonard Hussenot, Pierre-Louis Cedoz, Pier Giuseppe Sessa, Sertan Girgin, Arthur Douillard, Olivier Bachem
24 Jun 2024

Fighting Randomness with Randomness: Mitigating Optimisation Instability of Fine-Tuning using Delayed Ensemble and Noisy Interpolation
Branislav Pecher, Ján Cegin, Róbert Belanec, Jakub Simko, Ivan Srba, Maria Bielikova
18 Jun 2024

Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging
Zhenyi Lu, Chenghao Fan, Wei Wei, Xiaoye Qu, Dangyang Chen, Yu Cheng
17 Jun 2024

Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training
Feiteng Fang, Yuelin Bai, Shiwen Ni, Min Yang, Xiaojun Chen, Ruifeng Xu
31 May 2024

Evaluating the External and Parametric Knowledge Fusion of Large Language Models
Hao Zhang, Yuyang Zhang, Xiaoguang Li, Wenxuan Shi, Haonan Xu, ..., Yasheng Wang, Lifeng Shang, Qun Liu, Yong Liu, Ruiming Tang
29 May 2024

Disperse-Then-Merge: Pushing the Limits of Instruction Tuning via Alignment Tax Reduction
Tingchen Fu, Deng Cai, Lemao Liu, Shuming Shi, Rui Yan
22 May 2024

Lossless and Near-Lossless Compression for Foundation Models
Moshik Hershcovitch, Leshem Choshen, Andrew Wood, Ilias Enmouri, Peter Chin, S. Sundararaman, Danny Harnik
05 Apr 2024

Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking
Nikhil Prakash, Tamar Rott Shaham, Tal Haklay, Yonatan Belinkov, David Bau
22 Feb 2024

Towards Unified Task Embeddings Across Multiple Models: Bridging the Gap for Prompt-Based Large Language Models and Beyond
Xinyu Wang, Hainiu Xu, Lin Gui, Yulan He
22 Feb 2024

Backward Lens: Projecting Language Model Gradients into the Vocabulary Space
Shahar Katz, Yonatan Belinkov, Mor Geva, Lior Wolf
20 Feb 2024

Bayesian Multi-Task Transfer Learning for Soft Prompt Tuning
Haeju Lee, Minchan Jeong, SeYoung Yun, Kee-Eung Kim
13 Feb 2024

WARM: On the Benefits of Weight Averaged Reward Models
International Conference on Machine Learning (ICML), 2024
Alexandre Ramé, Nino Vieillard, Léonard Hussenot, Robert Dadashi, Geoffrey Cideron, Olivier Bachem, Johan Ferret
22 Jan 2024

A Comprehensive Study of Knowledge Editing for Large Language Models
Ningyu Zhang, Yunzhi Yao, Bo Tian, Peng Wang, Shumin Deng, ..., Lei Liang, Qing Cui, Xiao-Jun Zhu, Jun Zhou, Huajun Chen
02 Jan 2024

Merging by Matching Models in Task Parameter Subspaces
Derek Tam, Mohit Bansal, Colin Raffel
07 Dec 2023

Can Knowledge Graphs Reduce Hallucinations in LLMs?: A Survey
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Garima Agrawal, Tharindu Kumarage, Zeyad Alghami, Huanmin Liu
14 Nov 2023

Fuse to Forget: Bias Reduction and Selective Memorization through Model Fusion
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Kerem Zaman, Leshem Choshen, Shashank Srivastava
13 Nov 2023

RSVP: Customer Intent Detection via Agent Response Contrastive and Generative Pre-Training
Yu-Chien Tang, Wei-Yao Wang, An-Zi Yen, Wenjie Peng
15 Oct 2023

Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy
International Conference on Learning Representations (ICLR), 2023
Pingzhi Li, Zhenyu Zhang, Prateek Yadav, Yi-Lin Sung, Yu Cheng, Mohit Bansal, Tianlong Chen
02 Oct 2023

Cordyceps@LT-EDI: Patching Language-Specific Homophobia/Transphobia Classifiers with a Multilingual Understanding
Dean Ninalga
24 Sep 2023

Derivative Free Weight-space Ensembling
Dean Ninalga
07 Jul 2023

Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging
International Conference on Learning Representations (ICLR), 2023
Max Zimmer, Christoph Spiegel, Sebastian Pokutta
29 Jun 2023

TIES-Merging: Resolving Interference When Merging Models
Neural Information Processing Systems (NeurIPS), 2023
Prateek Yadav, Derek Tam, Leshem Choshen, Colin Raffel, Joey Tianyi Zhou
02 Jun 2023

ZipIt! Merging Models from Different Tasks without Training
International Conference on Learning Representations (ICLR), 2023
George Stoica, Daniel Bolya, J. Bjorner, Pratik Ramesh, Taylor N. Hearn, Judy Hoffman
04 May 2023

ColD Fusion: Collaborative Descent for Distributed Multitask Finetuning
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Shachar Don-Yehiya, Elad Venezian, Colin Raffel, Noam Slonim, Yoav Katz, Leshem Choshen
02 Dec 2022