v1v2v3 (latest)

Editing Models with Task Arithmetic

International Conference on Learning Representations (ICLR), 2022

8 December 2022

ArXiv (abs)PDF HTML HuggingFace (7 upvotes)

Papers citing "Editing Models with Task Arithmetic"

23 / 523 papers shown

Title
LIV: Language-Image Representations and Rewards for Robotic ControlInternational Conference on Machine Learning (ICML), 2023 Yecheng Jason Ma William Liang Vaidehi Som Vikash Kumar Amy Zhang Osbert Bastani Dinesh Jayaraman LM&Ro 210 176 0 01 Jun 2023
Language Models Implement Simple Word2Vec-style Vector ArithmeticNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023 Jack Merullo Carsten Eickhoff Ellie Pavlick KELM 285 85 0 25 May 2023
Transferring Learning Trajectories of Neural NetworksInternational Conference on Learning Representations (ICLR), 2023 Daiki Chijiwa 215 4 0 23 May 2023
Detecting and Mitigating Hallucinations in Multilingual SummarisationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 Yifu Qiu Yftah Ziser Anna Korhonen Edoardo Ponti Shay B. Cohen HILM 336 58 0 23 May 2023
Editing Large Language Models: Problems, Methods, and OpportunitiesConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 Yunzhi Yao Peng Wang Bo Tian Shuyang Cheng Zhoubo Li Shumin Deng Huajun Chen Ningyu Zhang KELM 304 392 0 22 May 2023
Task Arithmetic in the Tangent Space: Improved Editing of Pre-Trained ModelsNeural Information Processing Systems (NeurIPS), 2023 Guillermo Ortiz-Jiménez Alessandro Favero P. Frossard MoMe 591 175 0 22 May 2023
ZipIt! Merging Models from Different Tasks without TrainingInternational Conference on Learning Representations (ICLR), 2023 George Stoica Daniel Bolya J. Bjorner Pratik Ramesh Taylor N. Hearn Judy Hoffman VLM MoMe 434 160 0 04 May 2023
An Empirical Study of Multimodal Model MergingConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 Yi-Lin Sung Linjie Li Kevin Qinghong Lin Zhe Gan Joey Tianyi Zhou Lijuan Wang MoMe 285 52 0 28 Apr 2023
Sparsified Model Zoo Twins: Investigating Populations of Sparsified Neural Network Models D. Honegger Konstantin Schurholt Damian Borth 236 5 0 26 Apr 2023
Elastic Weight Removal for Faithful and Abstractive Dialogue GenerationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023 Nico Daheim Nouha Dziri Mrinmaya Sachan Iryna Gurevych Edoardo Ponti MoMe 245 32 0 30 Mar 2023
SVDiff: Compact Parameter Space for Diffusion Fine-TuningIEEE International Conference on Computer Vision (ICCV), 2023 Ligong Han Yinxiao Li Han Zhang P. Milanfar Dimitris N. Metaxas Feng Yang DiffM 628 364 0 20 Mar 2023
Merging Decision Transformers: Weight Averaging for Forming Multi-Task PoliciesIEEE International Conference on Robotics and Automation (ICRA), 2023 Daniel Lawson A. H. Qureshi MoMe OffRL 326 14 0 14 Mar 2023
Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?International Conference on Machine Learning (ICML), 2023 Ruisi Cai Zhenyu Zhang Zinan Lin AAML OOD 223 15 0 24 Feb 2023
Modular Deep Learning Jonas Pfeiffer Sebastian Ruder Ivan Vulić Edoardo Ponti MoMe OOD 413 102 0 22 Feb 2023
Knowledge is a Region in Weight Space for Fine-tuned Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 Almog Gueta Elad Venezian Colin Raffel Noam Slonim Yoav Katz Leshem Choshen 272 56 0 09 Feb 2023
Exploring the Benefits of Training Expert Language Models over Instruction TuningInternational Conference on Machine Learning (ICML), 2023 Joel Jang Seungone Kim Seonghyeon Ye Doyoung Kim Lajanugen Logeswaran Moontae Lee Kyungjae Lee Minjoon Seo LRM ALM 425 93 0 07 Feb 2023
OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization Srinivasan Iyer Xi Lin Ramakanth Pasunuru Todor Mihaylov Daniel Simig ... Jeff Wang Christopher Dewan Asli Celikyilmaz Luke Zettlemoyer Veselin Stoyanov ALM 456 300 0 22 Dec 2022
Model Ratatouille: Recycling Diverse Models for Out-of-Distribution GeneralizationInternational Conference on Machine Learning (ICML), 2022 Alexandre Ramé Kartik Ahuja Jianyu Zhang Matthieu Cord Léon Bottou David Lopez-Paz MoMe OODD 485 101 0 20 Dec 2022
Learning useful representations for shifting tasks and distributionsInternational Conference on Machine Learning (ICML), 2022 Jianyu Zhang Léon Bottou OOD 227 21 0 14 Dec 2022
ColD Fusion: Collaborative Descent for Distributed Multitask FinetuningAnnual Meeting of the Association for Computational Linguistics (ACL), 2022 Shachar Don-Yehiya Elad Venezian Colin Raffel Noam Slonim Yoav Katz Leshem Choshen MoMe 240 60 0 02 Dec 2022
Sufficient Invariant Learning for Distribution ShiftComputer Vision and Pattern Recognition (CVPR), 2022 Taero Kim Sungjun Lim Kyungwoo Song Yonghan Jung Krikamol Muandet Kyungwoo Song OOD 320 3 0 24 Oct 2022
Improving Data-Efficient Fossil Segmentation via Model Editing Indu Panigrahi Ryan Manzuk A. Maloof Ruth C. Fong 182 1 0 08 Oct 2022
Model Patching: Closing the Subgroup Performance Gap with Data Augmentation Karan Goel Albert Gu Shouqing Yang Christopher Ré 347 128 0 15 Aug 2020