ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2212.04089
  4. Cited By
Editing Models with Task Arithmetic
v1v2v3 (latest)

Editing Models with Task Arithmetic

International Conference on Learning Representations (ICLR), 2022
8 December 2022
Gabriel Ilharco
Marco Tulio Ribeiro
Mitchell Wortsman
Suchin Gururangan
Ludwig Schmidt
Hannaneh Hajishirzi
Ali Farhadi
    KELMMoMeMU
ArXiv (abs)PDFHTMLHuggingFace (7 upvotes)

Papers citing "Editing Models with Task Arithmetic"

23 / 523 papers shown
Title
LIV: Language-Image Representations and Rewards for Robotic Control
LIV: Language-Image Representations and Rewards for Robotic ControlInternational Conference on Machine Learning (ICML), 2023
Yecheng Jason Ma
William Liang
Vaidehi Som
Vikash Kumar
Amy Zhang
Osbert Bastani
Dinesh Jayaraman
LM&Ro
210
176
0
01 Jun 2023
Language Models Implement Simple Word2Vec-style Vector Arithmetic
Language Models Implement Simple Word2Vec-style Vector ArithmeticNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023
Jack Merullo
Carsten Eickhoff
Ellie Pavlick
KELM
285
85
0
25 May 2023
Transferring Learning Trajectories of Neural Networks
Transferring Learning Trajectories of Neural NetworksInternational Conference on Learning Representations (ICLR), 2023
Daiki Chijiwa
215
4
0
23 May 2023
Detecting and Mitigating Hallucinations in Multilingual Summarisation
Detecting and Mitigating Hallucinations in Multilingual SummarisationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yifu Qiu
Yftah Ziser
Anna Korhonen
Edoardo Ponti
Shay B. Cohen
HILM
336
58
0
23 May 2023
Editing Large Language Models: Problems, Methods, and Opportunities
Editing Large Language Models: Problems, Methods, and OpportunitiesConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yunzhi Yao
Peng Wang
Bo Tian
Shuyang Cheng
Zhoubo Li
Shumin Deng
Huajun Chen
Ningyu Zhang
KELM
304
392
0
22 May 2023
Task Arithmetic in the Tangent Space: Improved Editing of Pre-Trained
  Models
Task Arithmetic in the Tangent Space: Improved Editing of Pre-Trained ModelsNeural Information Processing Systems (NeurIPS), 2023
Guillermo Ortiz-Jiménez
Alessandro Favero
P. Frossard
MoMe
591
175
0
22 May 2023
ZipIt! Merging Models from Different Tasks without Training
ZipIt! Merging Models from Different Tasks without TrainingInternational Conference on Learning Representations (ICLR), 2023
George Stoica
Daniel Bolya
J. Bjorner
Pratik Ramesh
Taylor N. Hearn
Judy Hoffman
VLMMoMe
434
160
0
04 May 2023
An Empirical Study of Multimodal Model Merging
An Empirical Study of Multimodal Model MergingConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yi-Lin Sung
Linjie Li
Kevin Qinghong Lin
Zhe Gan
Joey Tianyi Zhou
Lijuan Wang
MoMe
285
52
0
28 Apr 2023
Sparsified Model Zoo Twins: Investigating Populations of Sparsified
  Neural Network Models
Sparsified Model Zoo Twins: Investigating Populations of Sparsified Neural Network Models
D. Honegger
Konstantin Schurholt
Damian Borth
236
5
0
26 Apr 2023
Elastic Weight Removal for Faithful and Abstractive Dialogue Generation
Elastic Weight Removal for Faithful and Abstractive Dialogue GenerationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023
Nico Daheim
Nouha Dziri
Mrinmaya Sachan
Iryna Gurevych
Edoardo Ponti
MoMe
245
32
0
30 Mar 2023
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning
SVDiff: Compact Parameter Space for Diffusion Fine-TuningIEEE International Conference on Computer Vision (ICCV), 2023
Ligong Han
Yinxiao Li
Han Zhang
P. Milanfar
Dimitris N. Metaxas
Feng Yang
DiffM
628
364
0
20 Mar 2023
Merging Decision Transformers: Weight Averaging for Forming Multi-Task
  Policies
Merging Decision Transformers: Weight Averaging for Forming Multi-Task PoliciesIEEE International Conference on Robotics and Automation (ICRA), 2023
Daniel Lawson
A. H. Qureshi
MoMeOffRL
326
14
0
14 Mar 2023
Robust Weight Signatures: Gaining Robustness as Easy as Patching
  Weights?
Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?International Conference on Machine Learning (ICML), 2023
Ruisi Cai
Zhenyu Zhang
Zinan Lin
AAMLOOD
223
15
0
24 Feb 2023
Modular Deep Learning
Modular Deep Learning
Jonas Pfeiffer
Sebastian Ruder
Ivan Vulić
Edoardo Ponti
MoMeOOD
413
102
0
22 Feb 2023
Knowledge is a Region in Weight Space for Fine-tuned Language Models
Knowledge is a Region in Weight Space for Fine-tuned Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Almog Gueta
Elad Venezian
Colin Raffel
Noam Slonim
Yoav Katz
Leshem Choshen
272
56
0
09 Feb 2023
Exploring the Benefits of Training Expert Language Models over
  Instruction Tuning
Exploring the Benefits of Training Expert Language Models over Instruction TuningInternational Conference on Machine Learning (ICML), 2023
Joel Jang
Seungone Kim
Seonghyeon Ye
Doyoung Kim
Lajanugen Logeswaran
Moontae Lee
Kyungjae Lee
Minjoon Seo
LRMALM
425
93
0
07 Feb 2023
OPT-IML: Scaling Language Model Instruction Meta Learning through the
  Lens of Generalization
OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization
Srinivasan Iyer
Xi Lin
Ramakanth Pasunuru
Todor Mihaylov
Daniel Simig
...
Jeff Wang
Christopher Dewan
Asli Celikyilmaz
Luke Zettlemoyer
Veselin Stoyanov
ALM
456
300
0
22 Dec 2022
Model Ratatouille: Recycling Diverse Models for Out-of-Distribution
  Generalization
Model Ratatouille: Recycling Diverse Models for Out-of-Distribution GeneralizationInternational Conference on Machine Learning (ICML), 2022
Alexandre Ramé
Kartik Ahuja
Jianyu Zhang
Matthieu Cord
Léon Bottou
David Lopez-Paz
MoMeOODD
485
101
0
20 Dec 2022
Learning useful representations for shifting tasks and distributions
Learning useful representations for shifting tasks and distributionsInternational Conference on Machine Learning (ICML), 2022
Jianyu Zhang
Léon Bottou
OOD
227
21
0
14 Dec 2022
ColD Fusion: Collaborative Descent for Distributed Multitask Finetuning
ColD Fusion: Collaborative Descent for Distributed Multitask FinetuningAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Shachar Don-Yehiya
Elad Venezian
Colin Raffel
Noam Slonim
Yoav Katz
Leshem Choshen
MoMe
240
60
0
02 Dec 2022
Sufficient Invariant Learning for Distribution Shift
Sufficient Invariant Learning for Distribution ShiftComputer Vision and Pattern Recognition (CVPR), 2022
Taero Kim
Sungjun Lim
Kyungwoo Song
Yonghan Jung
Krikamol Muandet
Kyungwoo Song
OOD
320
3
0
24 Oct 2022
Improving Data-Efficient Fossil Segmentation via Model Editing
Improving Data-Efficient Fossil Segmentation via Model Editing
Indu Panigrahi
Ryan Manzuk
A. Maloof
Ruth C. Fong
182
1
0
08 Oct 2022
Model Patching: Closing the Subgroup Performance Gap with Data
  Augmentation
Model Patching: Closing the Subgroup Performance Gap with Data Augmentation
Karan Goel
Albert Gu
Shouqing Yang
Christopher Ré
347
128
0
15 Aug 2020
Previous
123...10119