ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.10199
  4. Cited By
BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based
  Masked Language-models
v1v2v3v4v5 (latest)

BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models

Annual Meeting of the Association for Computational Linguistics (ACL), 2021
18 June 2021
Elad Ben-Zaken
Shauli Ravfogel
Yoav Goldberg
ArXiv (abs)PDFHTML

Papers citing "BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models"

50 / 968 papers shown
Parameter-Efficient Prompt Tuning Makes Generalized and Calibrated
  Neural Text Retrievers
Parameter-Efficient Prompt Tuning Makes Generalized and Calibrated Neural Text RetrieversConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Weng Lam Tam
Xiao Liu
Kaixuan Ji
Lilong Xue
Xing Zhang
Yuxiao Dong
Jiahua Liu
Maodi Hu
Jie Tang
VPVLM
177
31
0
14 Jul 2022
Convolutional Bypasses Are Better Vision Transformer Adapters
Convolutional Bypasses Are Better Vision Transformer AdaptersEuropean Conference on Artificial Intelligence (ECAI), 2022
Shibo Jie
Zhi-Hong Deng
VPVLM
264
159
0
14 Jul 2022
Meta-Learning the Difference: Preparing Large Language Models for
  Efficient Adaptation
Meta-Learning the Difference: Preparing Large Language Models for Efficient AdaptationTransactions of the Association for Computational Linguistics (TACL), 2022
Zejiang Hou
Julian Salazar
George Polovets
183
20
0
07 Jul 2022
On-Device Training Under 256KB Memory
On-Device Training Under 256KB MemoryNeural Information Processing Systems (NeurIPS), 2022
Ji Lin
Ligeng Zhu
Wei-Ming Chen
Wei-Chen Wang
Chuang Gan
Song Han
MQ
455
259
0
30 Jun 2022
ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning
ST-Adapter: Parameter-Efficient Image-to-Video Transfer LearningNeural Information Processing Systems (NeurIPS), 2022
Junting Pan
Ziyi Lin
Xiatian Zhu
Jing Shao
Jiaming Song
382
264
0
27 Jun 2022
Sparse Structure Search for Parameter-Efficient Tuning
Sparse Structure Search for Parameter-Efficient Tuning
Shengding Hu
Zhen Zhang
Ning Ding
Yadao Wang
Yasheng Wang
Zhiyuan Liu
Maosong Sun
145
19
0
15 Jun 2022
LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer
  Learning
LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer LearningNeural Information Processing Systems (NeurIPS), 2022
Yi-Lin Sung
Jaemin Cho
Joey Tianyi Zhou
VLM
340
291
0
13 Jun 2022
Neural Prompt Search
Neural Prompt SearchIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yuanhan Zhang
Kaiyang Zhou
Ziwei Liu
VPVLMVLM
371
173
0
09 Jun 2022
Modular and On-demand Bias Mitigation with Attribute-Removal Subnetworks
Modular and On-demand Bias Mitigation with Attribute-Removal SubnetworksAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Lukas Hauzenberger
Shahed Masoudian
Deepak Kumar
Markus Schedl
Navid Rekabsaz
373
21
0
30 May 2022
AdaptFormer: Adapting Vision Transformers for Scalable Visual
  Recognition
AdaptFormer: Adapting Vision Transformers for Scalable Visual RecognitionNeural Information Processing Systems (NeurIPS), 2022
Shoufa Chen
Chongjian Ge
Zhan Tong
Jiangliu Wang
Yibing Song
Jue Wang
Ping Luo
613
930
0
26 May 2022
Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual Generation
Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Tu Vu
Aditya Barua
Brian Lester
Daniel Cer
Mohit Iyyer
Noah Constant
CLL
325
72
0
25 May 2022
Know Where You're Going: Meta-Learning for Parameter-Efficient
  Fine-Tuning
Know Where You're Going: Meta-Learning for Parameter-Efficient Fine-TuningAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Mozhdeh Gheini
Xuezhe Ma
Jonathan May
127
9
0
25 May 2022
ATTEMPT: Parameter-Efficient Multi-task Tuning via Attentional Mixtures
  of Soft Prompts
ATTEMPT: Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft PromptsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Akari Asai
Mohammadreza Salehi
Matthew E. Peters
Hannaneh Hajishirzi
391
122
0
24 May 2022
Continual Learning with Global Alignment
Continual Learning with Global AlignmentNeural Information Processing Systems (NeurIPS), 2022
Xueying Bai
Jinghuan Shang
Yifan Sun
Niranjan Balasubramanian
CLL
156
1
0
24 May 2022
Representation Projection Invariance Mitigates Representation Collapse
Representation Projection Invariance Mitigates Representation CollapseConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Anastasia Razdaibiedina
A. Khetan
Zohar Karnin
Daniel Khashabi
Vishaal Kapoor
V. Madan
258
7
0
23 May 2022
When does Parameter-Efficient Transfer Learning Work for Machine
  Translation?
When does Parameter-Efficient Transfer Learning Work for Machine Translation?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ahmet Üstün
Asa Cooper Stickland
181
8
0
23 May 2022
BBTv2: Towards a Gradient-Free Future with Large Language Models
BBTv2: Towards a Gradient-Free Future with Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Tianxiang Sun
Zhengfu He
Hong Qian
Yunhua Zhou
Xuanjing Huang
Xipeng Qiu
279
71
0
23 May 2022
Self-Supervised Speech Representation Learning: A Review
Self-Supervised Speech Representation Learning: A ReviewIEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSLAI4TS
655
442
0
21 May 2022
Deep transfer learning for image classification: a survey
Deep transfer learning for image classification: a survey
J. Plested
Musa Phiri
Tom Gedeon
OOD
210
47
0
20 May 2022
Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than
  In-Context Learning
Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context LearningNeural Information Processing Systems (NeurIPS), 2022
Haokun Liu
Derek Tam
Mohammed Muqeeth
Jay Mohta
Tenghao Huang
Joey Tianyi Zhou
Colin Raffel
453
1,163
0
11 May 2022
Empowering parameter-efficient transfer learning by recognizing the
  kernel structure in self-attention
Empowering parameter-efficient transfer learning by recognizing the kernel structure in self-attention
Yifan Chen
Devamanyu Hazarika
Mahdi Namazifar
Yang Liu
Di Jin
Dilek Z. Hakkani-Tür
120
8
0
07 May 2022
Efficient Fine-Tuning of BERT Models on the Edge
Efficient Fine-Tuning of BERT Models on the EdgeInternational Symposium on Circuits and Systems (ISCAS), 2022
Danilo Vucetic
Mohammadreza Tayaranian
M. Ziaeefard
J. Clark
B. Meyer
W. Gross
254
42
0
03 May 2022
AdapterBias: Parameter-efficient Token-dependent Representation Shift
  for Adapters in NLP Tasks
AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NLP Tasks
Chin-Lun Fu
Zih-Ching Chen
Yun-Ru Lee
Hung-yi Lee
173
53
0
30 Apr 2022
Flamingo: a Visual Language Model for Few-Shot Learning
Flamingo: a Visual Language Model for Few-Shot LearningNeural Information Processing Systems (NeurIPS), 2022
Jean-Baptiste Alayrac
Jeff Donahue
Pauline Luc
Antoine Miech
Iain Barr
...
Mikolaj Binkowski
Ricardo Barreira
Oriol Vinyals
Andrew Zisserman
Karen Simonyan
MLLMVLM
695
4,826
0
29 Apr 2022
Parameter-Efficient Neural Reranking for Cross-Lingual and Multilingual
  Retrieval
Parameter-Efficient Neural Reranking for Cross-Lingual and Multilingual RetrievalInternational Conference on Computational Linguistics (COLING), 2022
Robert Litschko
Ivan Vulić
Goran Glavaš
LRM
393
19
0
05 Apr 2022
Improved and Efficient Conversational Slot Labeling through Question
  Answering
Improved and Efficient Conversational Slot Labeling through Question Answering
Gabor Fuisz
Ivan Vulić
Samuel Gibbons
I. Casanueva
Paweł Budzianowski
215
13
0
05 Apr 2022
PERFECT: Prompt-free and Efficient Few-shot Learning with Language
  Models
PERFECT: Prompt-free and Efficient Few-shot Learning with Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Rabeeh Karimi Mahabadi
Luke Zettlemoyer
James Henderson
Marzieh Saeidi
Lambert Mathias
Ves Stoyanov
Majid Yazdani
VLM
198
76
0
03 Apr 2022
Parameter-efficient Model Adaptation for Vision Transformers
Parameter-efficient Model Adaptation for Vision TransformersAAAI Conference on Artificial Intelligence (AAAI), 2022
Xuehai He
Chunyuan Li
Pengchuan Zhang
Jianwei Yang
Xinze Wang
163
104
0
29 Mar 2022
Few-Shot Learning with Siamese Networks and Label Tuning
Few-Shot Learning with Siamese Networks and Label TuningAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Thomas Müller
Guillermo Pérez-Torró
Marc Franco-Salvador
VLM
203
47
0
28 Mar 2022
A Scalable Model Specialization Framework for Training and Inference
  using Submodels and its Application to Speech Model Personalization
A Scalable Model Specialization Framework for Training and Inference using Submodels and its Application to Speech Model PersonalizationInterspeech (Interspeech), 2022
Fadi Biadsy
Youzheng Chen
Xia Zhang
Oleg Rybakov
Andrew Rosenberg
Pedro J. Moreno
203
13
0
23 Mar 2022
Visual Prompt Tuning
Visual Prompt TuningEuropean Conference on Computer Vision (ECCV), 2022
Menglin Jia
Luming Tang
Bor-Chun Chen
Claire Cardie
Serge Belongie
Bharath Hariharan
Ser-Nam Lim
VLMVPVLM
606
2,237
0
23 Mar 2022
Hyperdecoders: Instance-specific decoders for multi-task NLP
Hyperdecoders: Instance-specific decoders for multi-task NLPConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Michal Guerquin
Matthew E. Peters
AI4CE
350
23
0
15 Mar 2022
Modular and Parameter-Efficient Multimodal Fusion with Prompting
Modular and Parameter-Efficient Multimodal Fusion with PromptingFindings (Findings), 2022
Sheng Liang
Mengjie Zhao
Hinrich Schütze
152
50
0
15 Mar 2022
Uncertainty Estimation for Language Reward Models
Uncertainty Estimation for Language Reward Models
Adam Gleave
G. Irving
UQLM
170
36
0
14 Mar 2022
CLIP Models are Few-shot Learners: Empirical Studies on VQA and Visual
  Entailment
CLIP Models are Few-shot Learners: Empirical Studies on VQA and Visual EntailmentAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Haoyu Song
Li Dong
Weinan Zhang
Ting Liu
Furu Wei
VLMCLIP
218
158
0
14 Mar 2022
Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for
  Pre-trained Language Models
Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models
Ning Ding
Yujia Qin
Guang Yang
Fu Wei
Zonghan Yang
...
Jianfei Chen
Yang Liu
Jie Tang
Juan Li
Maosong Sun
356
225
0
14 Mar 2022
Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models
Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models
Shengnan An
Yifei Li
Zeqi Lin
Qian Liu
Bei Chen
Qiang Fu
Weizhu Chen
Nanning Zheng
Jian-Guang Lou
VLMAAML
218
47
0
07 Mar 2022
Unfreeze with Care: Space-Efficient Fine-Tuning of Semantic Parsing
  Models
Unfreeze with Care: Space-Efficient Fine-Tuning of Semantic Parsing ModelsThe Web Conference (WWW), 2022
Weiqi Sun
Haidar Khan
Nicolas Guenon des Mesnards
M. Rubino
Konstantine Arkoudas
241
5
0
05 Mar 2022
Controlling the Focus of Pretrained Language Generation Models
Controlling the Focus of Pretrained Language Generation ModelsFindings (Findings), 2022
Jiabao Ji
Yoon Kim
James R. Glass
Tianxing He
269
5
0
02 Mar 2022
HyperPrompt: Prompt-based Task-Conditioning of Transformers
HyperPrompt: Prompt-based Task-Conditioning of TransformersInternational Conference on Machine Learning (ICML), 2022
Yun He
H. Zheng
Yi Tay
Jai Gupta
Yu Du
...
Yaguang Li
Zhaoji Chen
Donald Metzler
Heng-Tze Cheng
Ed H. Chi
LRMVLM
274
107
0
01 Mar 2022
SGPT: GPT Sentence Embeddings for Semantic Search
SGPT: GPT Sentence Embeddings for Semantic Search
Niklas Muennighoff
RALM
587
238
0
17 Feb 2022
Revisiting Parameter-Efficient Tuning: Are We Really There Yet?
Revisiting Parameter-Efficient Tuning: Are We Really There Yet?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Guanzheng Chen
Fangyu Liu
Zaiqiao Meng
Shangsong Liang
250
111
0
16 Feb 2022
EdgeFormer: A Parameter-Efficient Transformer for On-Device Seq2seq
  Generation
EdgeFormer: A Parameter-Efficient Transformer for On-Device Seq2seq GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Tao Ge
Si-Qing Chen
Furu Wei
MoE
306
28
0
16 Feb 2022
Neighborhood Contrastive Learning for Scientific Document
  Representations with Citation Embeddings
Neighborhood Contrastive Learning for Scientific Document Representations with Citation EmbeddingsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Malte Ostendorff
Nils Rethmeier
Isabelle Augenstein
Bela Gipp
Georg Rehm
403
102
0
14 Feb 2022
Context-Tuning: Learning Contextualized Prompts for Natural Language
  Generation
Context-Tuning: Learning Contextualized Prompts for Natural Language GenerationInternational Conference on Computational Linguistics (COLING), 2022
Tianyi Tang
Junyi Li
Wayne Xin Zhao
Ji-Rong Wen
235
18
0
21 Jan 2022
Efficient Hierarchical Domain Adaptation for Pretrained Language Models
Efficient Hierarchical Domain Adaptation for Pretrained Language Models
Alexandra Chronopoulou
Matthew E. Peters
Jesse Dodge
211
45
0
16 Dec 2021
VL-Adapter: Parameter-Efficient Transfer Learning for
  Vision-and-Language Tasks
VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks
Yi-Lin Sung
Jaemin Cho
Joey Tianyi Zhou
VLMVPVLM
344
433
0
13 Dec 2021
Intrinisic Gradient Compression for Federated Learning
Intrinisic Gradient Compression for Federated Learning
Luke Melas-Kyriazi
Franklyn Wang
FedML
94
4
0
05 Dec 2021
Emojich -- zero-shot emoji generation using Russian language: a
  technical report
Emojich -- zero-shot emoji generation using Russian language: a technical report
Alex Shonenkov
Daria Bakshandaeva
Denis Dimitrov
Aleks D. Nikolich
VLM
209
5
0
04 Dec 2021
Training Neural Networks with Fixed Sparse Masks
Training Neural Networks with Fixed Sparse Masks
Yi-Lin Sung
Varun Nair
Colin Raffel
FedML
395
258
0
18 Nov 2021
Previous
123...181920
Next