Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2106.09685
Cited By
v1
v2 (latest)
LoRA: Low-Rank Adaptation of Large Language Models
International Conference on Learning Representations (ICLR), 2021
17 June 2021
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL
AI4TS
AI4CE
ALM
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (49 upvotes)
Github (11998★)
Papers citing
"LoRA: Low-Rank Adaptation of Large Language Models"
50 / 8,602 papers shown
Title
BBTv2: Towards a Gradient-Free Future with Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Tianxiang Sun
Zhengfu He
Hong Qian
Yunhua Zhou
Xuanjing Huang
Xipeng Qiu
266
70
0
23 May 2022
Parameter-Efficient Sparsity for Large Language Models Fine-Tuning
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Yuchao Li
Fuli Luo
Chuanqi Tan
Mengdi Wang
Songfang Huang
Shen Li
Junjie Bai
MQ
159
37
0
23 May 2022
A Unified and Biologically-Plausible Relational Graph Representation of Vision Transformers
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Yuzhong Chen
Yu Du
Zhe Xiao
Lin Zhao
Lu Zhang
...
Dajiang Zhu
Tuo Zhang
Xiaoyan Cai
Tianming Liu
Xi Jiang
ViT
187
6
0
20 May 2022
AdaVAE: Exploring Adaptive GPT-2s in Variational Auto-Encoders for Language Modeling
Haoqin Tu
Zhongliang Yang
Jinshuai Yang
Yong Huang
192
13
0
12 May 2022
Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning
Neural Information Processing Systems (NeurIPS), 2022
Haokun Liu
Derek Tam
Mohammed Muqeeth
Jay Mohta
Tenghao Huang
Joey Tianyi Zhou
Colin Raffel
449
1,148
0
11 May 2022
Empowering parameter-efficient transfer learning by recognizing the kernel structure in self-attention
Yifan Chen
Devamanyu Hazarika
Mahdi Namazifar
Yang Liu
Di Jin
Dilek Z. Hakkani-Tür
117
8
0
07 May 2022
Engineering flexible machine learning systems by traversing functionally-invariant paths
G. Raghavan
Bahey Tharwat
S. N. Hari
Dhruvil Satani
Matt Thomson
OOD
AI4CE
443
13
0
30 Apr 2022
AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NLP Tasks
Chin-Lun Fu
Zih-Ching Chen
Yun-Ru Lee
Hung-yi Lee
166
52
0
30 Apr 2022
Building a Role Specified Open-Domain Dialogue System Leveraging Large-Scale Language Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Sanghwan Bae
Donghyun Kwak
Sungdong Kim
Dong-hyun Ham
Soyoung Kang
Sang-Woo Lee
W. Park
ALM
265
42
0
30 Apr 2022
Prompt Consistency for Zero-Shot Task Generalization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Chunting Zhou
Junxian He
Xuezhe Ma
Taylor Berg-Kirkpatrick
Graham Neubig
VLM
353
86
0
29 Apr 2022
TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Joel Jang
Seonghyeon Ye
Changho Lee
Sohee Yang
Joongbo Shin
Janghoon Han
Gyeonghun Kim
Minjoon Seo
CLL
KELM
409
114
0
29 Apr 2022
On the Effect of Pretraining Corpora on In-context Learning by a Large-scale Language Model
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Seongjin Shin
Sang-Woo Lee
Hwijeen Ahn
Sungdong Kim
Hyoungseok Kim
...
Dong Wang
Gichang Lee
W. Park
Jung-Woo Ha
Nako Sung
LRM
295
105
0
28 Apr 2022
Plug-and-Play Adaptation for Continuously-updated QA
Findings (Findings), 2022
Kyungjae Lee
Wookje Han
Seung-won Hwang
Hwaran Lee
Joonsuk Park
Sang-Woo Lee
KELM
195
22
0
27 Apr 2022
Standing on the Shoulders of Giant Frozen Language Models
Yoav Levine
Itay Dalmedigos
Ori Ram
Yoel Zeldes
Daniel Jannai
...
Barak Lenz
Shai Shalev-Shwartz
Amnon Shashua
Kevin Leyton-Brown
Y. Shoham
VLM
216
52
0
21 Apr 2022
A Contrastive Cross-Channel Data Augmentation Framework for Aspect-based Sentiment Analysis
International Conference on Computational Linguistics (COLING), 2022
Bing Wang
Liang Ding
Qihuang Zhong
Ximing Li
Dacheng Tao
168
40
0
16 Apr 2022
Impossible Triangle: What's Next for Pre-trained Language Models?
Chenguang Zhu
Michael Zeng
142
2
0
13 Apr 2022
DualPrompt: Complementary Prompting for Rehearsal-free Continual Learning
European Conference on Computer Vision (ECCV), 2022
Zifeng Wang
Zizhao Zhang
Sayna Ebrahimi
Ruoxi Sun
Han Zhang
...
Xiaoqi Ren
Guolong Su
Vincent Perot
Jennifer Dy
Tomas Pfister
CLL
VLM
VPVLM
330
671
0
10 Apr 2022
Rockafellian Relaxation and Stochastic Optimization under Perturbations
Mathematics of Operations Research (MOR), 2022
Pratiksha Agrawal
Louis L. Chen
Eric Eckstrand
225
8
0
10 Apr 2022
Parameter-Efficient Neural Reranking for Cross-Lingual and Multilingual Retrieval
International Conference on Computational Linguistics (COLING), 2022
Robert Litschko
Ivan Vulić
Goran Glavaš
LRM
358
18
0
05 Apr 2022
Parameter-efficient Model Adaptation for Vision Transformers
AAAI Conference on Artificial Intelligence (AAAI), 2022
Xuehai He
Chunyuan Li
Pengchuan Zhang
Jianwei Yang
Xinze Wang
154
104
0
29 Mar 2022
A Scalable Model Specialization Framework for Training and Inference using Submodels and its Application to Speech Model Personalization
Interspeech (Interspeech), 2022
Fadi Biadsy
Youzheng Chen
Xia Zhang
Oleg Rybakov
Andrew Rosenberg
Pedro J. Moreno
203
13
0
23 Mar 2022
Visual Prompt Tuning
European Conference on Computer Vision (ECCV), 2022
Menglin Jia
Luming Tang
Bor-Chun Chen
Claire Cardie
Serge Belongie
Bharath Hariharan
Ser-Nam Lim
VLM
VPVLM
574
2,211
0
23 Mar 2022
Hyperdecoders: Instance-specific decoders for multi-task NLP
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Michal Guerquin
Matthew E. Peters
AI4CE
329
23
0
15 Mar 2022
Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models
Ning Ding
Yujia Qin
Guang Yang
Fu Wei
Zonghan Yang
...
Jianfei Chen
Yang Liu
Jie Tang
Juan Li
Maosong Sun
322
225
0
14 Mar 2022
Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models
Shengnan An
Yifei Li
Zeqi Lin
Qian Liu
Bei Chen
Qiang Fu
Weizhu Chen
Nanning Zheng
Jian-Guang Lou
VLM
AAML
213
47
0
07 Mar 2022
Combining Modular Skills in Multitask Learning
Edoardo Ponti
Alessandro Sordoni
Yoshua Bengio
Siva Reddy
MoE
291
42
0
28 Feb 2022
SGPT: GPT Sentence Embeddings for Semantic Search
Niklas Muennighoff
RALM
550
235
0
17 Feb 2022
Revisiting Parameter-Efficient Tuning: Are We Really There Yet?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Guanzheng Chen
Fangyu Liu
Zaiqiao Meng
Shangsong Liang
231
109
0
16 Feb 2022
EdgeFormer: A Parameter-Efficient Transformer for On-Device Seq2seq Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Tao Ge
Si-Qing Chen
Furu Wei
MoE
288
28
0
16 Feb 2022
CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks
Zhecan Wang
Noel Codella
Yen-Chun Chen
Luowei Zhou
Jianwei Yang
Xiyang Dai
Bin Xiao
Haoxuan You
Shih-Fu Chang
Lu Yuan
CLIP
VLM
193
44
0
15 Jan 2022
Black-Box Tuning for Language-Model-as-a-Service
International Conference on Machine Learning (ICML), 2022
Tianxiang Sun
Yunfan Shao
Hong Qian
Xuanjing Huang
Xipeng Qiu
VLM
390
319
0
10 Jan 2022
Latency Adjustable Transformer Encoder for Language Understanding
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Sajjad Kachuee
M. Sharifkhani
468
1
0
10 Jan 2022
Efficient Hierarchical Domain Adaptation for Pretrained Language Models
Alexandra Chronopoulou
Matthew E. Peters
Jesse Dodge
161
45
0
16 Dec 2021
Learning to Prompt for Continual Learning
Zifeng Wang
Zizhao Zhang
Chen-Yu Lee
Han Zhang
Ruoxi Sun
Xiaoqi Ren
Guolong Su
Vincent Perot
Jennifer Dy
Tomas Pfister
CLL
VPVLM
KELM
VLM
347
1,048
0
16 Dec 2021
Training Multi-Layer Over-Parametrized Neural Network in Subquadratic Time
Zhao Song
Licheng Zhang
Ruizhe Zhang
324
69
0
14 Dec 2021
VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks
Yi-Lin Sung
Jaemin Cho
Joey Tianyi Zhou
VLM
VPVLM
311
432
0
13 Dec 2021
Pruning Pretrained Encoders with a Multitask Objective
Patrick Xia
Richard Shin
127
0
0
10 Dec 2021
MAGMA -- Multimodal Augmentation of Generative Models through Adapter-based Finetuning
C. Eichenberg
Sid Black
Samuel Weinbach
Letitia Parcalabescu
Anette Frank
MLLM
VLM
243
108
0
09 Dec 2021
Improving Differentially Private SGD via Randomly Sparsified Gradients
Junyi Zhu
Matthew B. Blaschko
379
7
0
01 Dec 2021
OpenPrompt: An Open-source Framework for Prompt-learning
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Ning Ding
Shengding Hu
Weilin Zhao
Yulin Chen
Zhiyuan Liu
Haitao Zheng
Maosong Sun
VLM
LLMAG
237
327
0
03 Nov 2021
DSEE: Dually Sparsity-embedded Efficient Tuning of Pre-trained Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Xuxi Chen
Tianlong Chen
Weizhu Chen
Ahmed Hassan Awadallah
Zinan Lin
Yu Cheng
MoE
ALM
255
10
0
30 Oct 2021
Semi-Siamese Bi-encoder Neural Ranking Model Using Lightweight Fine-Tuning
The Web Conference (WWW), 2021
Euna Jung
Jaekeol Choi
Wonjong Rhee
127
15
0
28 Oct 2021
Fast Model Editing at Scale
International Conference on Learning Representations (ICLR), 2021
E. Mitchell
Charles Lin
Antoine Bosselut
Chelsea Finn
Christopher D. Manning
KELM
941
457
0
21 Oct 2021
Control Prefixes for Parameter-Efficient Text Generation
Jordan Clive
Kris Cao
Marek Rei
243
34
0
15 Oct 2021
SPoT: Better Frozen Model Adaptation through Soft Prompt Transfer
Tu Vu
Brian Lester
Noah Constant
Rami Al-Rfou
Daniel Cer
VLM
LRM
434
313
0
15 Oct 2021
Exploring Universal Intrinsic Task Subspace via Prompt Tuning
Yujia Qin
Xiaozhi Wang
Yusheng Su
Yankai Lin
Ning Ding
...
Juanzi Li
Lei Hou
Peng Li
Maosong Sun
Jie Zhou
VLM
VPVLM
297
31
0
15 Oct 2021
UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning
Yuning Mao
Lambert Mathias
Rui Hou
Amjad Almahairi
Hao Ma
Jiawei Han
Anuj Kumar
Madian Khabsa
236
212
0
14 Oct 2021
Differentially Private Fine-tuning of Language Models
Da Yu
Saurabh Naik
A. Backurs
Sivakanth Gopi
Huseyin A. Inan
...
Y. Lee
Andre Manoel
Lukas Wutschitz
Sergey Yekhanin
Huishuai Zhang
542
442
0
13 Oct 2021
Towards a Unified View of Parameter-Efficient Transfer Learning
International Conference on Learning Representations (ICLR), 2021
Junxian He
Chunting Zhou
Xuezhe Ma
Taylor Berg-Kirkpatrick
Graham Neubig
AAML
551
1,088
0
08 Oct 2021
Towards Continual Knowledge Learning of Language Models
Joel Jang
Seonghyeon Ye
Sohee Yang
Joongbo Shin
Janghoon Han
Gyeonghun Kim
Stanley Jungkyu Choi
Minjoon Seo
CLL
KELM
571
183
0
07 Oct 2021
Previous
1
2
3
...
171
172
173
Next