v1v2 (latest)

LoRA: Low-Rank Adaptation of Large Language Models

International Conference on Learning Representations (ICLR), 2021

17 June 2021

ArXiv (abs)PDF HTML HuggingFace (49 upvotes)Github (11998★)

Papers citing "LoRA: Low-Rank Adaptation of Large Language Models"

50 / 8,605 papers shown

ClaimDiff: Comparing and Contrasting Claims on Contentious IssuesAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

217

24 May 2022

Representation Projection Invariance Mitigates Representation CollapseConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Anastasia Razdaibiedina

Daniel Khashabi

252

23 May 2022

When does Parameter-Efficient Transfer Learning Work for Machine Translation?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Ahmet Üstün

Asa Cooper Stickland

179

23 May 2022

BBTv2: Towards a Gradient-Free Future with Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Tianxiang Sun

Zhengfu He

Hong Qian

Yunhua Zhou

Xuanjing Huang

Xipeng Qiu

275

23 May 2022

Parameter-Efficient Sparsity for Large Language Models Fine-TuningInternational Joint Conference on Artificial Intelligence (IJCAI), 2022

Yuchao Li

Fuli Luo

Chuanqi Tan

Mengdi Wang

Songfang Huang

Shen Li

Junjie Bai

163

23 May 2022

A Unified and Biologically-Plausible Relational Graph Representation of Vision TransformersIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022

Lu Zhang

...

Dajiang Zhu

Tuo Zhang

Xiaoyan Cai

Tianming Liu

Xi Jiang

ViT

203

20 May 2022

AdaVAE: Exploring Adaptive GPT-2s in Variational Auto-Encoders for Language Modeling

200

12 May 2022

Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context LearningNeural Information Processing Systems (NeurIPS), 2022

453

1,163

11 May 2022

Empowering parameter-efficient transfer learning by recognizing the kernel structure in self-attention

Yang Liu

117

07 May 2022

Engineering flexible machine learning systems by traversing functionally-invariant paths

456

30 Apr 2022

AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NLP Tasks

173

30 Apr 2022

Building a Role Specified Open-Domain Dialogue System Leveraging Large-Scale Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022

293

30 Apr 2022

Prompt Consistency for Zero-Shot Task GeneralizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Chunting Zhou

Junxian He

Xuezhe Ma

Taylor Berg-Kirkpatrick

Graham Neubig

VLM

358

29 Apr 2022

TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

421

115

29 Apr 2022

On the Effect of Pretraining Corpora on In-context Learning by a Large-scale Language ModelNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022

...

308

105

28 Apr 2022

Plug-and-Play Adaptation for Continuously-updated QAFindings (Findings), 2022

204

27 Apr 2022

Standing on the Shoulders of Giant Frozen Language Models

...

228

21 Apr 2022

A Contrastive Cross-Channel Data Augmentation Framework for Aspect-based Sentiment AnalysisInternational Conference on Computational Linguistics (COLING), 2022

Liang Ding

184

16 Apr 2022

Impossible Triangle: What's Next for Pre-trained Language Models?

Chenguang Zhu

Michael Zeng

142

13 Apr 2022

DualPrompt: Complementary Prompting for Rehearsal-free Continual LearningEuropean Conference on Computer Vision (ECCV), 2022

...

Jennifer Dy

Tomas Pfister

CLL VLM VPVLM

330

676

10 Apr 2022

Rockafellian Relaxation and Stochastic Optimization under PerturbationsMathematics of Operations Research (MOR), 2022

Pratiksha Agrawal

Louis L. Chen

Eric Eckstrand

225

10 Apr 2022

Parameter-Efficient Neural Reranking for Cross-Lingual and Multilingual RetrievalInternational Conference on Computational Linguistics (COLING), 2022

390

05 Apr 2022

Parameter-efficient Model Adaptation for Vision TransformersAAAI Conference on Artificial Intelligence (AAAI), 2022

Jianwei Yang

163

104

29 Mar 2022

A Scalable Model Specialization Framework for Training and Inference using Submodels and its Application to Speech Model PersonalizationInterspeech (Interspeech), 2022

Andrew Rosenberg

203

23 Mar 2022

Visual Prompt TuningEuropean Conference on Computer Vision (ECCV), 2022

Ser-Nam Lim

599

2,237

23 Mar 2022

Hyperdecoders: Instance-specific decoders for multi-task NLPConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Michal Guerquin

Matthew E. Peters

AI4CE

349

15 Mar 2022

Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models

...

Jianfei Chen

Yang Liu

Jie Tang

Juan Li

Maosong Sun

350

225

14 Mar 2022

Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models

217

07 Mar 2022

Combining Modular Skills in Multitask Learning

Siva Reddy

295

28 Feb 2022

SGPT: GPT Sentence Embeddings for Semantic Search

Niklas Muennighoff

RALM

586

238

17 Feb 2022

Revisiting Parameter-Efficient Tuning: Are We Really There Yet?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

247

111

16 Feb 2022

EdgeFormer: A Parameter-Efficient Transformer for On-Device Seq2seq GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

303

16 Feb 2022

CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks

Jianwei Yang

Lu Yuan

209

15 Jan 2022

Black-Box Tuning for Language-Model-as-a-ServiceInternational Conference on Machine Learning (ICML), 2022

Tianxiang Sun

Yunfan Shao

Hong Qian

Xuanjing Huang

Xipeng Qiu

VLM

395

321

10 Jan 2022

Latency Adjustable Transformer Encoder for Language UnderstandingIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022

Sajjad Kachuee

M. Sharifkhani

564

10 Jan 2022

Efficient Hierarchical Domain Adaptation for Pretrained Language Models

Alexandra Chronopoulou

Matthew E. Peters

Jesse Dodge

206

16 Dec 2021

Learning to Prompt for Continual Learning

Chen-Yu Lee

Jennifer Dy

Tomas Pfister

CLL VPVLM KELM VLM

353

1,058

16 Dec 2021

Training Multi-Layer Over-Parametrized Neural Network in Subquadratic Time

Zhao Song

Licheng Zhang

Ruizhe Zhang

358

14 Dec 2021

VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks

338

433

13 Dec 2021

Pruning Pretrained Encoders with a Multitask Objective

Patrick Xia

Richard Shin

127

10 Dec 2021

MAGMA -- Multimodal Augmentation of Generative Models through Adapter-based Finetuning

259

109

09 Dec 2021

Improving Differentially Private SGD via Randomly Sparsified Gradients

Junyi Zhu

Matthew B. Blaschko

398

01 Dec 2021

OpenPrompt: An Open-source Framework for Prompt-learningAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

Weilin Zhao

Zhiyuan Liu

Maosong Sun

248

329

03 Nov 2021

DSEE: Dually Sparsity-embedded Efficient Tuning of Pre-trained Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

Xuxi Chen

Tianlong Chen

Weizhu Chen

Ahmed Hassan Awadallah

Zinan Lin

Yu Cheng

MoE ALM

279

30 Oct 2021

Semi-Siamese Bi-encoder Neural Ranking Model Using Lightweight Fine-TuningThe Web Conference (WWW), 2021

Euna Jung

Jaekeol Choi

Wonjong Rhee

143

28 Oct 2021

Fast Model Editing at ScaleInternational Conference on Learning Representations (ICLR), 2021

Christopher D. Manning

KELM

989

459

21 Oct 2021

Control Prefixes for Parameter-Efficient Text Generation

Jordan Clive

Kris Cao

Marek Rei

267

15 Oct 2021

SPoT: Better Frozen Model Adaptation through Soft Prompt Transfer

461

314

15 Oct 2021

Exploring Universal Intrinsic Task Subspace via Prompt Tuning

Yujia Qin

Xiaozhi Wang

Yusheng Su

Yankai Lin

Ning Ding

...

Juanzi Li

Lei Hou

Peng Li

Maosong Sun

Jie Zhou

VLM VPVLM

313

15 Oct 2021

UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning

Hao Ma

Madian Khabsa

276

215

14 Oct 2021