v1v2 (latest)

LoRA: Low-Rank Adaptation of Large Language Models

International Conference on Learning Representations (ICLR), 2021

17 June 2021

ArXiv (abs)PDF HTML HuggingFace (49 upvotes)Github (11998★)

Papers citing "LoRA: Low-Rank Adaptation of Large Language Models"

50 / 8,614 papers shown

On the Effectiveness of Parameter-Efficient Fine-TuningAAAI Conference on Artificial Intelligence (AAAI), 2022

Haoran Yang

217

206

28 Nov 2022

A Comprehensive Survey on Enterprise Financial Risk Analysis from Big Data Perspective

553

28 Nov 2022

Self-Destructing Models: Increasing the Costs of Harmful Dual Uses of Foundation ModelsAAAI/ACM Conference on AI, Ethics, and Society (AIES), 2022

Peter Henderson

E. Mitchell

Christopher D. Manning

Dan Jurafsky

Chelsea Finn

241

27 Nov 2022

MAEDAY: MAE for few and zero shot AnomalY-DetectionComputer Vision and Image Understanding (CVIU), 2022

217

25 Nov 2022

HyperTuning: Toward Adapting Large Language Models without Back-propagationInternational Conference on Machine Learning (ICML), 2022

231

22 Nov 2022

Linear Interpolation In Parameter Space is Good Enough for Fine-Tuned Language Models

163

22 Nov 2022

Teaching Structured Vision&Language Concepts to Vision&Language ModelsComputer Vision and Pattern Recognition (CVPR), 2022

...

342

21 Nov 2022

Multitask Vision-Language Prompt TuningIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

Tianjun Zhang

288

21 Nov 2022

AF Adapter: Continual Pretraining for Building Chinese Biomedical Language ModelIEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2022

178

21 Nov 2022

Aging with GRACE: Lifelong Model Editing with Discrete Key-Value AdaptorsNeural Information Processing Systems (NeurIPS), 2022

642

237

20 Nov 2022

ConStruct-VL: Data-Free Continual Structured VL Concepts LearningComputer Vision and Pattern Recognition (CVPR), 2022

James Smith

Paola Cascante-Bonilla

Diyi Yang

289

17 Nov 2022

Structured Pruning AdaptersPattern Recognition (Pattern Recogn.), 2022

281

17 Nov 2022

CSCD-NS: a Chinese Spelling Check Dataset for Native SpeakersAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Yong Hu

Fandong Meng

Jie Zhou

269

16 Nov 2022

FedTune: A Deep Dive into Efficient Federated Fine-Tuning with Pre-trained Transformers

192

15 Nov 2022

Controllable Citation Sentence Generation with Language Models

Nianlong Gu

Richard H. R. Hahnloser

152

14 Nov 2022

Large Language Models Meet Harry Potter: A Bilingual Dataset for Aligning Dialogue Agents with Characters

328

13 Nov 2022

FPT: Improving Prompt Tuning Efficiency via Progressive TrainingConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Maosong Sun

Zhiyuan Liu

Qun Liu

VLM LRM

145

13 Nov 2022

Large-Scale Bidirectional Training for Zero-Shot Image Captioning

220

13 Nov 2022

One-Time Model Adaptation to Heterogeneous Clients: An Intra-Client and Inter-Image Attention Design

164

11 Nov 2022

Multi-Head Adapter Routing for Cross-Task GeneralizationNeural Information Processing Systems (NeurIPS), 2022

Nicolas Le Roux

142

07 Nov 2022

Motion Style Transfer: Modular Low-Rank Adaptation for Deep Motion ForecastingConference on Robot Learning (CoRL), 2022

Alexandre Alahi

262

06 Nov 2022

On the Domain Adaptation and Generalization of Pretrained Language Models: A Survey

Xu Guo

Han Yu

LM&MA VLM

308

06 Nov 2022

Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature FusionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

144

04 Nov 2022

Could Giant Pretrained Image Models Extract Universal Representations?Neural Information Processing Systems (NeurIPS), 2022

180

03 Nov 2022

Two-stage LLM Fine-tuning with Less Specialization and More GeneralizationInternational Conference on Learning Representations (ICLR), 2022

Inderjit S Dhillon

Sanjiv Kumar

336

01 Nov 2022

Adapter-Based Extension of Multi-Speaker Text-to-Speech Model for New SpeakersInterspeech (Interspeech), 2022

Cheng-Ping Hsieh

Subhankar Ghosh

Boris Ginsburg

233

01 Nov 2022

AdaMix: Mixture-of-Adaptations for Parameter-efficient Model TuningConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Yaqing Wang

Sahaj Agarwal

Subhabrata Mukherjee

Xiaodong Liu

Jing Gao

Ahmed Hassan Awadallah

Jianfeng Gao

MoE

308

170

31 Oct 2022

GPS: Genetic Prompt Search for Efficient Few-shot LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

147

31 Oct 2022

Parameter-Efficient Tuning Makes a Good Classification HeadConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

265

30 Oct 2022

Differentiable Data Augmentation for Contrastive Sentence Representation LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Tianduo Wang

Wei Lu

SSL

135

29 Oct 2022

Inducer-tuning: Connecting Prefix-tuning and Adapter-tuningConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Yang Liu

159

26 Oct 2022

Learning Better Intent Representations for Financial Open Intent Classification

145

25 Oct 2022

Evaluating Parameter Efficient Learning for GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

166

25 Oct 2022

Different Tunes Played with Equal Skill: Exploring a Unified Optimization Subspace for Delta Tuning

Yankai Lin

Xu Han

Zhiyuan Liu

Maosong Sun

Jie Zhou

274

24 Oct 2022

NVIDIA FLARE: Federated Learning from Simulation to Real-WorldIEEE Data Engineering Bulletin (DEB), 2022

Ziyue Xu

...

306

139

24 Oct 2022

Exploring The Landscape of Distributional Robustness for Question Answering ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

228

22 Oct 2022

Clip-Tuning: Towards Derivative-free Prompt Learning with a Mixture of RewardsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

264

21 Oct 2022

Efficiently Tuned Parameters are Task EmbeddingsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Wangchunshu Zhou

Canwen Xu

Julian McAuley

155

21 Oct 2022

Tele-Knowledge Pre-training for Fault AnalysisIEEE International Conference on Data Engineering (ICDE), 2022

...

290

20 Oct 2022

Late Prompt Tuning: A Late Prompt Could Be Better Than Many PromptsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Xiangyang Liu

Tianxiang Sun

Xuanjing Huang

Xipeng Qiu

VLM

228

20 Oct 2022

Continued Pretraining for Better Zero- and Few-Shot PromptabilityConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Zhaofeng Wu

IV RobertL.Logan

Pete Walsh

Akshita Bhagia

Dirk Groeneveld

Sameer Singh

Iz Beltagy

VLM

230

19 Oct 2022

Hidden State Variability of Pretrained Language Models Can Guide Computation Reduction for Transfer LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Qing Qu

235

18 Oct 2022

Tiny-Attention Adapter: Contexts Are More Important Than the Number of ParametersConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

191

18 Oct 2022

Domain Specific Sub-network for Multi-Domain Neural Machine Translation

149

18 Oct 2022

Scaling & Shifting Your Features: A New Baseline for Efficient Model TuningNeural Information Processing Systems (NeurIPS), 2022

354

335

17 Oct 2022

Keep Me Updated! Memory Management in Long-term ConversationsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

306

17 Oct 2022

Accelerating Transfer Learning with Near-Data Computation on Cloud Object StoresACM Symposium on Cloud Computing (SoCC), 2022

224

16 Oct 2022

Prompt Conditioned VAE: Enhancing Generative Replay for Lifelong Learning in Task-Oriented DialogueConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

218

14 Oct 2022

Multitask Pre-training of Modular Prompt for Chinese Few-Shot LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Tianxiang Sun

Zhengfu He

Qinen Zhu

Xipeng Qiu

Xuanjing Huang

VLM VPVLM

201

14 Oct 2022

DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank AdaptationConference of the European Chapter of the Association for Computational Linguistics (EACL), 2022

417

240

14 Oct 2022