ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2309.14717
  4. Cited By
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models
v1v2 (latest)

QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models

International Conference on Learning Representations (ICLR), 2023
26 September 2023
Yuhui Xu
Lingxi Xie
Xiaotao Gu
Xin Chen
Heng Chang
Hengheng Zhang
Zhensu Chen
Xiaopeng Zhang
Qi Tian
    MQ
ArXiv (abs)PDFHTMLHuggingFace (44 upvotes)Github (136★)

Papers citing "QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models"

50 / 89 papers shown
Title
LoRAQuant: Mixed-Precision Quantization of LoRA to Ultra-Low Bits
LoRAQuant: Mixed-Precision Quantization of LoRA to Ultra-Low Bits
Amir Reza Mirzaei
Yuqiao Wen
Yanshuai Cao
Lili Mou
MQ
433
0
0
30 Oct 2025
FALQON: Accelerating LoRA Fine-tuning with Low-Bit Floating-Point Arithmetic
FALQON: Accelerating LoRA Fine-tuning with Low-Bit Floating-Point Arithmetic
Kanghyun Choi
Hyeyoon Lee
S. Park
Dain Kwon
Jinho Lee
MQ
148
0
0
28 Oct 2025
Latent Space Factorization in LoRA
Latent Space Factorization in LoRA
Shashi Kumar
Yacouba Kaloga
John Mitros
P. Motlícek
Ina Kodrasi
84
0
0
22 Oct 2025
Binary Quadratic Quantization: Beyond First-Order Quantization for Real-Valued Matrix Compression
Binary Quadratic Quantization: Beyond First-Order Quantization for Real-Valued Matrix Compression
Kyo Kuroki
Yasuyuki Okoshi
Thiem Van Chu
Kazushi Kawamura
Masato Motomura
MQ
168
0
0
21 Oct 2025
Efficient Resource-Constrained Training of Vision Transformers via Subspace Optimization
Efficient Resource-Constrained Training of Vision Transformers via Subspace Optimization
Le-Trung Nguyen
Enzo Tartaglione
Van-Tam Nguyen
124
0
0
10 Oct 2025
Referring Expression Comprehension for Small Objects
Referring Expression Comprehension for Small Objects
Kanoko Goto
Takumi Hirose
Mahiro Ukai
Shuhei Kurita
Nakamasa Inoue
ObjD
123
1
0
04 Oct 2025
On-the-Fly Adaptation to Quantization: Configuration-Aware LoRA for Efficient Fine-Tuning of Quantized LLMs
On-the-Fly Adaptation to Quantization: Configuration-Aware LoRA for Efficient Fine-Tuning of Quantized LLMs
Rongguang Ye
Ming Tang
Edith C. H. Ngai
MQ
44
0
0
22 Sep 2025
Quantized Large Language Models in Biomedical Natural Language Processing: Evaluation and Recommendation
Quantized Large Language Models in Biomedical Natural Language Processing: Evaluation and Recommendation
Zaifu Zhan
Shuang Zhou
Min Zeng
Kai Yu
Meijia Song
Xiaoyi Chen
Jun Wang
Yu Hou
Rui Zhang
MQ
164
0
0
04 Sep 2025
SSVD: Structured SVD for Parameter-Efficient Fine-Tuning and Benchmarking under Domain Shift in ASR
SSVD: Structured SVD for Parameter-Efficient Fine-Tuning and Benchmarking under Domain Shift in ASR
Pu Wang
Shinji Watanabe
Hugo Van hamme
116
0
0
02 Sep 2025
LOST: Low-rank and Sparse Pre-training for Large Language Models
LOST: Low-rank and Sparse Pre-training for Large Language Models
Jiaxi Li
Lu Yin
Li Shen
Jinjin Xu
Liwu Xu
Tianjin Huang
Wenwu Wang
Shiwei Liu
Xilu Wang
144
2
0
04 Aug 2025
Convergence Analysis of Aggregation-Broadcast in LoRA-enabled Distributed Fine-Tuning
Convergence Analysis of Aggregation-Broadcast in LoRA-enabled Distributed Fine-Tuning
Xin Chen
Shuaijun Chen
Omid Tavallaie
Nguyen H. Tran
Shuhuang Xiang
Albert Y. Zomaya
212
0
0
02 Aug 2025
Pay Attention to Small Weights
Pay Attention to Small Weights
Chao Zhou
Tom Jacobs
Advait Gadhikar
R. Burkholz
142
0
0
26 Jun 2025
Adapting Vision-Language Models for Evaluating World Models
Adapting Vision-Language Models for Evaluating World Models
Mariya Hendriksen
Tabish Rashid
David Bignell
Raluca Georgescu
Abdelhak Lemkhenter
Katja Hofmann
Sam Devlin
Sarah Parisot
165
0
0
22 Jun 2025
Improving LoRA with Variational Learning
Improving LoRA with Variational Learning
Bai Cong
Nico Daheim
Yuesong Shen
Rio Yokota
Mohammad Emtiyaz Khan
Thomas Möllenhoff
191
1
0
17 Jun 2025
Dynamic Context-oriented Decomposition for Task-aware Low-rank Adaptation with Less Forgetting and Faster Convergence
Dynamic Context-oriented Decomposition for Task-aware Low-rank Adaptation with Less Forgetting and Faster Convergence
Jianlong Wu
Sihao Liu
Chuan Rao
Bang An
Tiancheng Shen
Juil Sock
Ming-Hsuan Yang
Bernard Ghanem
200
4
0
16 Jun 2025
DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents
DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents
Hao Li
Xiaogeng Liu
Hung-Chun Chiu
Dianqi Li
Ning Zhang
Chaowei Xiao
AAML
275
5
0
13 Jun 2025
SLICK: Selective Localization and Instance Calibration for Knowledge-Enhanced Car Damage Segmentation in Automotive Insurance
SLICK: Selective Localization and Instance Calibration for Knowledge-Enhanced Car Damage Segmentation in Automotive Insurance
Teerapong Panboonyuen
288
0
0
12 Jun 2025
FPTQuant: Function-Preserving Transforms for LLM Quantization
Boris van Breugel
Yelysei Bondarenko
Paul N. Whatmough
Markus Nagel
MQ
242
3
0
05 Jun 2025
Accurate Sublayer Pruning for Large Language Models by Exploiting Latency and Tunability Information
Accurate Sublayer Pruning for Large Language Models by Exploiting Latency and Tunability InformationInternational Joint Conference on Artificial Intelligence (IJCAI), 2025
Seungcheol Park
Sojin Lee
Jongjin Kim
Jinsik Lee
Hyunjik Jo
U. Kang
227
3
0
04 Jun 2025
PoLAR: Polar-Decomposed Low-Rank Adapter Representation
PoLAR: Polar-Decomposed Low-Rank Adapter Representation
Kai Lion
Liang Zhang
Bingcong Li
Niao He
216
3
0
03 Jun 2025
Leave it to the Specialist: Repair Sparse LLMs with Sparse Fine-Tuning via Sparsity Evolution
Leave it to the Specialist: Repair Sparse LLMs with Sparse Fine-Tuning via Sparsity Evolution
Q. Xiao
Alan Ansell
Boqian Wu
Lu Yin
Mykola Pechenizkiy
Shiwei Liu
Decebal Constantin Mocanu
217
2
0
29 May 2025
SineLoRA$Δ$: Sine-Activated Delta Compression
SineLoRAΔΔΔ: Sine-Activated Delta Compression
Cameron Gordon
Yiping Ji
Hemanth Saratchandran
Paul Albert
Simon Lucey
MQ
296
0
0
28 May 2025
Evaluating Text Creativity across Diverse Domains: A Dataset and Large Language Model Evaluator
Evaluating Text Creativity across Diverse Domains: A Dataset and Large Language Model Evaluator
Qian Cao
Xiting Wang
Yuzhuo Yuan
Yahui Liu
Fang Luo
Ruihua Song
161
0
0
25 May 2025
LoTA-QAF: Lossless Ternary Adaptation for Quantization-Aware Fine-Tuning
LoTA-QAF: Lossless Ternary Adaptation for Quantization-Aware Fine-Tuning
Junyu Chen
Junzhuo Li
Zhen Peng
Wenjie Wang
Yuxiang Ren
Long Shi
Xuming Hu
MQ
213
0
0
24 May 2025
HOFT: Householder Orthogonal Fine-tuning
HOFT: Householder Orthogonal Fine-tuning
Alejandro Moreno Arcas
Albert Sanchis
Jorge Civera
Alfons Juan
248
0
0
22 May 2025
ABBA-Adapters: Efficient and Expressive Fine-Tuning of Foundation Models
ABBA-Adapters: Efficient and Expressive Fine-Tuning of Foundation Models
Raghav Singhal
Kaustubh Ponkshe
Rohit Vartak
Praneeth Vepakomma
485
1
0
20 May 2025
Federated Low-Rank Adaptation for Foundation Models: A Survey
Federated Low-Rank Adaptation for Foundation Models: A SurveyInternational Joint Conference on Artificial Intelligence (IJCAI), 2024
Yiyuan Yang
Guodong Long
Qinghua Lu
Liming Zhu
Jing Jiang
Chengqi Zhang
AI4CE
253
5
0
16 May 2025
EMRModel: A Large Language Model for Extracting Medical Consultation Dialogues into Structured Medical Records
EMRModel: A Large Language Model for Extracting Medical Consultation Dialogues into Structured Medical Records
Shuguang Zhao
Qiangzhong Feng
Zhiyang He
Peipei Sun
Yingying Wang
...
Xiaoliang Lu
Mei Cheng
Xinyue Wu
Yanyan Wang
Wei Liang
LM&MA
146
0
0
23 Apr 2025
A Survey of Foundation Model-Powered Recommender Systems: From Feature-Based, Generative to Agentic Paradigms
A Survey of Foundation Model-Powered Recommender Systems: From Feature-Based, Generative to Agentic Paradigms
Chengkai Huang
Hongtao Huang
Tong Yu
Kaige Xie
Junda Wu
Shuai Zhang
Julian McAuley
Dietmar Jannach
Lina Yao
LRMAI4CE
270
7
0
23 Apr 2025
LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation
LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation
Juzheng Zhang
Jiacheng You
Ashwinee Panda
Tom Goldstein
MoMe
331
10
0
10 Apr 2025
ObscuraCoder: Powering Efficient Code LM Pre-Training Via Obfuscation Grounding
ObscuraCoder: Powering Efficient Code LM Pre-Training Via Obfuscation GroundingInternational Conference on Learning Representations (ICLR), 2025
Indraneil Paul
Haoyi Yang
Goran Glavaš
Kristian Kersting
Iryna Gurevych
AAMLSyDa
238
3
0
27 Mar 2025
Q&C: When Quantization Meets Cache in Efficient Image Generation
Xin Ding
Xiaochen Li
Haotong Qin
Zhibo Chen
DiffMMQ
358
1
0
04 Mar 2025
PaCA: Partial Connection Adaptation for Efficient Fine-TuningInternational Conference on Learning Representations (ICLR), 2025
Sunghyeon Woo
Sol Namkung
Sunwoo Lee
Inho Jeong
Beomseok Kim
Dongsuk Jeon
326
3
0
28 Feb 2025
Evidence-Driven Marker Extraction for Social Media Suicide Risk Detection
Evidence-Driven Marker Extraction for Social Media Suicide Risk Detection
Carter Adams
Caleb Carter
Jackson Simmons
186
1
0
26 Feb 2025
GaLore$+$: Boosting Low-Rank Adaptation for LLMs with Cross-Head Projection
GaLore+++: Boosting Low-Rank Adaptation for LLMs with Cross-Head Projection
Xutao Liao
Shaohui Li
Yuhui Xu
Zhi Li
Zichen Liu
You He
VLM
247
7
0
31 Dec 2024
Deploying Foundation Model Powered Agent Services: A Survey
Deploying Foundation Model Powered Agent Services: A Survey
Wenchao Xu
Jinyu Chen
Peirong Zheng
Xiaoquan Yi
Tianyi Tian
...
Quan Wan
Yining Qi
Yunfeng Fan
Qinliang Su
Xuemin Shen
AI4CE
423
5
0
18 Dec 2024
FineGates: LLMs Finetuning with Compression using Stochastic Gates
FineGates: LLMs Finetuning with Compression using Stochastic Gates
Jonathan Svirsky
Yehonathan Refael
Ofir Lindenbaum
266
3
0
17 Dec 2024
PassionSR: Post-Training Quantization with Adaptive Scale in One-Step
  Diffusion based Image Super-Resolution
PassionSR: Post-Training Quantization with Adaptive Scale in One-Step Diffusion based Image Super-ResolutionComputer Vision and Pattern Recognition (CVPR), 2024
Libo Zhu
Jiajian Li
Haotong Qin
Wenbo Li
Yulun Zhang
Yong Guo
Yunbo Wang
DiffMMQ
317
7
0
26 Nov 2024
IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models
IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models
Hang Guo
Yawei Li
Tao Dai
Shu-Tao Xia
Luca Benini
MQ
297
5
0
29 Oct 2024
EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation
EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation
Shih-yang Liu
Huck Yang
Nai Chit Fung
Charbel Sakr
Hongxu Yin
...
Jan Kautz
Yu-Chun Wang
Pavlo Molchanov
Min-Hung Chen
Min-Hung Chen
MQ
434
0
0
28 Oct 2024
Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies
Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies
Liwen Wang
Sheng Chen
Linnan Jiang
Shu Pan
Runze Cai
Sen Yang
Fei Yang
457
14
0
24 Oct 2024
Dynamic Adaptive Rank Space Exploration for Efficient Sentiment Analysis with Large Language Models
Dynamic Adaptive Rank Space Exploration for Efficient Sentiment Analysis with Large Language Models
Hongcheng Ding
Fuzhen Hu
Xuanze Zhao
Zixiao Jiang
Shamsul Nahar Abdullah
Deshinta Arrova Dewi
184
0
0
22 Oct 2024
Channel-Wise Mixed-Precision Quantization for Large Language Models
Channel-Wise Mixed-Precision Quantization for Large Language Models
Zihan Chen
Bike Xie
Jundong Li
Cong Shen
MQ
425
6
0
16 Oct 2024
LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models
LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models
Hossein Abdi
Mingfei Sun
Andi Zhang
Samuel Kaski
Wei Pan
191
1
0
15 Oct 2024
A Survey: Collaborative Hardware and Software Design in the Era of Large
  Language Models
A Survey: Collaborative Hardware and Software Design in the Era of Large Language ModelsIEEE Circuits and Systems Magazine (IEEE CSM), 2024
Cong Guo
Feng Cheng
Zhixu Du
James Kiessling
Jonathan Ku
...
Qilin Zheng
Guanglei Zhou
Hai
Li-Wei Li
Yiran Chen
169
17
0
08 Oct 2024
Hyperbolic Fine-tuning for Large Language Models
Hyperbolic Fine-tuning for Large Language Models
Menglin Yang
Aosong Feng
Bo Xiong
Jihong Liu
Irwin King
Rex Ying
277
10
0
05 Oct 2024
Fine-tuning and Prompt Engineering with Cognitive Knowledge Graphs for
  Scholarly Knowledge Organization
Fine-tuning and Prompt Engineering with Cognitive Knowledge Graphs for Scholarly Knowledge Organization
Gollam Rabby
Sören Auer
Jennifer D'Souza
A. Oelen
594
3
0
10 Sep 2024
CoRA: Optimizing Low-Rank Adaptation with Common Subspace of Large
  Language Models
CoRA: Optimizing Low-Rank Adaptation with Common Subspace of Large Language Models
Xiaojun Xiao
Sen Shen
Qiming Bao
Hongfei Rong
Kairui Liu
Zhongsheng Wang
Jing Liu
230
2
0
31 Aug 2024
Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats in Customized Large Language Models
Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats in Customized Large Language Models
Zi Liang
Haibo Hu
Qingqing Ye
Yaxin Xiao
Haoyang Li
AAMLELMSILM
365
15
0
05 Aug 2024
Enhancing Agricultural Machinery Management through Advanced LLM
  Integration
Enhancing Agricultural Machinery Management through Advanced LLM Integration
Emily Johnson
Noah Wilson
221
2
0
30 Jul 2024
12
Next