2411.02530
A Comprehensive Study on Quantization Techniques for Large Language Models
30 October 2024
Jiedong Lang, Zhehao Guo, Shuyu Huang
MQ
Papers citing "A Comprehensive Study on Quantization Techniques for Large Language Models"
6 / 6 papers shown
Fine-Tuning Large Language Models and Evaluating Retrieval Methods for Improved Question Answering on Building Codes
Mohammad Aqib, Mohd Hamza, Qipei Mei, Ying Hei Chui
RALM, ELM
47, 0, 0
07 May 2025
ConTextual: Improving Clinical Text Summarization in LLMs with Context-preserving Token Filtering and Knowledge Graphs
Fahmida Liza Piya, Rahmatollah Beheshti
34, 0, 0
23 Apr 2025
Enhancing Ultra-Low-Bit Quantization of Large Language Models Through Saliency-Aware Partial Retraining
Deyu Cao, Samin Aref
MQ
25, 0, 0
14 Apr 2025
Quantization Error Propagation: Revisiting Layer-Wise Post-Training Quantization
Yamato Arai, Yuma Ichikawa
MQ
24, 0, 0
13 Apr 2025
Adaptive Orchestration for Inference of Large Foundation Models at the Edge
Fernando Koch, Aladin Djuhera, Alecio Binotto
26, 0, 0
19 Mar 2025
Scaling Laws for Floating Point Quantization Training
X. Sun, Shuaipeng Li, Ruobing Xie, Weidong Han, Kan Wu, ..., Yangyu Tao, Zhanhui Kang, C. Xu, Di Wang, Jie Jiang
MQ, AIFin
53, 0, 0
05 Jan 2025