Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2307.08072
Cited By
v1
v2 (latest)
Do Emergent Abilities Exist in Quantized Large Language Models: An Empirical Study
International Conference on Language Resources and Evaluation (LREC), 2023
16 July 2023
Peiyu Liu
Zikang Liu
Ze-Feng Gao
Dawei Gao
Wayne Xin Zhao
Yaliang Li
Bolin Ding
Ji-Rong Wen
MQ
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Do Emergent Abilities Exist in Quantized Large Language Models: An Empirical Study"
33 / 33 papers shown
Importance-Aware Data Selection for Efficient LLM Instruction Tuning
Tingyu Jiang
Shen Li
Yiyao Song
Lan Zhang
Hualei Zhu
Yuan Zhao
Xiaohang Xu
Kenjiro Taura
Hao Henry Wang
423
5
0
10 Nov 2025
Scaling LLM Test-Time Compute with Mobile NPU on Smartphones
Zixu Hao
Jianyu Wei
Tuowei Wang
Minxing Huang
Huiqiang Jiang
Shiqi Jiang
Ting Cao
Ju Ren
314
3
0
27 Sep 2025
Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization
Vage Egiazarian
Roberto L. Castro
Denis Kuznedelev
Andrei Panferov
Eldar Kurtic
...
Alexandre Marques
Mark Kurtz
Saleh Ashkboos
Torsten Hoefler
Dan Alistarh
MQ
285
12
0
27 Sep 2025
Fair-GPTQ: Bias-Aware Quantization for Large Language Models
Irina Proskurina
Guillaume Metzler
Julien Velcin
MQ
256
0
0
18 Sep 2025
Quantized but Deceptive? A Multi-Dimensional Truthfulness Evaluation of Quantized LLMs
Y. Fu
Xianxuan Long
Runchao Li
Haotian Yu
Mu Sheng
Xiaotian Han
Yu Yin
Pan Li
HILM
218
6
0
26 Aug 2025
Revisiting Compositional Generalization Capability of Large Language Models Considering Instruction Following Ability
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yusuke Sakai
Hidetaka Kamigaito
Taro Watanabe
LRM
311
8
0
18 Jun 2025
Does quantization affect models' performance on long-context tasks?
Anmol Mekala
Anirudh Atmakuru
Yixiao Song
Marzena Karpinska
Mohit Iyyer
MQ
581
3
0
26 May 2025
Through a Compressed Lens: Investigating The Impact of Quantization on Factual Knowledge Recall
Qianli Wang
Mingyang Wang
Nils Feldhus
Simon Ostermann
Yuan Cao
Hinrich Schütze
Sebastian Möller
Vera Schmitt
MQ
304
2
0
20 May 2025
Stability in Single-Peaked Strategic Resource Selection Games
Henri Zeiler
379
7
0
09 May 2025
Domain-Specific Pruning of Large Mixture-of-Experts Models with Few-shot Demonstrations
Zican Dong
Han Peng
Peiyu Liu
Wayne Xin Zhao
Dong Wu
Feng Xiao
Liang Luo
MoE
364
5
0
09 Apr 2025
Sustainable LLM Inference for Edge AI: Evaluating Quantized LLMs for Energy Efficiency, Output Accuracy, and Inference Latency
ACM Transactions on Internet of Things (ACM TIOT), 2025
E. J. Husom
Arda Goknil
Merve Astekin
Lwin Khin Shar
Andre Kåsen
S. Sen
Benedikt Andreas Mithassel
Ahmet Soylu
MQ
450
34
0
04 Apr 2025
PTQ1.61: Push the Real Limit of Extremely Low-Bit Post-Training Quantization Methods for Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jiaqi Zhao
Miao Zhang
Ming Wang
Yuzhang Shang
Kaihao Zhang
Weili Guan
Yaowei Wang
Min Zhang
MQ
379
5
0
18 Feb 2025
Mixture Compressor for Mixture-of-Experts LLMs Gains More
International Conference on Learning Representations (ICLR), 2024
Wei Huang
Yue Liao
Jianhui Liu
Ruifei He
Haoru Tan
Shiming Zhang
Hongsheng Li
Si Liu
Xiaojuan Qi
MoE
345
29
0
08 Oct 2024
Toward the Evaluation of Large Language Models Considering Score Variance across Instruction Templates
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP), 2024
Yusuke Sakai
Adam Nohejl
Jiangnan Hang
Hidetaka Kamigaito
Taro Watanabe
ELM
312
9
0
22 Aug 2024
Dynamic Sentiment Analysis with Local Large Language Models using Majority Voting: A Study on Factors Affecting Restaurant Evaluation
Junichiro Niimi
315
9
0
18 Jul 2024
How Does Quantization Affect Multilingual LLMs?
Kelly Marchisio
Saurabh Dash
Hongyu Chen
Dennis Aumiller
Ahmet Üstün
Sara Hooker
Sebastian Ruder
MQ
347
34
0
03 Jul 2024
Evaluating the Generalization Ability of Quantized LLMs: Benchmark, Analysis, and Toolbox
Yijun Liu
Yuan Meng
Fang Wu
Shenhao Peng
Hang Yao
Chaoyu Guan
Chen Tang
Cheng Wang
Zhi Wang
Wenwu Zhu
MQ
372
9
0
15 Jun 2024
Unlocking Data-free Low-bit Quantization with Matrix Decomposition for KV Cache Compression
Peiyu Liu
Zeming Gao
Wayne Xin Zhao
Yipeng Ma
Tao Wang
Ji-Rong Wen
MQ
324
10
0
21 May 2024
When Quantization Affects Confidence of Large Language Models?
Irina Proskurina
Luc Brun
Guillaume Metzler
Julien Velcin
MQ
321
4
0
01 May 2024
Exploring the Mystery of Influential Data for Mathematical Reasoning
Xinzhe Ni
Yeyun Gong
Zhibin Gou
Haoran Pan
Yujiu Yang
Nan Duan
Weizhu Chen
310
14
0
01 Apr 2024
Evaluating Quantized Large Language Models
Shiyao Li
Xuefei Ning
Luning Wang
Tengxuan Liu
Xiangsheng Shi
Shengen Yan
Guohao Dai
Huazhong Yang
Yu Wang
MQ
338
88
0
28 Feb 2024
A Comprehensive Evaluation of Quantization Strategies for Large Language Models
Renren Jin
Jiangcun Du
Wuwei Huang
Wei Liu
Jian Luan
Sijin Yu
Deyi Xiong
MQ
329
74
0
26 Feb 2024
Comparing Specialised Small and General Large Language Models on Text Classification: 100 Labelled Samples to Achieve Break-Even Performance
Branislav Pecher
Ivan Srba
Maria Bielikova
ALM
411
21
0
20 Feb 2024
Model Compression and Efficient Inference for Large Language Models: A Survey
Wenxiao Wang
Wei Chen
Yicong Luo
Yongliu Long
Zhengkai Lin
Liye Zhang
Binbin Lin
Deng Cai
Xiaofei He
MQ
359
93
0
15 Feb 2024
Accurate LoRA-Finetuning Quantization of LLMs via Information Retention
Haotong Qin
Xudong Ma
Xingyu Zheng
Xiaoyang Li
Yang Zhang
Shouda Liu
Jie Luo
Xianglong Liu
Michele Magno
MQ
295
78
0
08 Feb 2024
One-Shot Learning as Instruction Data Prospector for Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yunshui Li
Binyuan Hui
Xiaobo Xia
Jiaxi Yang
Min Yang
...
Ling-Hao Chen
Junhao Liu
Tongliang Liu
Fei Huang
Yongbin Li
434
50
0
16 Dec 2023
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU
Symposium on Operating Systems Principles (SOSP), 2023
Yixin Song
Zeyu Mi
Haotong Xie
Haibo Chen
BDL
565
245
0
16 Dec 2023
Good Questions Help Zero-Shot Image Reasoning
Kaiwen Yang
Tao Shen
Xinmei Tian
Xiubo Geng
Chongyang Tao
Dacheng Tao
Wanrong Zhu
LRM
299
11
0
04 Dec 2023
PrivateLoRA For Efficient Privacy Preserving LLM
Yiming Wang
Yu Lin
Xiaodong Zeng
Guannan Zhang
373
26
0
23 Nov 2023
Chatmap : Large Language Model Interaction with Cartographic Data
Eren Unlu
KELM
336
6
0
28 Sep 2023
Sparks of Large Audio Models: A Survey and Outlook
S. Latif
Moazzam Shoukat
Fahad Shamshad
Muhammad Usama
Yi Ren
...
Wenwu Wang
Xulong Zhang
Roberto Togneri
Xiaoshi Zhong
Björn W. Schuller
LM&MA
AuLLM
821
56
0
24 Aug 2023
LMTuner: An user-friendly and highly-integrable Training Framework for fine-tuning Large Language Models
Yixuan Weng
Zhiqi Wang
Huanxuan Liao
Shizhu He
Shengping Liu
Kang Liu
Jun Zhao
295
4
0
20 Aug 2023
FootGPT : A Large Language Model Development Experiment on a Minimal Setting
Eren Unlu
ALM
266
1
0
16 Aug 2023
1
Page 1 of 1