Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.17888
Cited By
LLM-QAT: Data-Free Quantization Aware Training for Large Language Models
29 May 2023
Zechun Liu
Barlas Oğuz
Changsheng Zhao
Ernie Chang
Pierre Stock
Yashar Mehdad
Yangyang Shi
Raghuraman Krishnamoorthi
Vikas Chandra
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"LLM-QAT: Data-Free Quantization Aware Training for Large Language Models"
1 / 151 papers shown
Title
Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT
Sheng Shen
Zhen Dong
Jiayu Ye
Linjian Ma
Z. Yao
A. Gholami
Michael W. Mahoney
Kurt Keutzer
MQ
214
505
0
12 Sep 2019
Previous
1
2
3
4