Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.09184
Cited By
Block-wise Bit-Compression of Transformer-based Models
16 March 2023
Gaochen Dong
W. Chen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Block-wise Bit-Compression of Transformer-based Models"
3 / 3 papers shown
Title
I-BERT: Integer-only BERT Quantization
Sehoon Kim
A. Gholami
Z. Yao
Michael W. Mahoney
Kurt Keutzer
MQ
88
341
0
05 Jan 2021
Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT
Sheng Shen
Zhen Dong
Jiayu Ye
Linjian Ma
Z. Yao
A. Gholami
Michael W. Mahoney
Kurt Keutzer
MQ
225
575
0
12 Sep 2019
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
297
6,950
0
20 Apr 2018
1