Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2306.01076
Cited By
Quantization-Aware and Tensor-Compressed Training of Transformers for Natural Language Understanding
1 June 2023
Ziao Yang
Samridhi Choudhary
Siegfried Kunzmann
Zheng-Wei Zhang
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Quantization-Aware and Tensor-Compressed Training of Transformers for Natural Language Understanding"
3 / 3 papers shown
Title
BinaryBERT: Pushing the Limit of BERT Quantization
Haoli Bai
Wei Zhang
Lu Hou
Lifeng Shang
Jing Jin
Xin Jiang
Qun Liu
Michael Lyu
Irwin King
MQ
142
221
0
31 Dec 2020
Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT
Sheng Shen
Zhen Dong
Jiayu Ye
Linjian Ma
Z. Yao
A. Gholami
Michael W. Mahoney
Kurt Keutzer
MQ
233
576
0
12 Sep 2019
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
297
6,959
0
20 Apr 2018
1