Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2002.00104
Cited By
v1
v2 (latest)
Post-Training Piecewise Linear Quantization for Deep Neural Networks
European Conference on Computer Vision (ECCV), 2020
31 January 2020
Jun Fang
Ali Shafiee
Hamzah Abdel-Aziz
D. Thorsley
Georgios Georgiadis
Joseph Hassoun
MQ
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Post-Training Piecewise Linear Quantization for Deep Neural Networks"
23 / 73 papers shown
Title
F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization
International Conference on Learning Representations (ICLR), 2022
Qing Jin
Jian Ren
Richard Zhuang
Sumant Hanumante
Zhengang Li
Zhiyu Chen
Yanzhi Wang
Kai-Min Yang
Sergey Tulyakov
MQ
287
54
0
10 Feb 2022
Energy awareness in low precision neural networks
Nurit Spingarn-Eliezer
Ron Banner
Elad Hoffer
Hilla Ben-Yaacov
T. Michaeli
241
0
0
06 Feb 2022
Post-training Quantization for Neural Networks with Provable Guarantees
SIAM Journal on Mathematics of Data Science (SIMODS), 2022
Jinjie Zhang
Yixuan Zhou
Rayan Saab
MQ
179
48
0
26 Jan 2022
PTQ4ViT: Post-training quantization for vision transformers with twin uniform quantization
Zhihang Yuan
Chenhao Xue
Yiqi Chen
Qiang Wu
Guangyu Sun
ViT
MQ
209
189
0
24 Nov 2021
IntraQ: Learning Synthetic Images with Intra-Class Heterogeneity for Zero-Shot Network Quantization
Mingliang Xu
Mingbao Lin
Gongrui Nan
Jianzhuang Liu
Baochang Zhang
Yonghong Tian
Rongrong Ji
MQ
541
88
0
17 Nov 2021
Edge-Cloud Polarization and Collaboration: A Comprehensive Survey for AI
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2021
Jiangchao Yao
Shengyu Zhang
Yang Yao
Feng Wang
Jianxin Ma
...
Kun Kuang
Chao-Xiang Wu
Leilei Gan
Jingren Zhou
Hongxia Yang
338
135
0
11 Nov 2021
Arch-Net: Model Distillation for Architecture Agnostic Model Deployment
Weixin Xu
Zipeng Feng
Shuangkang Fang
Song Yuan
Yi Yang
Shuchang Zhou
MQ
289
1
0
01 Nov 2021
Applications and Techniques for Fast Machine Learning in Science
Frontiers in Big Data (Front. Big Data), 2021
A. Deiana
Nhan Tran
Joshua C. Agar
Michaela Blott
G. D. Guglielmo
...
Ashish Sharma
S. Summers
Pietro Vischia
J. Vlimant
Olivia Weng
192
79
0
25 Oct 2021
Towards Efficient Post-training Quantization of Pre-trained Language Models
Haoli Bai
Lu Hou
Lifeng Shang
Xin Jiang
Irwin King
Michael R. Lyu
MQ
193
49
0
30 Sep 2021
HPTQ: Hardware-Friendly Post Training Quantization
H. Habi
Reuven Peretz
Elad Cohen
Lior Dikstein
Oranit Dror
I. Diamant
Roy H. Jennings
Arnon Netzer
MQ
193
11
0
19 Sep 2021
Fine-grained Data Distribution Alignment for Post-Training Quantization
European Conference on Computer Vision (ECCV), 2021
Mingliang Xu
Mingbao Lin
Mengzhao Chen
Ke Li
Chunjiang Ge
Jiayi Ji
Yongjian Wu
Rongrong Ji
MQ
180
21
0
09 Sep 2021
Full-Cycle Energy Consumption Benchmark for Low-Carbon Computer Vision
Yue Liu
Xinyang Jiang
Donglin Bai
Yuge Zhang
Ningxin Zheng
Xuanyi Dong
Lu Liu
Yuqing Yang
Dongsheng Li
121
11
0
30 Aug 2021
MOHAQ: Multi-Objective Hardware-Aware Quantization of Recurrent Neural Networks
Nesma M. Rezk
Tomas Nordstrom
D. Stathis
Z. Ul-Abdin
E. Aksoy
A. Hemani
MQ
215
3
0
02 Aug 2021
Post-Training Sparsity-Aware Quantization
Neural Information Processing Systems (NeurIPS), 2021
Gil Shomron
F. Gabbay
Samer Kurzum
U. Weiser
MQ
268
47
0
23 May 2021
RCT: Resource Constrained Training for Edge AI
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Tian Huang
Yaoyu Zhang
Ming Yan
Qiufeng Wang
Rick Siow Mong Goh
251
11
0
26 Mar 2021
Low-Precision Reinforcement Learning: Running Soft Actor-Critic in Half Precision
International Conference on Machine Learning (ICML), 2021
Johan Bjorck
Xiangyu Chen
Christopher De Sa
Daniel Schwalbe-Koda
Kilian Q. Weinberger
210
6
0
26 Feb 2021
VS-Quant: Per-vector Scaled Quantization for Accurate Low-Precision Neural Network Inference
Conference on Machine Learning and Systems (MLSys), 2021
Steve Dai
Rangharajan Venkatesan
Haoxing Ren
B. Zimmer
W. Dally
Brucek Khailany
MQ
191
90
0
08 Feb 2021
Rethinking Floating Point Overheads for Mixed Precision DNN Accelerators
Conference on Machine Learning and Systems (MLSys), 2021
Hamzah Abdel-Aziz
Ali Shafiee
J. Shin
A. Pedram
Joseph Hassoun
MQ
166
13
0
27 Jan 2021
FantastIC4: A Hardware-Software Co-Design Approach for Efficiently Running 4bit-Compact Multilayer Perceptrons
IEEE Open Journal of Circuits and Systems (JOCS), 2020
Simon Wiedemann
Suhas Shivapakash
P. Wiedemann
Daniel Becking
Wojciech Samek
F. Gerfers
Thomas Wiegand
MQ
259
8
0
17 Dec 2020
An Once-for-All Budgeted Pruning Framework for ConvNets Considering Input Resolution
Wenyu Sun
Jian Cao
Pengtao Xu
Xiangcheng Liu
Pu Li
103
0
0
02 Dec 2020
Neural gradients are near-lognormal: improved quantized and sparse training
Brian Chmiel
Liad Ben-Uri
Moran Shkolnik
Elad Hoffer
Ron Banner
Daniel Soudry
MQ
227
5
0
15 Jun 2020
GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy Efficient Inference
Ali Hadi Zadeh
Isak Edo
Omar Mohamed Awad
Andreas Moshovos
MQ
254
207
0
08 May 2020
Loss Aware Post-training Quantization
Machine-mediated learning (ML), 2019
Yury Nahshan
Brian Chmiel
Chaim Baskin
Evgenii Zheltonozhskii
Ron Banner
A. Bronstein
A. Mendelson
MQ
301
184
0
17 Nov 2019
Previous
1
2