1811.08886
v3 (latest)
HAQ: Hardware-Aware Automated Quantization with Mixed Precision
Computer Vision and Pattern Recognition (CVPR), 2018
21 November 2018
Kuan-Chieh Wang
Zhijian Liu
Chengyue Wu
Ji Lin
Song Han
MQ
Papers citing "HAQ: Hardware-Aware Automated Quantization with Mixed Precision"
50 / 464 papers shown
TinyM^2Net: A Flexible System Algorithm Co-designed Multimodal Learning Framework for Tiny Devices
Hasib-Al Rashid
Pretom Roy Ovi
Carl E. Busart
A. Gangopadhyay
T. Mohsenin
295
13
0
09 Feb 2022
Neural-PIM: Efficient Processing-In-Memory with Neural Approximation of Peripherals
IEEE transactions on computers (IEEE Trans. Comput.), 2022
Weidong Cao
Yilong Zhao
Adith Boloor
Yinhe Han
Xuan Zhang
Li Jiang
175
24
0
30 Jan 2022
UWC: Unit-wise Calibration Towards Rapid Network Compression
British Machine Vision Conference (BMVC), 2022
Chen Lin
Zheyang Li
Bo Peng
Haoji Hu
Wenming Tan
Ye Ren
Shiliang Pu
MQ
87
1
0
17 Jan 2022
Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks
International Conference on Machine Learning (ICML), 2021
Runpei Dong
Zhanhong Tan
Mengdi Wu
Linfeng Zhang
Kaisheng Ma
MQ
437
14
0
30 Dec 2021
BMPQ: Bit-Gradient Sensitivity Driven Mixed-Precision Quantization of DNNs from Scratch
Design, Automation and Test in Europe (DATE), 2021
Souvik Kundu
Shikai Wang
Qirui Sun
Peter A. Beerel
Massoud Pedram
MQ
168
21
0
24 Dec 2021
Automated Deep Learning: Neural Architecture Search Is Not the End
Xuanyi Dong
D. Kedziora
Katarzyna Musial
Bogdan Gabrys
417
30
0
16 Dec 2021
N3H-Core: Neuron-designed Neural Network Accelerator via FPGA-based Heterogeneous Computing Cores
Yu Gong
Zhihang Xu
Zhezhi He
Weifeng Zhang
Xiaobing Tu
Xiaoyao Liang
Li Jiang
157
18
0
15 Dec 2021
Neural Network Quantization for Efficient Inference: A Survey
Olivia Weng
MQ
200
38
0
08 Dec 2021
Implicit Neural Representations for Image Compression
Yannick Strümpler
Janis Postels
Ren Yang
Luc van Gool
F. Tombari
304
195
0
08 Dec 2021
A Generalized Zero-Shot Quantization of Deep Convolutional Neural Networks via Learned Weights Statistics
IEEE transactions on multimedia (IEEE Trans. Multimedia), 2021
Prasen Kumar Sharma
Arun Abraham
V. N. Rajendiran
MQ
364
10
0
06 Dec 2021
Finding Deviated Behaviors of the Compressed DNN Models for Image Classifications
ACM Transactions on Software Engineering and Methodology (TOSEM), 2021
Yongqiang Tian
Wuqi Zhang
Ming Wen
Shing-Chi Cheung
Chengnian Sun
Shiqing Ma
Yu Jiang
229
9
0
06 Dec 2021
Optimizing for In-memory Deep Learning with Emerging Memory Technology
Zhehui Wang
Yaoyu Zhang
Rick Siow Mong Goh
Wei Zhang
Weng-Fai Wong
195
1
0
01 Dec 2021
Adaptive Token Sampling For Efficient Vision Transformers
Mohsen Fayyaz
Soroush Abbasi Koohpayegani
F. Jafari
Sunando Sengupta
Hamid Reza Vaezi Joze
Eric Sommerlade
Hamed Pirsiavash
Juergen Gall
ViT
365
218
0
30 Nov 2021
Mixed Precision Low-bit Quantization of Neural Network Language Models for Speech Recognition
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Junhao Xu
Jianwei Yu
Shoukang Hu
Xunying Liu
Helen Meng
MQ
251
18
0
29 Nov 2021
Sharpness-aware Quantization for Deep Neural Networks
Jing Liu
Jianfei Cai
Bohan Zhuang
MQ
480
27
0
24 Nov 2021
Semi-Online Knowledge Distillation
Zhiqiang Liu
Yanxia Liu
Chengkai Huang
110
7
0
23 Nov 2021
Mesa: A Memory-saving Training Framework for Transformers
Zizheng Pan
Peng Chen
Haoyu He
Jing Liu
Jianfei Cai
Bohan Zhuang
231
25
0
22 Nov 2021
Toward Compact Parameter Representations for Architecture-Agnostic Neural Network Compression
Yuezhou Sun
Wenlong Zhao
Lijun Zhang
Xiao Liu
Hui Guan
Matei A. Zaharia
200
0
0
19 Nov 2021
LVAC: Learned Volumetric Attribute Compression for Point Clouds using Coordinate Based Networks
Berivan Isik
P. Chou
S. Hwang
Nick Johnston
G. Toderici
3DPC
239
32
0
17 Nov 2021
MQBench: Towards Reproducible and Deployable Model Quantization Benchmark
Yuhang Li
Mingzhu Shen
Jian Ma
Yan Ren
Mingxin Zhao
Tao Gui
Yazhe Niu
F. Yu
Junjie Yan
MQ
163
66
0
05 Nov 2021
RMSMP: A Novel Deep Neural Network Quantization Framework with Row-wise Mixed Schemes and Multiple Precisions
IEEE International Conference on Computer Vision (ICCV), 2021
Sung-En Chang
Yanyu Li
Mengshu Sun
Weiwen Jiang
Sijia Liu
Yanzhi Wang
Xue Lin
MQ
158
13
0
30 Oct 2021
MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning
Neural Information Processing Systems (NeurIPS), 2021
Ji Lin
Wei-Ming Chen
Han Cai
Chuang Gan
Song Han
301
180
0
28 Oct 2021
Applications and Techniques for Fast Machine Learning in Science
Frontiers in Big Data (Front. Big Data), 2021
A. Deiana
Nhan Tran
Joshua C. Agar
Michaela Blott
G. D. Guglielmo
...
Ashish Sharma
S. Summers
Pietro Vischia
J. Vlimant
Olivia Weng
214
81
0
25 Oct 2021
EBJR: Energy-Based Joint Reasoning for Adaptive Inference
British Machine Vision Conference (BMVC), 2021
Mohammad Akbari
Amin Banitalebi-Dehkordi
Yong Zhang
BDL
MQ
156
7
0
20 Oct 2021
BNAS v2: Learning Architectures for Binary Networks with Empirical Improvements
Dahyun Kim
Kunal Pratap Singh
Jonghyun Choi
MQ
269
7
0
16 Oct 2021
Joint Channel and Weight Pruning for Model Acceleration on Moblie Devices
Tianli Zhao
Xi Sheryl Zhang
Wentao Zhu
Jiaxing Wang
Sen Yang
Ji Liu
Jian Cheng
215
2
0
15 Oct 2021
Towards Mixed-Precision Quantization of Neural Networks via Constrained Optimization
Weihan Chen
Peisong Wang
Jian Cheng
MQ
214
80
0
13 Oct 2021
RED++ : Data-Free Pruning of Deep Neural Networks via Input Splitting and Output Merging
Edouard Yvinec
Arnaud Dapogny
Matthieu Cord
Kévin Bailly
228
24
0
30 Sep 2021
Understanding and Overcoming the Challenges of Efficient Transformer Quantization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Yelysei Bondarenko
Markus Nagel
Tijmen Blankevoort
MQ
225
172
0
27 Sep 2021
Distribution-sensitive Information Retention for Accurate Binary Neural Network
International Journal of Computer Vision (IJCV), 2021
Haotong Qin
Xiangguo Zhang
Yazhe Niu
Yifu Ding
Yi Xu
Xianglong Liu
MQ
212
122
0
25 Sep 2021
Bayesian Optimization with Clustering and Rollback for CNN Auto Pruning
European Conference on Computer Vision (ECCV), 2021
Hanwei Fan
Jiandong Mu
W. Zhang
222
5
0
22 Sep 2021
OMPQ: Orthogonal Mixed Precision Quantization
AAAI Conference on Artificial Intelligence (AAAI), 2021
Yuexiao Ma
Taisong Jin
Xiawu Zheng
Yan Wang
Huixia Li
Yongjian Wu
Guannan Jiang
Wei Zhang
Rongrong Ji
MQ
316
50
0
16 Sep 2021
Elastic Significant Bit Quantization and Acceleration for Deep Neural Networks
IEEE Transactions on Parallel and Distributed Systems (TPDS), 2021
Cheng Gong
Ye Lu
Kunpeng Xie
Zongming Jin
Tao Li
Yanzhi Wang
MQ
241
7
0
08 Sep 2021
BioNetExplorer: Architecture-Space Exploration of Bio-Signal Processing Deep Neural Networks for Wearables
IEEE Internet of Things Journal (IEEE IoT Journal), 2021
B. Prabakaran
Asima Akhtar
Semeen Rehman
Osman Hasan
Mohamed Bennai
101
11
0
07 Sep 2021
Cluster-Promoting Quantization with Bit-Drop for Minimizing Network Quantization Loss
J. H. Lee
Jihun Yun
Sung Ju Hwang
Eunho Yang
MQ
290
17
0
05 Sep 2021
Architecture Aware Latency Constrained Sparse Neural Networks
Tianli Zhao
Qinghao Hu
Xiangyu He
Weixiang Xu
Jiaxing Wang
Cong Leng
Jian Cheng
158
0
0
01 Sep 2021
Efficient Visual Recognition with Deep Neural Networks: A Survey on Recent Advances and New Directions
Machine Intelligence Research (MIR), 2021
Yang Wu
Dingheng Wang
Xiaotong Lu
Fan Yang
Guoqi Li
Weiming Dong
Jianbo Shi
375
18
0
30 Aug 2021
Auto-Split: A General Framework of Collaborative Edge-Cloud AI
Knowledge Discovery and Data Mining (KDD), 2021
Amin Banitalebi-Dehkordi
Naveen Vedula
Jian Pei
Fei Xia
Lanjun Wang
Yong Zhang
192
115
0
30 Aug 2021
DKM: Differentiable K-Means Clustering Layer for Neural Network Compression
International Conference on Learning Representations (ICLR), 2021
Minsik Cho
Keivan Alizadeh Vahid
Saurabh N. Adya
Mohammad Rastegari
277
38
0
28 Aug 2021
Dynamic Network Quantization for Efficient Video Inference
IEEE International Conference on Computer Vision (ICCV), 2021
Ximeng Sun
Yikang Shen
Chun-Fu Chen
A. Oliva
Rogerio Feris
Kate Saenko
243
52
0
23 Aug 2021
On the Acceleration of Deep Neural Network Inference using Quantized Compressed Sensing
Meshia Cédric Oveneke
MQ
83
0
0
23 Aug 2021
Online Multi-Granularity Distillation for GAN Compression
Yuxi Ren
Jie Wu
Xuefeng Xiao
Jianchao Yang
340
49
0
16 Aug 2021
Generalizable Mixed-Precision Quantization via Attribution Rank Preservation
IEEE International Conference on Computer Vision (ICCV), 2021
Ziwei Wang
Han Xiao
Jiwen Lu
Jie Zhou
MQ
208
35
0
05 Aug 2021
MOHAQ: Multi-Objective Hardware-Aware Quantization of Recurrent Neural Networks
Nesma M. Rezk
Tomas Nordstrom
D. Stathis
Z. Ul-Abdin
E. Aksoy
A. Hemani
MQ
259
3
0
02 Aug 2021
Pruning Ternary Quantization
Danyang Liu
Xiangshan Chen
Jie Fu
Chen Ma
Xue Liu
MQ
384
0
0
23 Jul 2021
LANA: Latency Aware Network Acceleration
Pavlo Molchanov
Jimmy Hall
Hongxu Yin
Jan Kautz
Nicolò Fusi
Arash Vahdat
324
11
0
12 Jul 2021
HEMP: High-order Entropy Minimization for neural network comPression
Enzo Tartaglione
Stéphane Lathuilière
Attilio Fiandrotti
Marco Cagnazzo
Marco Grangetto
MQ
157
7
0
12 Jul 2021
Post-Training Quantization for Vision Transformer
Neural Information Processing Systems (NeurIPS), 2021
Zhenhua Liu
Yunhe Wang
Kai Han
Siwei Ma
Wen Gao
ViT
MQ
343
433
0
27 Jun 2021
APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores
Boyuan Feng
Yuke Wang
Tong Geng
Ang Li
Yufei Ding
MQ
187
44
0
23 Jun 2021
Neuroevolution-Enhanced Multi-Objective Optimization for Mixed-Precision Quantization
Annual Conference on Genetic and Evolutionary Computation (GECCO), 2021
Santiago Miret
Vui Seng Chua
Mattias Marder
Mariano Phielipp
Nilesh Jain
Somdeb Majumdar
137
10
0
14 Jun 2021