1811.08886
Cited By
v1
v2
v3 (latest)
HAQ: Hardware-Aware Automated Quantization with Mixed Precision
Computer Vision and Pattern Recognition (CVPR), 2018
21 November 2018
Kuan Wang
Zhijian Liu
Yujun Lin
Ji Lin
Song Han
MQ
Papers citing
"HAQ: Hardware-Aware Automated Quantization with Mixed Precision"
50 / 462 papers shown
Title
DarkneTZ: Towards Model Privacy at the Edge using Trusted Execution Environments
ACM SIGMOBILE International Conference on Mobile Systems, Applications, and Services (MobiSys), 2020
Fan Mo
Ali Shahin Shamsabadi
Kleomenis Katevas
Soteris Demetriou
Ilias Leontiadis
Andrea Cavallaro
Hamed Haddadi
FedML
143
208
0
12 Apr 2020
FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions
Computer Vision and Pattern Recognition (CVPR), 2020
Alvin Wan
Xiaoliang Dai
Peizhao Zhang
Zijian He
Yuandong Tian
...
Matthew Yu
Tao Xu
Kan Chen
Peter Vajda
Joseph E. Gonzalez
153
316
0
12 Apr 2020
GeneCAI: Genetic Evolution for Acquiring Compact AI
Annual Conference on Genetic and Evolutionary Computation (GECCO), 2020
Mojan Javaheripi
Mohammad Samragh
T. Javidi
F. Koushanfar
236
9
0
08 Apr 2020
CNN2Gate: Toward Designing a General Framework for Implementation of Convolutional Neural Networks on FPGA
Alireza Ghaffari
Yvon Savaria
140
10
0
06 Apr 2020
GAN Compression: Efficient Architectures for Interactive Conditional GANs
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Muyang Li
Ji Lin
Yaoyao Ding
Zhijian Liu
Jun-Yan Zhu
Song Han
GAN
196
3
0
19 Mar 2020
Group Sparsity: The Hinge Between Filter Pruning and Decomposition for Network Compression
Computer Vision and Pattern Recognition (CVPR), 2020
Yawei Li
Shuhang Gu
Christoph Mayer
Luc Van Gool
Radu Timofte
288
205
0
19 Mar 2020
Efficient Bitwidth Search for Practical Mixed Precision Neural Network
Yuhang Li
Wei Wang
Haoli Bai
Ruihao Gong
Xin Dong
F. Yu
MQ
135
23
0
17 Mar 2020
Benchmarking TinyML Systems: Challenges and Direction
Colby R. Banbury
Vijay Janapa Reddi
Max Lam
William Fu
A. Fazel
...
Jae-sun Seo
Jeff Sieracki
Urmish Thakker
Marian Verhelst
Poonam Yadav
281
270
0
10 Mar 2020
Ordering Chaos: Memory-Aware Scheduling of Irregularly Wired Neural Networks for Edge Devices
Conference on Machine Learning and Systems (MLSys), 2020
Byung Hoon Ahn
Jinwon Lee
J. Lin
Hsin-Pai Cheng
Jilei Hou
H. Esmaeilzadeh
173
57
0
04 Mar 2020
WaveQ: Gradient-Based Deep Quantization of Neural Networks through Sinusoidal Adaptive Regularization
Ahmed T. Elthakeb
Prannoy Pilligundla
Fatemehsadat Mireshghallah
T. Elgindi
Charles-Alban Deledalle
H. Esmaeilzadeh
MQ
173
10
0
29 Feb 2020
Learning in the Frequency Domain
Computer Vision and Pattern Recognition (CVPR), 2020
Kai Xu
Minghai Qin
Fei Sun
Yuhao Wang
Yen-kuang Chen
Fengbo Ren
279
494
0
27 Feb 2020
RNNPool: Efficient Non-linear Pooling for RAM Constrained Inference
Neural Information Processing Systems (NeurIPS), 2020
Oindrila Saha
Aditya Kusupati
H. Simhadri
Manik Varma
Prateek Jain
145
57
0
27 Feb 2020
Searching for Winograd-aware Quantized Networks
Conference on Machine Learning and Systems (MLSys), 2020
Javier Fernandez-Marques
P. Whatmough
Andrew Mundy
Matthew Mattina
MQ
116
40
0
25 Feb 2020
Exploring the Connection Between Binary and Spiking Neural Networks
Frontiers in Neuroscience (Front. Neurosci.), 2020
Sen Lu
Abhronil Sengupta
MQ
179
109
0
24 Feb 2020
Post-training Quantization with Multiple Points: Mixed Precision without Mixed Precision
AAAI Conference on Artificial Intelligence (AAAI), 2020
Xingchao Liu
Mao Ye
Dengyong Zhou
Qiang Liu
MQ
247
51
0
20 Feb 2020
Precision Gating: Improving Neural Network Efficiency with Dynamic Dual-Precision Activations
International Conference on Learning Representations (ICLR), 2020
Yichi Zhang
Ritchie Zhao
Weizhe Hua
N. Xu
G. E. Suh
Zhiru Zhang
MQ
299
28
0
17 Feb 2020
Learning Architectures for Binary Networks
European Conference on Computer Vision (ECCV), 2020
Dahyun Kim
Kunal Pratap Singh
Jonghyun Choi
MQ
188
46
0
17 Feb 2020
BitPruning: Learning Bitlengths for Aggressive and Accurate Quantization
International Symposium on Circuits and Systems (ISCAS), 2020
Miloš Nikolić
G. B. Hacene
Ciaran Bannon
Alberto Delmas Lascorz
Matthieu Courbariaux
Yoshua Bengio
Vincent Gripon
Andreas Moshovos
MQ
152
25
0
08 Feb 2020
Switchable Precision Neural Networks
Luis Guerra
Bohan Zhuang
Ian Reid
Tom Drummond
MQ
134
20
0
07 Feb 2020
Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation
International Conference on Learning Representations (ICLR), 2020
Byung Hoon Ahn
Prannoy Pilligundla
Amir Yazdanbakhsh
H. Esmaeilzadeh
ODL
191
87
0
23 Jan 2020
Channel Pruning via Automatic Structure Search
International Joint Conference on Artificial Intelligence (IJCAI), 2020
Mingbao Lin
Rongrong Ji
Yuxin Zhang
Baochang Zhang
Yongjian Wu
Yonghong Tian
232
270
0
23 Jan 2020
Filter Sketch for Network Pruning
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2020
Mingbao Lin
Liujuan Cao
Shaojie Li
QiXiang Ye
Yonghong Tian
Jianzhuang Liu
Q. Tian
Rongrong Ji
CLIP
3DPC
248
95
0
23 Jan 2020
Functional Error Correction for Robust Neural Networks
IEEE Journal on Selected Areas in Information Theory (JSAIT), 2020
Kunping Huang
P. Siegel
Anxiao Jiang
47
27
0
12 Jan 2020
Least squares binary quantization of neural networks
Hadi Pouransari
Zhucheng Tu
Oncel Tuzel
MQ
196
36
0
09 Jan 2020
Resource-Efficient Neural Networks for Embedded Systems
Wolfgang Roth
Günther Schindler
Lukas Pfeifenberger
Robert Peharz
Sebastian Tschiatschek
Holger Fröning
Franz Pernkopf
Zoubin Ghahramani
199
63
0
07 Jan 2020
RPR: Random Partition Relaxation for Training Binary and Ternary Weight Neural Networks
Lukas Cavigelli
Luca Benini
MQ
183
9
0
04 Jan 2020
Fractional Skipping: Towards Finer-Grained Dynamic CNN Inference
AAAI Conference on Artificial Intelligence (AAAI), 2020
Jianghao Shen
Y. Fu
Yue Wang
Pengfei Xu
Zinan Lin
Yingyan Lin
MQ
114
48
0
03 Jan 2020
Mixed-Precision Quantized Neural Network with Progressively Decreasing Bitwidth For Image Classification and Object Detection
Tianshu Chu
Qin Luo
Jie Yang
Xiaolin Huang
MQ
110
8
0
29 Dec 2019
Towards Unified INT8 Training for Convolutional Neural Network
Computer Vision and Pattern Recognition (CVPR), 2019
Feng Zhu
Ruihao Gong
F. Yu
Xianglong Liu
Yanfei Wang
Zhelong Li
Xiuqi Yang
Junjie Yan
MQ
207
172
0
29 Dec 2019
Towards Efficient Training for Neural Network Quantization
Qing Jin
Linjie Yang
Zhenyu A. Liao
MQ
223
42
0
21 Dec 2019
AdaBits: Neural Network Quantization with Adaptive Bit-Widths
Computer Vision and Pattern Recognition (CVPR), 2019
Qing Jin
Linjie Yang
Zhenyu A. Liao
MQ
197
143
0
20 Dec 2019
Dreaming to Distill: Data-free Knowledge Transfer via DeepInversion
Computer Vision and Pattern Recognition (CVPR), 2019
Hongxu Yin
Pavlo Molchanov
Zhizhong Li
J. Álvarez
Arun Mallya
Derek Hoiem
N. Jha
Jan Kautz
383
636
0
18 Dec 2019
Dynamic Convolution: Attention over Convolution Kernels
Computer Vision and Pattern Recognition (CVPR), 2019
Yinpeng Chen
Xiyang Dai
Mengchen Liu
Dongdong Chen
Lu Yuan
Zicheng Liu
311
1,126
0
07 Dec 2019
Deep Model Compression Via Two-Stage Deep Reinforcement Learning
Huixin Zhan
Wei-Ming Lin
Yongcan Cao
118
12
0
04 Dec 2019
Semi-Relaxed Quantization with DropBits: Training Low-Bit Neural Networks via Bit-wise Regularization
J. H. Lee
Jihun Yun
Sung Ju Hwang
Eunho Yang
MQ
157
0
0
29 Nov 2019
QKD: Quantization-aware Knowledge Distillation
Jangho Kim
Brandon Smart
Jinwon Lee
Chirag I. Patel
Nojun Kwak
MQ
205
72
0
28 Nov 2019
Domain-Aware Dynamic Networks
Tianyuan Zhang
Bichen Wu
Xin Wang
Joseph E. Gonzalez
Kurt Keutzer
137
6
0
26 Nov 2019
Any-Precision Deep Neural Networks
AAAI Conference on Artificial Intelligence (AAAI), 2019
Haichao Yu
Haoxiang Li
Humphrey Shi
Thomas S. Huang
G. Hua
MQ
205
73
0
17 Nov 2019
Ternary MobileNets via Per-Layer Hybrid Filter Banks
Dibakar Gope
Jesse G. Beu
Urmish Thakker
Matthew Mattina
MQ
133
15
0
04 Nov 2019
Comprehensive SNN Compression Using ADMM Optimization and Activity Regularization
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2019
Lei Deng
Yujie Wu
Yifan Hu
Ling Liang
Guoqi Li
Xing Hu
Yufei Ding
Peng Li
Yuan Xie
200
99
0
03 Nov 2019
Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers
Xishan Zhang
Shaoli Liu
Rui Zhang
Yu Xie
Di Huang
...
Jiaming Guo
Yu Kang
Qi Guo
Zidong Du
Yunji Chen
MQ
122
8
0
01 Nov 2019
Training DNN IoT Applications for Deployment On Analog NVM Crossbars
IEEE International Joint Conference on Neural Networks (IJCNN), 2019
F. García-Redondo
Shidhartha Das
G. Rosendale
213
5
0
30 Oct 2019
Depth-wise Decomposition for Accelerating Separable Convolutions in Efficient Convolutional Neural Networks
Advances in Artificial Intelligence and Machine Learning (AAIML), 2019
Yihui He
Jianing Qian
Jianren Wang
Cindy X. Le
Congrui Hetang
Qi Lyu
Wenping Wang
Tianwei Yue
227
14
0
21 Oct 2019
Automatic Neural Network Compression by Sparsity-Quantization Joint Learning: A Constrained Optimization-based Approach
Haichuan Yang
Shupeng Gui
Yuhao Zhu
Ji Liu
MQ
130
5
0
14 Oct 2019
Forward and Backward Information Retention for Accurate Binary Neural Networks
Computer Vision and Pattern Recognition (CVPR), 2019
Haotong Qin
Ruihao Gong
Xianglong Liu
Mingzhu Shen
Ziran Wei
F. Yu
Jingkuan Song
MQ
344
364
0
24 Sep 2019
Structured Binary Neural Networks for Image Recognition
International Journal of Computer Vision (IJCV), 2019
Bohan Zhuang
Chunhua Shen
Zhuliang Yu
Peng Chen
Lingqiao Liu
Ian Reid
MQ
277
20
0
22 Sep 2019
PULP-NN: Accelerating Quantized Neural Networks on Parallel Ultra-Low-Power RISC-V Processors
Angelo Garofalo
Manuele Rusci
Francesco Conti
D. Rossi
Luca Benini
MQ
167
143
0
29 Aug 2019
Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks
IEEE International Conference on Computer Vision (ICCV), 2019
Ruihao Gong
Xianglong Liu
Shenghu Jiang
Tian-Hao Li
Peng Hu
Jiazhen Lin
F. Yu
Junjie Yan
MQ
232
507
0
14 Aug 2019
GDRQ: Group-based Distribution Reshaping for Quantization
Haibao Yu
Tuopu Wen
Guangliang Cheng
Jiankai Sun
Qi Han
Jianping Shi
MQ
147
3
0
05 Aug 2019
And the Bit Goes Down: Revisiting the Quantization of Neural Networks
International Conference on Learning Representations (ICLR), 2019
Pierre Stock
Armand Joulin
Rémi Gribonval
Benjamin Graham
Edouard Grave
MQ
357
154
0
12 Jul 2019