HAQ: Hardware-Aware Automated Quantization with Mixed Precision

Computer Vision and Pattern Recognition (CVPR), 2018
21 November 2018
Kuan Wang
Zhijian Liu
Yujun Lin
Ji Lin
Song Han
    MQ

Papers citing "HAQ: Hardware-Aware Automated Quantization with Mixed Precision"

50 / 462 papers shown
Title
DarkneTZ: Towards Model Privacy at the Edge using Trusted Execution Environments
ACM SIGMOBILE International Conference on Mobile Systems, Applications, and Services (MobiSys), 2020
Fan Mo
Ali Shahin Shamsabadi
Kleomenis Katevas
Soteris Demetriou
Ilias Leontiadis
Andrea Cavallaro
Hamed Haddadi
FedML
143
208
0
12 Apr 2020
FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions
Computer Vision and Pattern Recognition (CVPR), 2020
Alvin Wan
Xiaoliang Dai
Peizhao Zhang
Zijian He
Yuandong Tian
...
Matthew Yu
Tao Xu
Kan Chen
Peter Vajda
Joseph E. Gonzalez
153
316
0
12 Apr 2020
GeneCAI: Genetic Evolution for Acquiring Compact AI
Annual Conference on Genetic and Evolutionary Computation (GECCO), 2020
Mojan Javaheripi
Mohammad Samragh
T. Javidi
F. Koushanfar
236
9
0
08 Apr 2020
CNN2Gate: Toward Designing a General Framework for Implementation of Convolutional Neural Networks on FPGA
Alireza Ghaffari
Yvon Savaria
140
10
0
06 Apr 2020
GAN Compression: Efficient Architectures for Interactive Conditional GANs
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Zhekai Zhang
Ji Lin
Yaoyao Ding
Zhijian Liu
Jun-Yan Zhu
Song Han
GAN
196
3
0
19 Mar 2020
Group Sparsity: The Hinge Between Filter Pruning and Decomposition for Network Compression
Computer Vision and Pattern Recognition (CVPR), 2020
Yawei Li
Shuhang Gu
Christoph Mayer
Luc Van Gool
Radu Timofte
288
205
0
19 Mar 2020
Efficient Bitwidth Search for Practical Mixed Precision Neural Network
Yuhang Li
Wei Wang
Haoli Bai
Yazhe Niu
Xin Dong
F. Yu
MQ
135
23
0
17 Mar 2020
Benchmarking TinyML Systems: Challenges and Direction
Colby R. Banbury
Vijay Janapa Reddi
Max Lam
William Fu
A. Fazel
...
Jae-sun Seo
Jeff Sieracki
Urmish Thakker
Marian Verhelst
Poonam Yadav
281
270
0
10 Mar 2020
Ordering Chaos: Memory-Aware Scheduling of Irregularly Wired Neural Networks for Edge Devices
Conference on Machine Learning and Systems (MLSys), 2020
Byung Hoon Ahn
Jinwon Lee
J. Lin
Hsin-Pai Cheng
Jilei Hou
H. Esmaeilzadeh
173
57
0
04 Mar 2020
WaveQ: Gradient-Based Deep Quantization of Neural Networks through Sinusoidal Adaptive Regularization
Ahmed T. Elthakeb
Prannoy Pilligundla
Fatemehsadat Mireshghallah
T. Elgindi
Charles-Alban Deledalle
H. Esmaeilzadeh
MQ
173
10
0
29 Feb 2020
Learning in the Frequency Domain
Computer Vision and Pattern Recognition (CVPR), 2020
Kai Xu
Minghai Qin
Fei Sun
Yuhao Wang
Yen-kuang Chen
Fengbo Ren
279
494
0
27 Feb 2020
RNNPool: Efficient Non-linear Pooling for RAM Constrained Inference
Neural Information Processing Systems (NeurIPS), 2020
Oindrila Saha
Aditya Kusupati
H. Simhadri
Manik Varma
Prateek Jain
145
57
0
27 Feb 2020
Searching for Winograd-aware Quantized Networks
Conference on Machine Learning and Systems (MLSys), 2020
Javier Fernandez-Marques
P. Whatmough
Andrew Mundy
Matthew Mattina
MQ
116
40
0
25 Feb 2020
Exploring the Connection Between Binary and Spiking Neural Networks
Frontiers in Neuroscience (Front. Neurosci.), 2020
Sen Lu
Abhronil Sengupta
MQ
179
109
0
24 Feb 2020
Post-training Quantization with Multiple Points: Mixed Precision without Mixed Precision
AAAI Conference on Artificial Intelligence (AAAI), 2020
Xingchao Liu
Mao Ye
Dengyong Zhou
Qiang Liu
MQ
247
51
0
20 Feb 2020
Precision Gating: Improving Neural Network Efficiency with Dynamic Dual-Precision Activations
International Conference on Learning Representations (ICLR), 2020
Yichi Zhang
Ritchie Zhao
Weizhe Hua
N. Xu
G. E. Suh
Zhiru Zhang
MQ
299
28
0
17 Feb 2020
Learning Architectures for Binary Networks
European Conference on Computer Vision (ECCV), 2020
Dahyun Kim
Kunal Pratap Singh
Jonghyun Choi
MQ
188
46
0
17 Feb 2020
BitPruning: Learning Bitlengths for Aggressive and Accurate Quantization
International Symposium on Circuits and Systems (ISCAS), 2020
Miloš Nikolić
G. B. Hacene
Ciaran Bannon
Alberto Delmas Lascorz
Matthieu Courbariaux
Yoshua Bengio
Vincent Gripon
Andreas Moshovos
MQ
152
25
0
08 Feb 2020
Switchable Precision Neural Networks
Luis Guerra
Bohan Zhuang
Ian Reid
Tom Drummond
MQ
134
20
0
07 Feb 2020
Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation
International Conference on Learning Representations (ICLR), 2020
Byung Hoon Ahn
Prannoy Pilligundla
Amir Yazdanbakhsh
H. Esmaeilzadeh
ODL
191
87
0
23 Jan 2020
Channel Pruning via Automatic Structure Search
International Joint Conference on Artificial Intelligence (IJCAI), 2020
Mingbao Lin
Rongrong Ji
Yuxin Zhang
Baochang Zhang
Yongjian Wu
Yonghong Tian
232
270
0
23 Jan 2020
Filter Sketch for Network Pruning
IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2020
Mingbao Lin
Liujuan Cao
Shaojie Li
QiXiang Ye
Yonghong Tian
Jianzhuang Liu
Q. Tian
Rongrong Ji
CLIP3DPC
248
95
0
23 Jan 2020
Functional Error Correction for Robust Neural Networks
IEEE Journal on Selected Areas in Information Theory (JSAIT), 2020
Kunping Huang
P. Siegel
Anxiao Jiang
47
27
0
12 Jan 2020
Least squares binary quantization of neural networks
Hadi Pouransari
Zhucheng Tu
Oncel Tuzel
MQ
196
36
0
09 Jan 2020
Resource-Efficient Neural Networks for Embedded Systems
Wolfgang Roth
Günther Schindler
Lukas Pfeifenberger
Robert Peharz
Sebastian Tschiatschek
Holger Fröning
Franz Pernkopf
Zoubin Ghahramani
199
63
0
07 Jan 2020
RPR: Random Partition Relaxation for Training Binary and Ternary Weight Neural Networks
Lukas Cavigelli
Luca Benini
MQ
183
9
0
04 Jan 2020
Fractional Skipping: Towards Finer-Grained Dynamic CNN Inference
AAAI Conference on Artificial Intelligence (AAAI), 2020
Jianghao Shen
Y. Fu
Yue Wang
Pengfei Xu
Zinan Lin
Yingyan Lin
MQ
114
48
0
03 Jan 2020
Mixed-Precision Quantized Neural Network with Progressively Decreasing Bitwidth For Image Classification and Object Detection
Tianshu Chu
Qin Luo
Jie Yang
Xiaolin Huang
MQ
110
8
0
29 Dec 2019
Towards Unified INT8 Training for Convolutional Neural Network
Computer Vision and Pattern Recognition (CVPR), 2019
Feng Zhu
Yazhe Niu
F. Yu
Xianglong Liu
Yanfei Wang
Zhelong Li
Xiuqi Yang
Junjie Yan
MQ
207
172
0
29 Dec 2019
Towards Efficient Training for Neural Network Quantization
Qing Jin
Linjie Yang
Zhenyu A. Liao
MQ
223
42
0
21 Dec 2019
AdaBits: Neural Network Quantization with Adaptive Bit-Widths
Computer Vision and Pattern Recognition (CVPR), 2019
Qing Jin
Linjie Yang
Zhenyu A. Liao
MQ
197
143
0
20 Dec 2019
Dreaming to Distill: Data-free Knowledge Transfer via DeepInversion
Computer Vision and Pattern Recognition (CVPR), 2019
Hongxu Yin
Pavlo Molchanov
Zhizhong Li
J. Álvarez
Arun Mallya
Derek Hoiem
N. Jha
Jan Kautz
383
636
0
18 Dec 2019
Dynamic Convolution: Attention over Convolution Kernels
Computer Vision and Pattern Recognition (CVPR), 2019
Yinpeng Chen
Xiyang Dai
Mengchen Liu
Dongdong Chen
Lu Yuan
Zicheng Liu
311
1,126
0
07 Dec 2019
Deep Model Compression Via Two-Stage Deep Reinforcement Learning
Huixin Zhan
Wei-Ming Lin
Yongcan Cao
118
12
0
04 Dec 2019
Semi-Relaxed Quantization with DropBits: Training Low-Bit Neural Networks via Bit-wise Regularization
J. H. Lee
Jihun Yun
Sung Ju Hwang
Eunho Yang
MQ
157
0
0
29 Nov 2019
QKD: Quantization-aware Knowledge Distillation
Jangho Kim
Brandon Smart
Jinwon Lee
Chirag I. Patel
Nojun Kwak
MQ
205
72
0
28 Nov 2019
Domain-Aware Dynamic Networks
Tianyuan Zhang
Bichen Wu
Xin Wang
Joseph E. Gonzalez
Kurt Keutzer
137
6
0
26 Nov 2019
Any-Precision Deep Neural Networks
AAAI Conference on Artificial Intelligence (AAAI), 2019
Haichao Yu
Haoxiang Li
Humphrey Shi
Thomas S. Huang
G. Hua
MQ
205
73
0
17 Nov 2019
Ternary MobileNets via Per-Layer Hybrid Filter Banks
Dibakar Gope
Jesse G. Beu
Urmish Thakker
Matthew Mattina
MQ
133
15
0
04 Nov 2019
Comprehensive SNN Compression Using ADMM Optimization and Activity Regularization
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2019
Lei Deng
Yujie Wu
Yifan Hu
Ling Liang
Guoqi Li
Xing Hu
Yufei Ding
Peng Li
Yuan Xie
200
99
0
03 Nov 2019
Adaptive Precision Training: Quantify Back Propagation in Neural Networks with Fixed-point Numbers
Xishan Zhang
Shaoli Liu
Rui Zhang
Yu Xie
Di Huang
...
Jiaming Guo
Yu Kang
Qi Guo
Zidong Du
Yunji Chen
MQ
122
8
0
01 Nov 2019
Training DNN IoT Applications for Deployment On Analog NVM Crossbars
IEEE International Joint Conference on Neural Network (IJCNN), 2019
F. García-Redondo
Shidhartha Das
G. Rosendale
213
5
0
30 Oct 2019
Depth-wise Decomposition for Accelerating Separable Convolutions in Efficient Convolutional Neural Networks
Advances in Artificial Intelligence and Machine Learning (AAIML), 2019
Yihui He
Jianing Qian
Jianren Wang
Cindy X. Le
Congrui Hetang
Qi Lyu
Wenping Wang
Tianwei Yue
227
14
0
21 Oct 2019
Automatic Neural Network Compression by Sparsity-Quantization Joint Learning: A Constrained Optimization-based Approach
Haichuan Yang
Shupeng Gui
Yuhao Zhu
Ji Liu
MQ
130
5
0
14 Oct 2019
Forward and Backward Information Retention for Accurate Binary Neural Networks
Computer Vision and Pattern Recognition (CVPR), 2019
Haotong Qin
Yazhe Niu
Xianglong Liu
Mingzhu Shen
Ziran Wei
F. Yu
Jingkuan Song
MQ
344
364
0
24 Sep 2019
Structured Binary Neural Networks for Image Recognition
International Journal of Computer Vision (IJCV), 2019
Bohan Zhuang
Chunhua Shen
Zhuliang Yu
Peng Chen
Lingqiao Liu
Ian Reid
MQ
277
20
0
22 Sep 2019
PULP-NN: Accelerating Quantized Neural Networks on Parallel Ultra-Low-Power RISC-V Processors
Angelo Garofalo
Manuele Rusci
Francesco Conti
D. Rossi
Luca Benini
MQ
167
143
0
29 Aug 2019
Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks
IEEE International Conference on Computer Vision (ICCV), 2019
Yazhe Niu
Xianglong Liu
Shenghu Jiang
Tian-Hao Li
Peng Hu
Jiazhen Lin
F. Yu
Junjie Yan
MQ
232
507
0
14 Aug 2019
GDRQ: Group-based Distribution Reshaping for Quantization
Haibao Yu
Tuopu Wen
Guangliang Cheng
Jiankai Sun
Qi Han
Jianping Shi
MQ
147
3
0
05 Aug 2019
And the Bit Goes Down: Revisiting the Quantization of Neural Networks
International Conference on Learning Representations (ICLR), 2019
Pierre Stock
Armand Joulin
Rémi Gribonval
Benjamin Graham
Edouard Grave
MQ
357
154
0
12 Jul 2019