ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.08886
  4. Cited By
HAQ: Hardware-Aware Automated Quantization with Mixed Precision
v1v2v3 (latest)

HAQ: Hardware-Aware Automated Quantization with Mixed Precision

Computer Vision and Pattern Recognition (CVPR), 2018
21 November 2018
Kuan-Chieh Wang
Zhijian Liu
Chengyue Wu
Ji Lin
Song Han
    MQ
ArXiv (abs)PDFHTML

Papers citing "HAQ: Hardware-Aware Automated Quantization with Mixed Precision"

50 / 464 papers shown
TinyM$^2$Net: A Flexible System Algorithm Co-designed Multimodal
  Learning Framework for Tiny Devices
TinyM2^22Net: A Flexible System Algorithm Co-designed Multimodal Learning Framework for Tiny Devices
Hasib-Al Rashid
Pretom Roy Ovi
Carl E. Busart
A. Gangopadhyay
T. Mohsenin
295
13
0
09 Feb 2022
Neural-PIM: Efficient Processing-In-Memory with Neural Approximation of
  Peripherals
Neural-PIM: Efficient Processing-In-Memory with Neural Approximation of PeripheralsIEEE transactions on computers (IEEE Trans. Comput.), 2022
Weidong Cao
Yilong Zhao
Adith Boloor
Yinhe Han
Xuan Zhang
Li Jiang
175
24
0
30 Jan 2022
UWC: Unit-wise Calibration Towards Rapid Network Compression
UWC: Unit-wise Calibration Towards Rapid Network CompressionBritish Machine Vision Conference (BMVC), 2022
Chen Lin
Zheyang Li
Bo Peng
Haoji Hu
Wenming Tan
Ye Ren
Shiliang Pu
MQ
87
1
0
17 Jan 2022
Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural
  Networks
Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural NetworksInternational Conference on Machine Learning (ICML), 2021
Runpei Dong
Zhanhong Tan
Mengdi Wu
Linfeng Zhang
Kaisheng Ma
MQ
437
14
0
30 Dec 2021
BMPQ: Bit-Gradient Sensitivity Driven Mixed-Precision Quantization of
  DNNs from Scratch
BMPQ: Bit-Gradient Sensitivity Driven Mixed-Precision Quantization of DNNs from ScratchDesign, Automation and Test in Europe (DATE), 2021
Souvik Kundu
Shikai Wang
Qirui Sun
Peter A. Beerel
Massoud Pedram
MQ
168
21
0
24 Dec 2021
Automated Deep Learning: Neural Architecture Search Is Not the End
Automated Deep Learning: Neural Architecture Search Is Not the End
Xuanyi Dong
D. Kedziora
Katarzyna Musial
Bogdan Gabrys
417
30
0
16 Dec 2021
N3H-Core: Neuron-designed Neural Network Accelerator via FPGA-based
  Heterogeneous Computing Cores
N3H-Core: Neuron-designed Neural Network Accelerator via FPGA-based Heterogeneous Computing Cores
Yu Gong
Zhihang Xu
Zhezhi He
Weifeng Zhang
Xiaobing Tu
Xiaoyao Liang
Li Jiang
157
18
0
15 Dec 2021
Neural Network Quantization for Efficient Inference: A Survey
Neural Network Quantization for Efficient Inference: A Survey
Olivia Weng
MQ
200
38
0
08 Dec 2021
Implicit Neural Representations for Image Compression
Implicit Neural Representations for Image Compression
Yannick Strümpler
Janis Postels
Ren Yang
Luc van Gool
F. Tombari
304
195
0
08 Dec 2021
A Generalized Zero-Shot Quantization of Deep Convolutional Neural
  Networks via Learned Weights Statistics
A Generalized Zero-Shot Quantization of Deep Convolutional Neural Networks via Learned Weights StatisticsIEEE transactions on multimedia (IEEE Trans. Multimedia), 2021
Prasen Kumar Sharma
Arun Abraham
V. N. Rajendiran
MQ
364
10
0
06 Dec 2021
Finding Deviated Behaviors of the Compressed DNN Models for Image
  Classifications
Finding Deviated Behaviors of the Compressed DNN Models for Image ClassificationsACM Transactions on Software Engineering and Methodology (TOSEM), 2021
Yongqiang Tian
Wuqi Zhang
Ming Wen
Shing-Chi Cheung
Chengnian Sun
Shiqing Ma
Yu Jiang
229
9
0
06 Dec 2021
Optimizing for In-memory Deep Learning with Emerging Memory Technology
Optimizing for In-memory Deep Learning with Emerging Memory Technology
Zhehui Wang
Yaoyu Zhang
Rick Siow Mong Goh
Wei Zhang
Weng-Fai Wong
195
1
0
01 Dec 2021
Adaptive Token Sampling For Efficient Vision Transformers
Adaptive Token Sampling For Efficient Vision Transformers
Mohsen Fayyaz
Soroush Abbasi Koohpayegani
F. Jafari
Sunando Sengupta
Hamid Reza Vaezi Joze
Eric Sommerlade
Hamed Pirsiavash
Juergen Gall
ViT
365
218
0
30 Nov 2021
Mixed Precision Low-bit Quantization of Neural Network Language Models
  for Speech Recognition
Mixed Precision Low-bit Quantization of Neural Network Language Models for Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Junhao Xu
Jianwei Yu
Shoukang Hu
Xunying Liu
Helen Meng
MQ
251
18
0
29 Nov 2021
Sharpness-aware Quantization for Deep Neural Networks
Sharpness-aware Quantization for Deep Neural Networks
Jing Liu
Jianfei Cai
Bohan Zhuang
MQ
480
27
0
24 Nov 2021
Semi-Online Knowledge Distillation
Semi-Online Knowledge Distillation
Zhiqiang Liu
Yanxia Liu
Chengkai Huang
110
7
0
23 Nov 2021
Mesa: A Memory-saving Training Framework for Transformers
Mesa: A Memory-saving Training Framework for Transformers
Zizheng Pan
Peng Chen
Haoyu He
Jing Liu
Jianfei Cai
Bohan Zhuang
231
25
0
22 Nov 2021
Toward Compact Parameter Representations for Architecture-Agnostic
  Neural Network Compression
Toward Compact Parameter Representations for Architecture-Agnostic Neural Network Compression
Yuezhou Sun
Wenlong Zhao
Lijun Zhang
Xiao Liu
Hui Guan
Matei A. Zaharia
200
0
0
19 Nov 2021
LVAC: Learned Volumetric Attribute Compression for Point Clouds using
  Coordinate Based Networks
LVAC: Learned Volumetric Attribute Compression for Point Clouds using Coordinate Based Networks
Berivan Isik
P. Chou
S. Hwang
Nick Johnston
G. Toderici
3DPC
239
32
0
17 Nov 2021
MQBench: Towards Reproducible and Deployable Model Quantization
  Benchmark
MQBench: Towards Reproducible and Deployable Model Quantization Benchmark
Yuhang Li
Mingzhu Shen
Jian Ma
Yan Ren
Mingxin Zhao
Tao Gui
Yazhe Niu
F. Yu
Junjie Yan
MQ
163
66
0
05 Nov 2021
RMSMP: A Novel Deep Neural Network Quantization Framework with Row-wise
  Mixed Schemes and Multiple Precisions
RMSMP: A Novel Deep Neural Network Quantization Framework with Row-wise Mixed Schemes and Multiple PrecisionsIEEE International Conference on Computer Vision (ICCV), 2021
Sung-En Chang
Yanyu Li
Mengshu Sun
Weiwen Jiang
Sijia Liu
Yanzhi Wang
Xue Lin
MQ
158
13
0
30 Oct 2021
MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning
MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep LearningNeural Information Processing Systems (NeurIPS), 2021
Ji Lin
Wei-Ming Chen
Han Cai
Chuang Gan
Song Han
301
180
0
28 Oct 2021
Applications and Techniques for Fast Machine Learning in Science
Applications and Techniques for Fast Machine Learning in ScienceFrontiers in Big Data (Front. Big Data), 2021
A. Deiana
Nhan Tran
Joshua C. Agar
Michaela Blott
G. D. Guglielmo
...
Ashish Sharma
S. Summers
Pietro Vischia
J. Vlimant
Olivia Weng
214
81
0
25 Oct 2021
EBJR: Energy-Based Joint Reasoning for Adaptive Inference
EBJR: Energy-Based Joint Reasoning for Adaptive InferenceBritish Machine Vision Conference (BMVC), 2021
Mohammad Akbari
Amin Banitalebi-Dehkordi
Yong Zhang
BDLMQ
156
7
0
20 Oct 2021
BNAS v2: Learning Architectures for Binary Networks with Empirical
  Improvements
BNAS v2: Learning Architectures for Binary Networks with Empirical Improvements
Dahyun Kim
Kunal Pratap Singh
Jonghyun Choi
MQ
269
7
0
16 Oct 2021
Joint Channel and Weight Pruning for Model Acceleration on Moblie
  Devices
Joint Channel and Weight Pruning for Model Acceleration on Moblie Devices
Tianli Zhao
Xi Sheryl Zhang
Wentao Zhu
Jiaxing Wang
Sen Yang
Ji Liu
Jian Cheng
215
2
0
15 Oct 2021
Towards Mixed-Precision Quantization of Neural Networks via Constrained
  Optimization
Towards Mixed-Precision Quantization of Neural Networks via Constrained Optimization
Weihan Chen
Peisong Wang
Jian Cheng
MQ
214
80
0
13 Oct 2021
RED++ : Data-Free Pruning of Deep Neural Networks via Input Splitting
  and Output Merging
RED++ : Data-Free Pruning of Deep Neural Networks via Input Splitting and Output Merging
Edouard Yvinec
Arnaud Dapogny
Matthieu Cord
Kévin Bailly
228
24
0
30 Sep 2021
Understanding and Overcoming the Challenges of Efficient Transformer
  Quantization
Understanding and Overcoming the Challenges of Efficient Transformer QuantizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Yelysei Bondarenko
Markus Nagel
Tijmen Blankevoort
MQ
225
172
0
27 Sep 2021
Distribution-sensitive Information Retention for Accurate Binary Neural
  Network
Distribution-sensitive Information Retention for Accurate Binary Neural NetworkInternational Journal of Computer Vision (IJCV), 2021
Haotong Qin
Xiangguo Zhang
Yazhe Niu
Yifu Ding
Yi Xu
Xianglong Liu
MQ
212
122
0
25 Sep 2021
Bayesian Optimization with Clustering and Rollback for CNN Auto Pruning
Bayesian Optimization with Clustering and Rollback for CNN Auto PruningEuropean Conference on Computer Vision (ECCV), 2021
Hanwei Fan
Jiandong Mu
W. Zhang
222
5
0
22 Sep 2021
OMPQ: Orthogonal Mixed Precision Quantization
OMPQ: Orthogonal Mixed Precision QuantizationAAAI Conference on Artificial Intelligence (AAAI), 2021
Yuexiao Ma
Taisong Jin
Xiawu Zheng
Yan Wang
Huixia Li
Yongjian Wu
Guannan Jiang
Wei Zhang
Rongrong Ji
MQ
316
50
0
16 Sep 2021
Elastic Significant Bit Quantization and Acceleration for Deep Neural
  Networks
Elastic Significant Bit Quantization and Acceleration for Deep Neural NetworksIEEE Transactions on Parallel and Distributed Systems (TPDS), 2021
Cheng Gong
Ye Lu
Kunpeng Xie
Zongming Jin
Tao Li
Yanzhi Wang
MQ
241
7
0
08 Sep 2021
BioNetExplorer: Architecture-Space Exploration of Bio-Signal Processing
  Deep Neural Networks for Wearables
BioNetExplorer: Architecture-Space Exploration of Bio-Signal Processing Deep Neural Networks for WearablesIEEE Internet of Things Journal (IEEE IoT Journal), 2021
B. Prabakaran
Asima Akhtar
Semeen Rehman
Osman Hasan
Mohamed Bennai
101
11
0
07 Sep 2021
Cluster-Promoting Quantization with Bit-Drop for Minimizing Network
  Quantization Loss
Cluster-Promoting Quantization with Bit-Drop for Minimizing Network Quantization Loss
J. H. Lee
Jihun Yun
Sung Ju Hwang
Eunho Yang
MQ
290
17
0
05 Sep 2021
Architecture Aware Latency Constrained Sparse Neural Networks
Architecture Aware Latency Constrained Sparse Neural Networks
Tianli Zhao
Qinghao Hu
Xiangyu He
Weixiang Xu
Jiaxing Wang
Cong Leng
Jian Cheng
158
0
0
01 Sep 2021
Efficient Visual Recognition with Deep Neural Networks: A Survey on
  Recent Advances and New Directions
Efficient Visual Recognition with Deep Neural Networks: A Survey on Recent Advances and New DirectionsMachine Intelligence Research (MIR), 2021
Yang Wu
Dingheng Wang
Xiaotong Lu
Fan Yang
Guoqi Li
Weiming Dong
Jianbo Shi
375
18
0
30 Aug 2021
Auto-Split: A General Framework of Collaborative Edge-Cloud AI
Auto-Split: A General Framework of Collaborative Edge-Cloud AIKnowledge Discovery and Data Mining (KDD), 2021
Amin Banitalebi-Dehkordi
Naveen Vedula
Jian Pei
Fei Xia
Lanjun Wang
Yong Zhang
192
115
0
30 Aug 2021
DKM: Differentiable K-Means Clustering Layer for Neural Network
  Compression
DKM: Differentiable K-Means Clustering Layer for Neural Network CompressionInternational Conference on Learning Representations (ICLR), 2021
Minsik Cho
Keivan Alizadeh Vahid
Saurabh N. Adya
Mohammad Rastegari
277
38
0
28 Aug 2021
Dynamic Network Quantization for Efficient Video Inference
Dynamic Network Quantization for Efficient Video InferenceIEEE International Conference on Computer Vision (ICCV), 2021
Ximeng Sun
Yikang Shen
Chun-Fu Chen
A. Oliva
Rogerio Feris
Kate Saenko
243
52
0
23 Aug 2021
On the Acceleration of Deep Neural Network Inference using Quantized
  Compressed Sensing
On the Acceleration of Deep Neural Network Inference using Quantized Compressed Sensing
Meshia Cédric Oveneke
MQ
83
0
0
23 Aug 2021
Online Multi-Granularity Distillation for GAN Compression
Online Multi-Granularity Distillation for GAN Compression
Yuxi Ren
Jie Wu
Xuefeng Xiao
Jianchao Yang
340
49
0
16 Aug 2021
Generalizable Mixed-Precision Quantization via Attribution Rank
  Preservation
Generalizable Mixed-Precision Quantization via Attribution Rank PreservationIEEE International Conference on Computer Vision (ICCV), 2021
Ziwei Wang
Han Xiao
Jiwen Lu
Jie Zhou
MQ
208
35
0
05 Aug 2021
MOHAQ: Multi-Objective Hardware-Aware Quantization of Recurrent Neural
  Networks
MOHAQ: Multi-Objective Hardware-Aware Quantization of Recurrent Neural Networks
Nesma M. Rezk
Tomas Nordstrom
D. Stathis
Z. Ul-Abdin
E. Aksoy
A. Hemani
MQ
259
3
0
02 Aug 2021
Pruning Ternary Quantization
Danyang Liu
Xiangshan Chen
Jie Fu
Chen Ma
Xue Liu
MQ
384
0
0
23 Jul 2021
LANA: Latency Aware Network Acceleration
LANA: Latency Aware Network Acceleration
Pavlo Molchanov
Jimmy Hall
Hongxu Yin
Jan Kautz
Nicolò Fusi
Arash Vahdat
324
11
0
12 Jul 2021
HEMP: High-order Entropy Minimization for neural network comPression
HEMP: High-order Entropy Minimization for neural network comPression
Enzo Tartaglione
Stéphane Lathuilière
Attilio Fiandrotti
Marco Cagnazzo
Marco Grangetto
MQ
157
7
0
12 Jul 2021
Post-Training Quantization for Vision Transformer
Post-Training Quantization for Vision TransformerNeural Information Processing Systems (NeurIPS), 2021
Zhenhua Liu
Yunhe Wang
Kai Han
Siwei Ma
Wen Gao
ViTMQ
343
433
0
27 Jun 2021
APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU
  Tensor Cores
APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores
Boyuan Feng
Yuke Wang
Tong Geng
Ang Li
Yufei Ding
MQ
187
44
0
23 Jun 2021
Neuroevolution-Enhanced Multi-Objective Optimization for Mixed-Precision
  Quantization
Neuroevolution-Enhanced Multi-Objective Optimization for Mixed-Precision QuantizationAnnual Conference on Genetic and Evolutionary Computation (GECCO), 2021
Santiago Miret
Vui Seng Chua
Mattias Marder
Mariano Phielipp
Nilesh Jain
Somdeb Majumdar
137
10
0
14 Jun 2021
Previous
123...1056789
Next