Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
1811.08886
Cited By
v1
v2
v3 (latest)
HAQ: Hardware-Aware Automated Quantization with Mixed Precision
Computer Vision and Pattern Recognition (CVPR), 2018
21 November 2018
Kuan-Chieh Wang
Zhijian Liu
Chengyue Wu
Ji Lin
Song Han
MQ
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"HAQ: Hardware-Aware Automated Quantization with Mixed Precision"
50 / 464 papers shown
Title
PDP: Parameter-free Differentiable Pruning is All You Need
Neural Information Processing Systems (NeurIPS), 2023
Minsik Cho
Saurabh N. Adya
Devang Naik
VLM
191
15
0
18 May 2023
Patch-wise Mixed-Precision Quantization of Vision Transformer
IEEE International Joint Conference on Neural Network (IJCNN), 2023
Junrui Xiao
Zhikai Li
Lianwei Yang
Qingyi Gu
MQ
161
21
0
11 May 2023
LayerNAS: Neural Architecture Search in Polynomial Complexity
Yicheng Fan
Dana Alon
Jingyue Shen
Daiyi Peng
Keshav Kumar
Yun Long
Xin Wang
Fotis Iliopoulos
Da-Cheng Juan
Erik Vee
122
5
0
23 Apr 2023
QuMoS: A Framework for Preserving Security of Quantum Machine Learning Model
International Conference on Quantum Computing and Engineering (QCE), 2023
Zhepeng Wang
Jinyang Li
Zhirui Hu
Blake Gage
Elizabeth Iwasawa
Weiwen Jiang
273
16
0
23 Apr 2023
Evil from Within: Machine Learning Backdoors through Hardware Trojans
Alexander Warnecke
Julian Speith
Janka Möller
Konrad Rieck
C. Paar
AAML
482
4
0
17 Apr 2023
Canvas: End-to-End Kernel Architecture Search in Neural Networks
Chenggang Zhao
Genghan Zhang
Mingyu Gao
177
2
0
16 Apr 2023
End-to-end codesign of Hessian-aware quantized neural networks for FPGAs and ASICs
Javier Campos
Zhen Dong
Javier Mauricio Duarte
A. Gholami
Michael W. Mahoney
Jovan Mitrevski
Nhan Tran
MQ
131
4
0
13 Apr 2023
Learning Accurate Performance Predictors for Ultrafast Automated Model Compression
International Journal of Computer Vision (IJCV), 2023
Ziwei Wang
Jiwen Lu
Han Xiao
Shengyu Liu
Jie Zhou
OffRL
154
1
0
13 Apr 2023
AutoQNN: An End-to-End Framework for Automatically Quantizing Neural Networks
Journal of Computational Science and Technology (JCST), 2023
Cheng Gong
Ye Lu
Surong Dai
Deng Qian
Chenkun Du
Tao Li
MQ
147
0
0
07 Apr 2023
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review
Li Shen
Yan Sun
Zhiyuan Yu
Liang Ding
Xinmei Tian
Dacheng Tao
VLM
282
51
0
07 Apr 2023
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer
Computer Vision and Pattern Recognition (CVPR), 2023
Xuanyao Chen
Zhijian Liu
Haotian Tang
Li Yi
Hang Zhao
Song Han
ViT
327
72
0
30 Mar 2023
Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective
Computer Vision and Pattern Recognition (CVPR), 2023
Yuexiao Ma
Huixia Li
Xiawu Zheng
Xuefeng Xiao
Rui Wang
Shilei Wen
Xin Pan
Jiayi Ji
Rongrong Ji
MQ
228
15
0
21 Mar 2023
Gated Compression Layers for Efficient Always-On Models
Haiguang Li
T. Thormundsson
I. Poupyrev
N. Gillian
166
3
0
15 Mar 2023
SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference
IEEE International Conference on Computer Vision (ICCV), 2023
Li Zhang
Xudong Wang
Jiahang Xu
Quanlu Zhang
Yujing Wang
Yuqing Yang
Ningxin Zheng
Ting Cao
Mao Yang
MQ
107
3
0
15 Mar 2023
R2 Loss: Range Restriction Loss for Model Compression and Quantization
Arnav Kundu
Chungkuk Yoo
Srijan Mishra
Minsik Cho
Saurabh N. Adya
MQ
137
2
0
14 Mar 2023
MetaMixer: A Regularization Strategy for Online Knowledge Distillation
Maorong Wang
L. Xiao
T. Yamasaki
KELM
MoE
115
1
0
14 Mar 2023
AdaptiveNet: Post-deployment Neural Architecture Adaptation for Diverse Edge Environments
ACM/IEEE International Conference on Mobile Computing and Networking (MobiCom), 2023
Hao Wen
Yuanchun Li
Zunshuai Zhang
Shiqi Jiang
Xiaozhou Ye
Ouyang Ye
Yaqin Zhang
Yunxin Liu
231
54
0
13 Mar 2023
Bag of Tricks with Quantized Convolutional Neural Networks for image classification
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Jie Hu
Mengze Zeng
Enhua Wu
MQ
113
2
0
13 Mar 2023
TinyAD: Memory-efficient anomaly detection for time series data in Industrial IoT
IEEE Transactions on Industrial Informatics (IEEE TII), 2023
Yuting Sun
Tong Chen
Quoc Viet Hung Nguyen
Hongzhi Yin
181
18
0
07 Mar 2023
Rotation Invariant Quantization for Model Compression
Dor-Joseph Kampeas
Yury Nahshan
Hanoch Kremer
Gil Lederman
Shira Zaloshinski
Zheng Li
E. Haleva
MQ
244
1
0
03 Mar 2023
Structured Pruning for Deep Convolutional Neural Networks: A survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yang He
Lingao Xiao
3DPC
348
256
0
01 Mar 2023
DyBit: Dynamic Bit-Precision Numbers for Efficient Quantized Neural Network Inference
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (IEEE TCAD), 2023
Jiajun Zhou
Jiajun Wu
Yizhao Gao
Yuhao Ding
Chaofan Tao
Yue Liu
Fengbin Tu
Kwang-Ting Cheng
Hayden Kwok-Hay So
Ngai Wong
MQ
194
9
0
24 Feb 2023
Towards Optimal Compression: Joint Pruning and Quantization
Ben Zandonati
Glenn Bucagu
Adrian Alan Pol
M. Pierini
Olya Sirkin
Tal Kopetz
MQ
313
5
0
15 Feb 2023
SEAM: Searching Transferable Mixed-Precision Quantization Policy through Large Margin Regularization
ACM Multimedia (ACM MM), 2023
Chen Tang
Kai Ouyang
Runnan Li
Yunpeng Bai
Yuan Meng
Zhi Wang
Wenwu Zhu
MQ
161
14
0
14 Feb 2023
A Practical Mixed Precision Algorithm for Post-Training Quantization
N. Pandey
Markus Nagel
M. V. Baalen
Yin-Ruey Huang
Chirag I. Patel
Tijmen Blankevoort
MQ
178
28
0
10 Feb 2023
Data Quality-aware Mixed-precision Quantization via Hybrid Reinforcement Learning
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Yingchun Wang
Jingcai Guo
Song Guo
Weizhan Zhang
MQ
185
23
0
09 Feb 2023
DynaMIX: Resource Optimization for DNN-Based Real-Time Applications on a Multi-Tasking System
Minkyoung Cho
Kang G. Shin
82
2
0
03 Feb 2023
Mixed Precision Post Training Quantization of Neural Networks with Sensitivity Guided Search
Clemens J. S. Schaefer
Elfie Guo
Caitlin Stanton
Xiaofan Zhang
T. Jablin
Navid Lambert-Shirzad
Jian Li
Chia-Wei Chou
Siddharth Joshi
Yu Wang
MQ
216
4
0
02 Feb 2023
A
2
Q
\rm A^2Q
A
2
Q
: Aggregation-Aware Quantization for Graph Neural Networks
International Conference on Learning Representations (ICLR), 2023
Zeyu Zhu
Fanrong Li
Zitao Mo
Qinghao Hu
Gang Li
Zejian Liu
Xiaoyao Liang
Jian Cheng
GNN
MQ
188
6
0
01 Feb 2023
Efficient and Effective Methods for Mixed Precision Neural Network Quantization for Faster, Energy-efficient Inference
Deepika Bablani
J. McKinstry
S. K. Esser
R. Appuswamy
D. Modha
MQ
283
7
0
30 Jan 2023
Does Federated Learning Really Need Backpropagation?
European Conference on Computer Vision (ECCV), 2023
Hao Feng
Tianyu Pang
Chao Du
Wei Chen
Shuicheng Yan
Min Lin
FedML
241
12
0
28 Jan 2023
Tailor: Altering Skip Connections for Resource-Efficient Inference
ACM Transactions on Reconfigurable Technology and Systems (TRETS), 2023
Olivia Weng
Gabriel Marcano
Vladimir Loncar
Alireza Khodamoradi
Nojan Sheybani
Andres Meza
F. Koushanfar
K. Denolf
Javier Mauricio Duarte
Ryan Kastner
210
17
0
18 Jan 2023
Hyperspherical Quantization: Toward Smaller and More Accurate Models
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Dan Liu
X. Chen
Chen Ma
Xue Liu
MQ
161
4
0
24 Dec 2022
Hyperspherical Loss-Aware Ternary Quantization
Dan Liu
Xue Liu
MQ
149
0
0
24 Dec 2022
Automatic Network Adaptation for Ultra-Low Uniform-Precision Quantization
Seongmin Park
Beomseok Kwon
Jieun Lim
Kyuyoung Sim
Taeho Kim
Jungwook Choi
MQ
230
1
0
21 Dec 2022
CSMPQ:Class Separability Based Mixed-Precision Quantization
International Conference on Intelligent Computing (ICIC), 2022
Ming-Yu Wang
Taisong Jin
Miaohui Zhang
Zhengtao Yu
MQ
115
1
0
20 Dec 2022
RepQ-ViT: Scale Reparameterization for Post-Training Quantization of Vision Transformers
IEEE International Conference on Computer Vision (ICCV), 2022
Zhikai Li
Junrui Xiao
Lianwei Yang
Qingyi Gu
MQ
292
129
0
16 Dec 2022
NAWQ-SR: A Hybrid-Precision NPU Engine for Efficient On-Device Super-Resolution
IEEE Transactions on Mobile Computing (IEEE TMC), 2022
Stylianos I. Venieris
Mario Almeida
Royson Lee
Nicholas D. Lane
SupR
241
7
0
15 Dec 2022
Towards Hardware-Specific Automatic Compression of Neural Networks
Torben Krieger
Bernhard Klein
Holger Fröning
MQ
139
4
0
15 Dec 2022
PD-Quant: Post-Training Quantization based on Prediction Difference Metric
Computer Vision and Pattern Recognition (CVPR), 2022
Jiawei Liu
Lin Niu
Zhihang Yuan
Dawei Yang
Xinggang Wang
Wenyu Liu
MQ
481
94
0
14 Dec 2022
Vertical Layering of Quantized Neural Networks for Heterogeneous Inference
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Hai Wu
Ruifei He
Hao Hao Tan
Xiaojuan Qi
Kaibin Huang
MQ
197
6
0
10 Dec 2022
CSQ: Growing Mixed-Precision Quantization Scheme with Bi-level Continuous Sparsification
Design Automation Conference (DAC), 2022
Lirui Xiao
Huanrui Yang
Zhen Dong
Kurt Keutzer
Li Du
Shanghang Zhang
MQ
132
10
0
06 Dec 2022
Make RepVGG Greater Again: A Quantization-aware Approach
AAAI Conference on Artificial Intelligence (AAAI), 2022
Xiangxiang Chu
Liang Li
Bo Zhang
MQ
294
63
0
03 Dec 2022
Boosted Dynamic Neural Networks
AAAI Conference on Artificial Intelligence (AAAI), 2022
Haichao Yu
Haoxiang Li
G. Hua
Gao Huang
Humphrey Shi
171
15
0
30 Nov 2022
Class-based Quantization for Neural Networks
Design, Automation and Test in Europe (DATE), 2022
Wenhao Sun
Grace Li Zhang
Huaxi Gu
Bing Li
Ulf Schlichtmann
MQ
147
10
0
27 Nov 2022
MPCViT: Searching for Accurate and Efficient MPC-Friendly Vision Transformer with Heterogeneous Attention
IEEE International Conference on Computer Vision (ICCV), 2022
Wenyuan Zeng
Meng Li
Wenjie Xiong
Tong Tong
Wen-jie Lu
Jin Tan
Runsheng Wang
Ru Huang
309
33
0
25 Nov 2022
NeuMap: Neural Coordinate Mapping by Auto-Transdecoder for Camera Localization
Computer Vision and Pattern Recognition (CVPR), 2022
Shitao Tang
Sicong Tang
Andrea Tagliasacchi
Ping Tan
Yasutaka Furukawa
3DPC
150
26
0
21 Nov 2022
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
International Conference on Machine Learning (ICML), 2022
Guangxuan Xiao
Ji Lin
Mickael Seznec
Hao Wu
Julien Demouth
Song Han
MQ
749
1,170
0
18 Nov 2022
Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Zhekai Zhang
Ji Lin
Chenlin Meng
Stefano Ermon
Song Han
Jun-Yan Zhu
DiffM
476
61
0
03 Nov 2022
QuaLA-MiniLM: a Quantized Length Adaptive MiniLM
Shira Guskin
Moshe Wasserblat
Chang Wang
Haihao Shen
MQ
274
2
0
31 Oct 2022
Previous
1
2
3
4
5
...
8
9
10
Next