Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1901.09504
Cited By
Improving Neural Network Quantization without Retraining using Outlier Channel Splitting
28 January 2019
Ritchie Zhao
Yuwei Hu
Jordan Dotzel
Christopher De Sa
Zhiru Zhang
OODD
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Improving Neural Network Quantization without Retraining using Outlier Channel Splitting"
50 / 174 papers shown
Title
Efficient Visual Recognition with Deep Neural Networks: A Survey on Recent Advances and New Directions
Yang Wu
Dingheng Wang
Xiaotong Lu
Fan Yang
Guoqi Li
W. Dong
Jianbo Shi
27
18
0
30 Aug 2021
Auto-Split: A General Framework of Collaborative Edge-Cloud AI
Amin Banitalebi-Dehkordi
Naveen Vedula
J. Pei
Fei Xia
Lanjun Wang
Yong Zhang
22
89
0
30 Aug 2021
Generalizable Mixed-Precision Quantization via Attribution Rank Preservation
Ziwei Wang
Han Xiao
Jiwen Lu
Jie Zhou
MQ
14
32
0
05 Aug 2021
MOHAQ: Multi-Objective Hardware-Aware Quantization of Recurrent Neural Networks
Nesma M. Rezk
Tomas Nordstrom
D. Stathis
Z. Ul-Abdin
E. Aksoy
A. Hemani
MQ
20
1
0
02 Aug 2021
Image Complexity Guided Network Compression for Biomedical Image Segmentation
Suraj Mishra
D. Z. Chen
X. Sharon
29
6
0
06 Jul 2021
Post-Training Quantization for Vision Transformer
Zhenhua Liu
Yunhe Wang
Kai Han
Siwei Ma
Wen Gao
ViT
MQ
39
324
0
27 Jun 2021
Distilling Image Classifiers in Object Detectors
Shuxuan Guo
J. Álvarez
Mathieu Salzmann
VLM
16
8
0
09 Jun 2021
Advances in Classifying the Stages of Diabetic Retinopathy Using Convolutional Neural Networks in Low Memory Edge Devices
A. Paul
13
4
0
03 Jun 2021
RED : Looking for Redundancies for Data-Free Structured Compression of Deep Neural Networks
Edouard Yvinec
Arnaud Dapogny
Matthieu Cord
Kévin Bailly
CVBM
13
20
0
31 May 2021
Is In-Domain Data Really Needed? A Pilot Study on Cross-Domain Calibration for Network Quantization
Haichao Yu
Linjie Yang
Humphrey Shi
OOD
MQ
18
5
0
16 May 2021
Lightweight Compression of Intermediate Neural Network Features for Collaborative Intelligence
R. Cohen
Hyomin Choi
Ivan V. Bajić
14
23
0
15 May 2021
Lightweight compression of neural network feature tensors for collaborative intelligence
R. Cohen
Hyomin Choi
Ivan V. Bajić
8
42
0
12 May 2021
Pareto-Optimal Quantized ResNet Is Mostly 4-bit
AmirAli Abdolrashidi
Lisa Wang
Shivani Agrawal
J. Malmaud
Oleg Rybakov
Chas Leichner
Lukasz Lew
MQ
18
35
0
07 May 2021
Q-Rater: Non-Convex Optimization for Post-Training Uniform Quantization
Byeongwook Kim
Dongsoo Lee
Yeonju Ro
Yongkweon Jeon
S. Kwon
Baeseong Park
Daehwan Oh
MQ
10
1
0
05 May 2021
Inspect, Understand, Overcome: A Survey of Practical Methods for AI Safety
Sebastian Houben
Stephanie Abrecht
Maram Akila
Andreas Bär
Felix Brockherde
...
Serin Varghese
Michael Weber
Sebastian J. Wirkert
Tim Wirtz
Matthias Woehrle
AAML
13
58
0
29 Apr 2021
Quaternion Factorization Machines: A Lightweight Solution to Intricate Feature Interaction Modelling
Tong Chen
Hongzhi Yin
Xiangliang Zhang
Zi Huang
Yang Wang
Meng Wang
17
12
0
05 Apr 2021
Zero-shot Adversarial Quantization
Yuang Liu
Wei Zhang
Jun Wang
MQ
11
77
0
29 Mar 2021
Enabling Design Methodologies and Future Trends for Edge AI: Specialization and Co-design
Cong Hao
Jordan Dotzel
Jinjun Xiong
Luca Benini
Zhiru Zhang
Deming Chen
50
34
0
25 Mar 2021
Evaluating Post-Training Compression in GANs using Locality-Sensitive Hashing
Gonçalo Mordido
Haojin Yang
Christoph Meinel
11
2
0
22 Mar 2021
Diversifying Sample Generation for Accurate Data-Free Quantization
Xiangguo Zhang
Haotong Qin
Yifu Ding
Ruihao Gong
Qing Yan
Renshuai Tao
Yuhang Li
F. Yu
Xianglong Liu
MQ
54
94
0
01 Mar 2021
Ps and Qs: Quantization-aware pruning for efficient low latency neural network inference
B. Hawks
Javier Mauricio Duarte
Nicholas J. Fraser
Alessandro Pappalardo
N. Tran
Yaman Umuroglu
MQ
6
51
0
22 Feb 2021
GradFreeBits: Gradient Free Bit Allocation for Dynamic Low Precision Neural Networks
Ben Bodner
G. B. Shalom
Eran Treister
MQ
19
2
0
18 Feb 2021
Confounding Tradeoffs for Neural Network Quantization
Sahaj Garg
Anirudh Jain
Joe Lou
Mitchell Nahmias
MQ
21
17
0
12 Feb 2021
VS-Quant: Per-vector Scaled Quantization for Accurate Low-Precision Neural Network Inference
Steve Dai
Rangharajan Venkatesan
Haoxing Ren
B. Zimmer
W. Dally
Brucek Khailany
MQ
25
67
0
08 Feb 2021
Fixed-point Quantization of Convolutional Neural Networks for Quantized Inference on Embedded Platforms
Rishabh Goyal
Joaquin Vanschoren
V. V. Acht
S. Nijssen
MQ
14
23
0
03 Feb 2021
Generative Zero-shot Network Quantization
Xiangyu He
Qinghao Hu
Peisong Wang
Jian Cheng
GAN
MQ
23
23
0
21 Jan 2021
Knowledge Distillation Methods for Efficient Unsupervised Adaptation Across Multiple Domains
Le Thanh Nguyen-Meidine
Atif Belal
M. Kiran
Jose Dolz
Louis-Antoine Blais-Morin
Eric Granger
25
25
0
18 Jan 2021
BinaryBERT: Pushing the Limit of BERT Quantization
Haoli Bai
Wei Zhang
Lu Hou
Lifeng Shang
Jing Jin
Xin Jiang
Qun Liu
Michael Lyu
Irwin King
MQ
140
221
0
31 Dec 2020
Hybrid and Non-Uniform quantization methods using retro synthesis data for efficient inference
Gvsl Tej Pratap
R. Kumar
MQ
16
1
0
26 Dec 2020
FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations
Yichi Zhang
Junhao Pan
Xinheng Liu
Hongzheng Chen
Deming Chen
Zhiru Zhang
MQ
41
87
0
22 Dec 2020
DAQ: Channel-Wise Distribution-Aware Quantization for Deep Image Super-Resolution Networks
Chee Hong
Heewon Kim
Sungyong Baik
Junghun Oh
Kyoung Mu Lee
OOD
SupR
MQ
11
40
0
21 Dec 2020
Exploring Neural Networks Quantization via Layer-Wise Quantization Analysis
Shachar Gluska
Mark Grobman
MQ
12
5
0
15 Dec 2020
Logic Synthesis Meets Machine Learning: Trading Exactness for Generalization
Shubham Rai
Walter Lau Neto
Yukio Miyasaka
Xinpei Zhang
Mingfei Yu
...
Zhiru Zhang
V. Tenace
P. Gaillardon
A. Mishchenko
S. Chatterjee
NAI
10
27
0
04 Dec 2020
A Tiny CNN Architecture for Medical Face Mask Detection for Resource-Constrained Endpoints
P. Mohan
A. Paul
Abhay Chirania
CVBM
14
48
0
30 Nov 2020
Subtensor Quantization for Mobilenets
Thu Dinh
A. Melnikov
Vasilios Daskalopoulos
S. Chai
MQ
12
4
0
04 Nov 2020
A Statistical Framework for Low-bitwidth Training of Deep Neural Networks
Jianfei Chen
Yujie Gai
Z. Yao
Michael W. Mahoney
Joseph E. Gonzalez
MQ
12
58
0
27 Oct 2020
Extremely Low Bit Transformer Quantization for On-Device Neural Machine Translation
Insoo Chung
Byeongwook Kim
Yoonjung Choi
S. Kwon
Yongkweon Jeon
Baeseong Park
Sangha Kim
Dongsoo Lee
MQ
11
27
0
16 Sep 2020
Fast Implementation of 4-bit Convolutional Neural Networks for Mobile Devices
A. Trusov
E. Limonova
Dmitry Slugin
D. Nikolaev
V. Arlazarov
MQ
9
17
0
14 Sep 2020
One Weight Bitwidth to Rule Them All
Ting-Wu Chin
P. Chuang
Vikas Chandra
Diana Marculescu
MQ
17
25
0
22 Aug 2020
Weight Equalizing Shift Scaler-Coupled Post-training Quantization
Jihun Oh
Sangjeong Lee
Meejeong Park
Pooni Walagaurav
K. Kwon
MQ
15
1
0
13 Aug 2020
Controlling Information Capacity of Binary Neural Network
D. Ignatov
Andrey D. Ignatov
MQ
17
21
0
04 Aug 2020
EasyQuant: Post-training Quantization via Scale Optimization
Di Wu
Qingming Tang
Yongle Zhao
Ming Zhang
Ying Fu
Debing Zhang
MQ
17
75
0
30 Jun 2020
Automatic heterogeneous quantization of deep neural networks for low-latency inference on the edge for particle detectors
C. Coelho
Aki Kuusela
Shane Li
Zhuang Hao
T. Aarrestad
Vladimir Loncar
J. Ngadiuba
M. Pierini
Adrian Alan Pol
S. Summers
MQ
21
175
0
15 Jun 2020
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
Itay Hubara
Yury Nahshan
Y. Hanani
Ron Banner
Daniel Soudry
MQ
21
122
0
14 Jun 2020
Neural Network Activation Quantization with Bitwise Information Bottlenecks
Xichuan Zhou
Kui Liu
Cong Shi
Haijun Liu
Ji Liu
MQ
11
1
0
09 Jun 2020
Automated Design Space Exploration for optimised Deployment of DNN on Arm Cortex-A CPUs
Miguel de Prado
Andrew Mundy
Rabia Saeed
Maurizo Denna
Nuria Pazos
Luca Benini
14
11
0
09 Jun 2020
Position-based Scaled Gradient for Model Quantization and Pruning
Jangho Kim
Kiyoon Yoo
Nojun Kwak
MQ
11
7
0
22 May 2020
Joint Progressive Knowledge Distillation and Unsupervised Domain Adaptation
Le Thanh Nguyen-Meidine
Eric Granger
M. Kiran
Jose Dolz
Louis-Antoine Blais-Morin
6
23
0
16 May 2020
AutoScale: Optimizing Energy Efficiency of End-to-End Edge Inference under Stochastic Variance
Young Geun Kim
Carole-Jean Wu
9
3
0
06 May 2020
Up or Down? Adaptive Rounding for Post-Training Quantization
Markus Nagel
Rana Ali Amjad
M. V. Baalen
Christos Louizos
Tijmen Blankevoort
MQ
10
549
0
22 Apr 2020
Previous
1
2
3
4
Next