1712.05877
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference
15 December 2017
Benoit Jacob
S. Kligys
Bo Chen
Menglong Zhu
Matthew Tang
Andrew G. Howard
Hartwig Adam
Dmitry Kalenichenko
MQ
Papers citing
"Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference"
50 / 1,255 papers shown
Title
Patchwork: A Patch-wise Attention Network for Efficient Object Detection and Segmentation in Video Streams
Yuning Chai
VOS
13
30
0
03 Apr 2019
Training Quantized Neural Networks with a Full-precision Auxiliary Module
Bohan Zhuang
Lingqiao Liu
Mingkui Tan
Chunhua Shen
Ian Reid
MQ
32
62
0
27 Mar 2019
Looking Fast and Slow: Memory-Guided Mobile Video Object Detection
Mason Liu
Menglong Zhu
Marie White
Yinxiao Li
Dmitry Kalenichenko
17
83
0
25 Mar 2019
Towards Optimal Structured CNN Pruning via Generative Adversarial Learning
Shaohui Lin
R. Ji
Chenqian Yan
Baochang Zhang
Liujuan Cao
QiXiang Ye
Feiyue Huang
David Doermann
CVBM
11
503
0
22 Mar 2019
Trained Quantization Thresholds for Accurate and Efficient Fixed-Point Inference of Deep Neural Networks
Sambhav R. Jain
Albert Gural
Michael Wu
Chris Dick
MQ
13
147
0
19 Mar 2019
AttoNets: Compact and Efficient Deep Neural Networks for the Edge via Human-Machine Collaborative Design
A. Wong
Z. Q. Lin
Brendan Chwyl
HAI
11
14
0
18 Mar 2019
Cascaded Projection: End-to-End Network Compression and Acceleration
Breton L. Minnehan
Andreas E. Savakis
14
26
0
12 Mar 2019
Dynamic Multi-path Neural Network
Yingcheng Su
Shunfeng Zhou
Yichao Wu
Tian Su
Ding Liang
Xuebo Liu
Dixin Zheng
Yingxu Wang
Junjie Yan
Xiaolin Hu
11
1
0
28 Feb 2019
Low-bit Quantization of Neural Networks for Efficient Inference
Yoni Choukroun
Eli Kravchik
Fan Yang
P. Kisilev
MQ
16
355
0
18 Feb 2019
Mockingbird: Defending Against Deep-Learning-Based Website Fingerprinting Attacks with Adversarial Traces
Mohammad Saidur Rahman
Mohsen Imani
Nate Mathews
M. Wright
AAML
6
80
0
18 Feb 2019
AutoQ: Automated Kernel-Wise Neural Network Quantization
Qian Lou
Feng Guo
Lantao Liu
Minje Kim
Lei Jiang
MQ
16
97
0
15 Feb 2019
Understanding Chat Messages for Sticker Recommendation in Messaging Apps
Abhishek Laddha
Mohamed Hanoosh
Debdoot Mukherjee
Parth Patwa
Ankur Narang
14
17
0
07 Feb 2019
Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorization
Eldad Meller
Alexander Finkelstein
Uri Almog
Mark Grobman
MQ
13
85
0
05 Feb 2019
Towards Federated Learning at Scale: System Design
Keith Bonawitz
Hubert Eichner
W. Grieskamp
Dzmitry Huba
A. Ingerman
...
H. B. McMahan
Timon Van Overveldt
David Petrou
Daniel Ramage
Jason Roselander
FedML
17
2,629
0
04 Feb 2019
Improving Neural Network Quantization without Retraining using Outlier Channel Splitting
Ritchie Zhao
Yuwei Hu
Jordan Dotzel
Christopher De Sa
Zhiru Zhang
OODD
MQ
33
304
0
28 Jan 2019
QGAN: Quantized Generative Adversarial Networks
Peiqi Wang
Dongsheng Wang
Yu Ji
Xinfeng Xie
Haoxuan Song
XuXin Liu
Yongqiang Lyu
Yuan Xie
GAN
MQ
13
32
0
24 Jan 2019
Deep Neural Network Approximation for Custom Hardware: Where We've Been, Where We're Going
Erwei Wang
James J. Davis
Ruizhe Zhao
Ho-Cheung Ng
Xinyu Niu
Wayne Luk
P. Cheung
G. Constantinides
19
59
0
21 Jan 2019
DSConv: Efficient Convolution Operator
Marcelo Gennari
Roger Fawcett
V. Prisacariu
MQ
24
62
0
07 Jan 2019
Dataflow-based Joint Quantization of Weights and Activations for Deep Neural Networks
Xue Geng
Jie Fu
Bin Zhao
Jie Lin
M. Aly
C. Pal
V. Chandrasekhar
MQ
16
5
0
04 Jan 2019
Dynamic Runtime Feature Map Pruning
Tailin Liang
Lei Wang
Shaobo Shi
C. Glossner
3DPC
14
8
0
24 Dec 2018
Precision Highway for Ultra Low-Precision Quantization
Eunhyeok Park
Dongyoung Kim
S. Yoo
Peter Vajda
MQ
AI4TS
10
12
0
24 Dec 2018
SQuantizer: Simultaneous Learning for Both Sparse and Low-precision Neural Networks
M. Park
Xiaofang Xu
C. Brick
MQ
19
8
0
20 Dec 2018
Fast Adjustable Threshold For Uniform Neural Network Quantization (Winning solution of LPIRC-II)
A. Goncharenko
Andrey Denisov
S. Alyamkin
Evgeny Terentev
MQ
12
20
0
19 Dec 2018
Exploiting Kernel Sparsity and Entropy for Interpretable CNN Compression
Yuchao Li
Shaohui Lin
Baochang Zhang
Jianzhuang Liu
David Doermann
Yongjian Wu
Feiyue Huang
R. Ji
40
130
0
11 Dec 2018
Efficient and Robust Machine Learning for Real-World Systems
Franz Pernkopf
Wolfgang Roth
Matthias Zöhrer
Lukas Pfeifenberger
Günther Schindler
Holger Froening
Sebastian Tschiatschek
Robert Peharz
Matthew Mattina
Zoubin Ghahramani
OOD
24
1
0
05 Dec 2018
Efficient non-uniform quantizer for quantized neural network targeting reconfigurable hardware
Natan Liss
Chaim Baskin
A. Mendelson
A. Bronstein
Raja Giryes
MQ
19
5
0
27 Nov 2018
On Periodic Functions as Regularizers for Quantization of Neural Networks
Maxim Naumov
Utku Diril
Jongsoo Park
Benjamin Ray
Jedrzej Jablonski
Andrew Tulloch
MQ
6
25
0
24 Nov 2018
Structured Binary Neural Networks for Accurate Image Classification and Semantic Segmentation
Bohan Zhuang
Chunhua Shen
Mingkui Tan
Lingqiao Liu
Ian Reid
MQ
27
152
0
22 Nov 2018
HAQ: Hardware-Aware Automated Quantization with Mixed Precision
Kuan-Chieh Jackson Wang
Zhijian Liu
Yujun Lin
Ji Lin
Song Han
MQ
35
872
0
21 Nov 2018
Fast On-the-fly Retraining-free Sparsification of Convolutional Neural Networks
Amir H. Ashouri
T. Abdelrahman
Alwyn Dos Remedios
MQ
11
12
0
10 Nov 2018
Dynamic Representations Toward Efficient Inference on Deep Neural Networks by Decision Gates
Mohammad Saeed Shafiee
M. Shafiee
A. Wong
AI4CE
12
4
0
05 Nov 2018
Rethinking floating point for deep learning
Jeff Johnson
MQ
11
135
0
01 Nov 2018
A Hitchhiker's Guide On Distributed Training of Deep Neural Networks
K. Chahal
Manraj Singh Grover
Kuntal Dey
3DH
OOD
4
53
0
28 Oct 2018
Relaxed Quantization for Discretized Neural Networks
Christos Louizos
M. Reisser
Tijmen Blankevoort
E. Gavves
Max Welling
MQ
27
131
0
03 Oct 2018
2018 Low-Power Image Recognition Challenge
S. Alyamkin
M. Ardi
Achille Brighton
Alexander C. Berg
Yiran Chen
...
George K. Thiruvathukal
Baiwu Zhang
Jingchi Zhang
Xiaopeng Zhang
Shaojie Zhuo
BDL
15
13
0
03 Oct 2018
Post-training 4-bit quantization of convolution networks for rapid-deployment
Ron Banner
Yury Nahshan
Elad Hoffer
Daniel Soudry
MQ
16
93
0
02 Oct 2018
AI Benchmark: Running Deep Neural Networks on Android Smartphones
Andrey D. Ignatov
Radu Timofte
William Chou
Ke Wang
Max Wu
Tim Hartley
Luc Van Gool
ELM
11
320
0
02 Oct 2018
NICE: Noise Injection and Clamping Estimation for Neural Network Quantization
Chaim Baskin
Natan Liss
Yoav Chai
Evgenii Zheltonozhskii
Eli Schwartz
Raja Giryes
A. Mendelson
A. Bronstein
MQ
9
60
0
29 Sep 2018
FermiNets: Learning generative machines to generate efficient neural networks via generative synthesis
A. Wong
M. Shafiee
Brendan Chwyl
Francis Li
8
64
0
17 Sep 2018
Hardware-Aware Machine Learning: Modeling and Optimization
Diana Marculescu
Dimitrios Stamoulis
E. Cai
8
45
0
14 Sep 2018
Discretely Relaxing Continuous Variables for tractable Variational Inference
Trefor W. Evans
P. Nair
BDL
47
0
0
12 Sep 2018
Discovering Low-Precision Networks Close to Full-Precision Networks for Efficient Embedded Inference
J. McKinstry
S. K. Esser
R. Appuswamy
Deepika Bablani
John V. Arthur
Izzet B. Yildiz
D. Modha
MQ
13
94
0
11 Sep 2018
DeepHunter: Hunting Deep Neural Network Defects via Coverage-Guided Fuzzing
Xiaofei Xie
L. Ma
Felix Juefei Xu
Hongxu Chen
Minhui Xue
Bo-wen Li
Yang Liu
Jianjun Zhao
Jianxiong Yin
Simon See
32
40
0
04 Sep 2018
Training Compact Neural Networks with Binary Weights and Low Precision Activations
Bohan Zhuang
Chunhua Shen
Ian Reid
MQ
13
14
0
08 Aug 2018
MnasNet: Platform-Aware Neural Architecture Search for Mobile
Mingxing Tan
Bo Chen
Ruoming Pang
Vijay Vasudevan
Mark Sandler
Andrew G. Howard
Quoc V. Le
MQ
31
2,980
0
31 Jul 2018
Learning K-way D-dimensional Discrete Codes for Compact Embedding Representations
Ting-Li Chen
Martin Renqiang Min
Yizhou Sun
21
70
0
21 Jun 2018
Quantizing deep convolutional networks for efficient inference: A whitepaper
Raghuraman Krishnamoorthi
MQ
16
990
0
21 Jun 2018
Quantizing Convolutional Neural Networks for Low-Power High-Throughput Inference Engines
S. Settle
Manasa Bollavaram
P. D'Alberto
Elliott Delaye
Oscar Fernández
Nicholas J. Fraser
A. Ng
Ashish Sirasao
Michael Wu
MQ
16
20
0
21 May 2018
MobileFaceNets: Efficient CNNs for Accurate Real-Time Face Verification on Mobile Devices
Sheng Chen
Yang Liu
Xiang Gao
Zhen Han
CVBM
3DH
16
557
0
20 Apr 2018
DPRed: Making Typical Activation and Weight Values Matter In Deep Learning Computing
A. Delmas
Sayeh Sharify
Patrick Judd
Kevin Siu
Milos Nikolic
Andreas Moshovos
MQ
24
3
0
17 Apr 2018