Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference

15 December 2017
Benoit Jacob
S. Kligys
Bo Chen
Menglong Zhu
Matthew Tang
Andrew G. Howard
Hartwig Adam
Dmitry Kalenichenko
    MQ

Papers citing "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference"

50 / 1,255 papers shown
Patchwork: A Patch-wise Attention Network for Efficient Object Detection and Segmentation in Video Streams
Yuning Chai
VOS
13
30
0
03 Apr 2019
Training Quantized Neural Networks with a Full-precision Auxiliary Module
Bohan Zhuang
Lingqiao Liu
Mingkui Tan
Chunhua Shen
Ian Reid
MQ
32
62
0
27 Mar 2019
Looking Fast and Slow: Memory-Guided Mobile Video Object Detection
Mason Liu
Menglong Zhu
Marie White
Yinxiao Li
Dmitry Kalenichenko
17
83
0
25 Mar 2019
Towards Optimal Structured CNN Pruning via Generative Adversarial Learning
Shaohui Lin
R. Ji
Chenqian Yan
Baochang Zhang
Liujuan Cao
QiXiang Ye
Feiyue Huang
David Doermann
CVBM
11
503
0
22 Mar 2019
Trained Quantization Thresholds for Accurate and Efficient Fixed-Point Inference of Deep Neural Networks
Sambhav R. Jain
Albert Gural
Michael Wu
Chris Dick
MQ
13
147
0
19 Mar 2019
AttoNets: Compact and Efficient Deep Neural Networks for the Edge via Human-Machine Collaborative Design
A. Wong
Z. Q. Lin
Brendan Chwyl
HAI
11
14
0
18 Mar 2019
Cascaded Projection: End-to-End Network Compression and Acceleration
Breton L. Minnehan
Andreas E. Savakis
14
26
0
12 Mar 2019
Dynamic Multi-path Neural Network
Yingcheng Su
Shunfeng Zhou
Yichao Wu
Tian Su
Ding Liang
Xuebo Liu
Dixin Zheng
Yingxu Wang
Junjie Yan
Xiaolin Hu
11
1
0
28 Feb 2019
Low-bit Quantization of Neural Networks for Efficient Inference
Yoni Choukroun
Eli Kravchik
Fan Yang
P. Kisilev
MQ
16
355
0
18 Feb 2019
Mockingbird: Defending Against Deep-Learning-Based Website Fingerprinting Attacks with Adversarial Traces
Mohammad Saidur Rahman
Mohsen Imani
Nate Mathews
M. Wright
AAML
6
80
0
18 Feb 2019
AutoQ: Automated Kernel-Wise Neural Network Quantization
Qian Lou
Feng Guo
Lantao Liu
Minje Kim
Lei Jiang
MQ
16
97
0
15 Feb 2019
Understanding Chat Messages for Sticker Recommendation in Messaging Apps
Abhishek Laddha
Mohamed Hanoosh
Debdoot Mukherjee
Parth Patwa
Ankur Narang
14
17
0
07 Feb 2019
Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorization
Eldad Meller
Alexander Finkelstein
Uri Almog
Mark Grobman
MQ
13
85
0
05 Feb 2019
Towards Federated Learning at Scale: System Design
Keith Bonawitz
Hubert Eichner
W. Grieskamp
Dzmitry Huba
A. Ingerman
...
H. B. McMahan
Timon Van Overveldt
David Petrou
Daniel Ramage
Jason Roselander
FedML
17
2,629
0
04 Feb 2019
Improving Neural Network Quantization without Retraining using Outlier Channel Splitting
Ritchie Zhao
Yuwei Hu
Jordan Dotzel
Christopher De Sa
Zhiru Zhang
OODD
MQ
33
304
0
28 Jan 2019
QGAN: Quantized Generative Adversarial Networks
Peiqi Wang
Dongsheng Wang
Yu Ji
Xinfeng Xie
Haoxuan Song
XuXin Liu
Yongqiang Lyu
Yuan Xie
GAN
MQ
13
32
0
24 Jan 2019
Deep Neural Network Approximation for Custom Hardware: Where We've Been, Where We're Going
Erwei Wang
James J. Davis
Ruizhe Zhao
Ho-Cheung Ng
Xinyu Niu
Wayne Luk
P. Cheung
G. Constantinides
19
59
0
21 Jan 2019
DSConv: Efficient Convolution Operator
Marcelo Gennari
Roger Fawcett
V. Prisacariu
MQ
24
62
0
07 Jan 2019
Dataflow-based Joint Quantization of Weights and Activations for Deep Neural Networks
Xue Geng
Jie Fu
Bin Zhao
Jie Lin
M. Aly
C. Pal
V. Chandrasekhar
MQ
16
5
0
04 Jan 2019
Dynamic Runtime Feature Map Pruning
Tailin Liang
Lei Wang
Shaobo Shi
C. Glossner
3DPC
14
8
0
24 Dec 2018
Precision Highway for Ultra Low-Precision Quantization
Eunhyeok Park
Dongyoung Kim
S. Yoo
Peter Vajda
MQ
AI4TS
10
12
0
24 Dec 2018
SQuantizer: Simultaneous Learning for Both Sparse and Low-precision Neural Networks
M. Park
Xiaofang Xu
C. Brick
MQ
19
8
0
20 Dec 2018
Fast Adjustable Threshold For Uniform Neural Network Quantization (Winning solution of LPIRC-II)
A. Goncharenko
Andrey Denisov
S. Alyamkin
Evgeny Terentev
MQ
12
20
0
19 Dec 2018
Exploiting Kernel Sparsity and Entropy for Interpretable CNN Compression
Yuchao Li
Shaohui Lin
Baochang Zhang
Jianzhuang Liu
David Doermann
Yongjian Wu
Feiyue Huang
R. Ji
40
130
0
11 Dec 2018
Efficient and Robust Machine Learning for Real-World Systems
Franz Pernkopf
Wolfgang Roth
Matthias Zöhrer
Lukas Pfeifenberger
Günther Schindler
Holger Froening
Sebastian Tschiatschek
Robert Peharz
Matthew Mattina
Zoubin Ghahramani
OOD
24
1
0
05 Dec 2018
Efficient non-uniform quantizer for quantized neural network targeting reconfigurable hardware
Natan Liss
Chaim Baskin
A. Mendelson
A. Bronstein
Raja Giryes
MQ
19
5
0
27 Nov 2018
On Periodic Functions as Regularizers for Quantization of Neural Networks
Maxim Naumov
Utku Diril
Jongsoo Park
Benjamin Ray
Jedrzej Jablonski
Andrew Tulloch
MQ
6
25
0
24 Nov 2018
Structured Binary Neural Networks for Accurate Image Classification and Semantic Segmentation
Bohan Zhuang
Chunhua Shen
Mingkui Tan
Lingqiao Liu
Ian Reid
MQ
27
152
0
22 Nov 2018
HAQ: Hardware-Aware Automated Quantization with Mixed Precision
Kuan-Chieh Jackson Wang
Zhijian Liu
Yujun Lin
Ji Lin
Song Han
MQ
35
872
0
21 Nov 2018
Fast On-the-fly Retraining-free Sparsification of Convolutional Neural Networks
Amir H. Ashouri
T. Abdelrahman
Alwyn Dos Remedios
MQ
11
12
0
10 Nov 2018
Dynamic Representations Toward Efficient Inference on Deep Neural Networks by Decision Gates
Mohammad Saeed Shafiee
M. Shafiee
A. Wong
AI4CE
12
4
0
05 Nov 2018
Rethinking floating point for deep learning
Jeff Johnson
MQ
11
135
0
01 Nov 2018
A Hitchhiker's Guide On Distributed Training of Deep Neural Networks
K. Chahal
Manraj Singh Grover
Kuntal Dey
3DH
OOD
4
53
0
28 Oct 2018
Relaxed Quantization for Discretized Neural Networks
Christos Louizos
M. Reisser
Tijmen Blankevoort
E. Gavves
Max Welling
MQ
27
131
0
03 Oct 2018
2018 Low-Power Image Recognition Challenge
S. Alyamkin
M. Ardi
Achille Brighton
Alexander C. Berg
Yiran Chen
...
George K. Thiruvathukal
Baiwu Zhang
Jingchi Zhang
Xiaopeng Zhang
Shaojie Zhuo
BDL
15
13
0
03 Oct 2018
Post-training 4-bit quantization of convolution networks for rapid-deployment
Ron Banner
Yury Nahshan
Elad Hoffer
Daniel Soudry
MQ
16
93
0
02 Oct 2018
AI Benchmark: Running Deep Neural Networks on Android Smartphones
Andrey D. Ignatov
Radu Timofte
William Chou
Ke Wang
Max Wu
Tim Hartley
Luc Van Gool
ELM
11
320
0
02 Oct 2018
NICE: Noise Injection and Clamping Estimation for Neural Network Quantization
Chaim Baskin
Natan Liss
Yoav Chai
Evgenii Zheltonozhskii
Eli Schwartz
Raja Giryes
A. Mendelson
A. Bronstein
MQ
9
60
0
29 Sep 2018
FermiNets: Learning generative machines to generate efficient neural networks via generative synthesis
A. Wong
M. Shafiee
Brendan Chwyl
Francis Li
8
64
0
17 Sep 2018
Hardware-Aware Machine Learning: Modeling and Optimization
Diana Marculescu
Dimitrios Stamoulis
E. Cai
8
45
0
14 Sep 2018
Discretely Relaxing Continuous Variables for tractable Variational Inference
Trefor W. Evans
P. Nair
BDL
47
0
0
12 Sep 2018
Discovering Low-Precision Networks Close to Full-Precision Networks for Efficient Embedded Inference
J. McKinstry
S. K. Esser
R. Appuswamy
Deepika Bablani
John V. Arthur
Izzet B. Yildiz
D. Modha
MQ
13
94
0
11 Sep 2018
DeepHunter: Hunting Deep Neural Network Defects via Coverage-Guided Fuzzing
Xiaofei Xie
L. Ma
Felix Juefei Xu
Hongxu Chen
Minhui Xue
Bo-wen Li
Yang Liu
Jianjun Zhao
Jianxiong Yin
Simon See
32
40
0
04 Sep 2018
Training Compact Neural Networks with Binary Weights and Low Precision Activations
Bohan Zhuang
Chunhua Shen
Ian Reid
MQ
13
14
0
08 Aug 2018
MnasNet: Platform-Aware Neural Architecture Search for Mobile
Mingxing Tan
Bo Chen
Ruoming Pang
Vijay Vasudevan
Mark Sandler
Andrew G. Howard
Quoc V. Le
MQ
31
2,980
0
31 Jul 2018
Learning K-way D-dimensional Discrete Codes for Compact Embedding Representations
Ting-Li Chen
Martin Renqiang Min
Yizhou Sun
21
70
0
21 Jun 2018
Quantizing deep convolutional networks for efficient inference: A whitepaper
Raghuraman Krishnamoorthi
MQ
16
990
0
21 Jun 2018
Quantizing Convolutional Neural Networks for Low-Power High-Throughput Inference Engines
S. Settle
Manasa Bollavaram
P. D'Alberto
Elliott Delaye
Oscar Fernández
Nicholas J. Fraser
A. Ng
Ashish Sirasao
Michael Wu
MQ
16
20
0
21 May 2018
MobileFaceNets: Efficient CNNs for Accurate Real-Time Face Verification on Mobile Devices
Sheng Chen
Yang Liu
Xiang Gao
Zhen Han
CVBM
3DH
16
557
0
20 Apr 2018
DPRed: Making Typical Activation and Weight Values Matter In Deep Learning Computing
A. Delmas
Sayeh Sharify
Patrick Judd
Kevin Siu
Milos Nikolic
Andreas Moshovos
MQ
24
3
0
17 Apr 2018