Fixed Point Quantization of Deep Convolutional Networks

19 November 2015
D. Lin
S. Talathi
V. Annapureddy
    MQ

Papers citing "Fixed Point Quantization of Deep Convolutional Networks"

49 / 99 papers shown
DualDE: Dually Distilling Knowledge Graph Embedding for Faster and Cheaper Reasoning
Yushan Zhu
Wen Zhang
Mingyang Chen
Hui Chen
Xu-Xin Cheng
Wei Zhang
Huajun Chen (Zhejiang University)
8
27
0
13 Sep 2020
Term Revealing: Furthering Quantization at Run Time on Quantized DNNs
H. T. Kung
Bradley McDanel
S. Zhang
MQ
13
9
0
13 Jul 2020
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
Itay Hubara
Yury Nahshan
Y. Hanani
Ron Banner
Daniel Soudry
MQ
18
122
0
14 Jun 2020
Real-Time Video Inference on Edge Devices via Adaptive Model Streaming
Mehrdad Khani Shirkoohi
Pouya Hamadanian
Arash Nasr-Esfahany
Mohammad Alizadeh
21
45
0
11 Jun 2020
An Overview of Neural Network Compression
James O'Neill
AI4CE
42
98
0
05 Jun 2020
Quantized Neural Networks: Characterization and Holistic Optimization
Yoonho Boo
Sungho Shin
Wonyong Sung
MQ
37
8
0
31 May 2020
Taurus: A Data Plane Architecture for Per-Packet ML
Tushar Swamy
Alexander Rucker
M. Shahbaz
Ishan Gaur
K. Olukotun
13
82
0
12 Feb 2020
Compact recurrent neural networks for acoustic event detection on low-energy low-complexity platforms
G. Cerutti
Rahul Prasad
A. Brutti
Elisabetta Farella
13
47
0
29 Jan 2020
Convolutional-Recurrent Neural Networks on Low-Power Wearable Platforms for Cardiac Arrhythmia Detection
Antonino Faraone
R. Delgado-Gonzalo
6
24
0
08 Jan 2020
Sparse Weight Activation Training
Md Aamir Raihan
Tor M. Aamodt
32
72
0
07 Jan 2020
Adaptive Loss-aware Quantization for Multi-bit Networks
Zhongnan Qu
Zimu Zhou
Yun Cheng
Lothar Thiele
MQ
25
53
0
18 Dec 2019
QKD: Quantization-aware Knowledge Distillation
Jangho Kim
Yash Bhalgat
Jinwon Lee
Chirag I. Patel
Nojun Kwak
MQ
16
63
0
28 Nov 2019
Loss Aware Post-training Quantization
Yury Nahshan
Brian Chmiel
Chaim Baskin
Evgenii Zheltonozhskii
Ron Banner
A. Bronstein
A. Mendelson
MQ
17
163
0
17 Nov 2019
S2DNAS: Transforming Static CNN Model for Dynamic Inference via Neural Architecture Search
Zhihang Yuan
Bingzhe Wu
Zheng Liang
Shiwan Zhao
Weichen Bi
Guangyu Sun
19
30
0
16 Nov 2019
XNOR-Net++: Improved Binary Neural Networks
Adrian Bulat
Georgios Tzimiropoulos
MQ
13
200
0
30 Sep 2019
REQ-YOLO: A Resource-Aware, Efficient Quantization Framework for Object Detection on FPGAs
Caiwen Ding
Shuo Wang
Ning Liu
Kaidi Xu
Yanzhi Wang
Yun Liang
MQ
11
89
0
29 Sep 2019
Point-Voxel CNN for Efficient 3D Deep Learning
Zhijian Liu
Haotian Tang
Yujun Lin
Song Han
3DPC
15
659
0
08 Jul 2019
Memory-Driven Mixed Low Precision Quantization For Enabling Deep Network Inference On Microcontrollers
Manuele Rusci
Alessandro Capotondi
Luca Benini
MQ
9
74
0
30 May 2019
Encrypted Speech Recognition using Deep Polynomial Networks
Shi-Xiong Zhang
Y. Gong
Dong Yu
11
25
0
11 May 2019
Toward Extremely Low Bit and Lossless Accuracy in DNNs with Progressive ADMM
Sheng Lin
Xiaolong Ma
Shaokai Ye
Geng Yuan
Kaisheng Ma
Yanzhi Wang
MQ
17
10
0
02 May 2019
Knowledge Distillation For Recurrent Neural Network Language Modeling With Trust Regularization
Yangyang Shi
M. Hwang
X. Lei
Haoyu Sheng
26
25
0
08 Apr 2019
Progressive Stochastic Binarization of Deep Networks
David Hartmann
Michael Wand
MQ
12
1
0
03 Apr 2019
Progressive DNN Compression: A Key to Achieve Ultra-High Weight Pruning and Quantization Rates using ADMM
Shaokai Ye
Xiaoyu Feng
Tianyun Zhang
Xiaolong Ma
Sheng Lin
...
Jian Tang
M. Fardad
X. Lin
Yongpan Liu
Yanzhi Wang
MQ
24
38
0
23 Mar 2019
Low-bit Quantization of Neural Networks for Efficient Inference
Yoni Choukroun
Eli Kravchik
Fan Yang
P. Kisilev
MQ
14
355
0
18 Feb 2019
AutoQ: Automated Kernel-Wise Neural Network Quantization
Qian Lou
Feng Guo
Lantao Liu
Minje Kim
Lei Jiang
MQ
11
97
0
15 Feb 2019
ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers
Ao Ren
Tianyun Zhang
Shaokai Ye
Jiayu Li
Wenyao Xu
Xuehai Qian
X. Lin
Yanzhi Wang
MQ
24
162
0
31 Dec 2018
E-RNN: Design Optimization for Efficient Recurrent Neural Networks in FPGAs
Zhe Li
Caiwen Ding
Siyue Wang
Wujie Wen
Youwei Zhuo
...
Qinru Qiu
Wenyao Xu
X. Lin
Xuehai Qian
Yanzhi Wang
MQ
7
64
0
12 Dec 2018
QUENN: QUantization Engine for low-power Neural Networks
Miguel de Prado
Maurizio Denna
Luca Benini
Nuria Pazos
MQ
24
14
0
14 Nov 2018
Convolutional Neural Network Quantization using Generalized Gamma Distribution
Doyun Kim
H. Yim
Sanghyuck Ha
Changgwun Lee
Inyup Kang
MQ
19
4
0
31 Oct 2018
Progressive Weight Pruning of Deep Neural Networks using ADMM
Shaokai Ye
Tianyun Zhang
Kaiqi Zhang
Jiayu Li
Kaidi Xu
...
M. Fardad
Sijia Liu
Xiang Chen
X. Lin
Yanzhi Wang
AI4CE
23
38
0
17 Oct 2018
Quantization for Rapid Deployment of Deep Neural Networks
J. Lee
Sangwon Ha
Saerom Choi
Won-Jo Lee
Seungwon Lee
MQ
14
48
0
12 Oct 2018
Hierarchical binary CNNs for landmark localization with limited resources
Adrian Bulat
Georgios Tzimiropoulos
CVBM
3DV
15
36
0
14 Aug 2018
A Survey on Methods and Theories of Quantized Neural Networks
Yunhui Guo
MQ
27
230
0
13 Aug 2018
Binary Ensemble Neural Network: More Bits per Network or More Networks per Bit?
Shilin Zhu
Xin Dong
Hao Su
MQ
14
135
0
20 Jun 2018
Adding New Tasks to a Single Network with Weight Transformations using Binary Masks
Massimiliano Mancini
Elisa Ricci
Barbara Caputo
Samuel Rota Buló
9
51
0
28 May 2018
Accelerating CNN inference on FPGAs: A Survey
K. Abdelouahab
Maxime Pelcat
Jocelyn Serot
F. Berry
AI4CE
16
147
0
26 May 2018
Low-Precision Floating-Point Schemes for Neural Network Training
Marc Ortiz
A. Cristal
Eduard Ayguadé
Marc Casas
MQ
12
22
0
14 Apr 2018
Loss-aware Weight Quantization of Deep Networks
Lu Hou
James T. Kwok
MQ
15
127
0
23 Feb 2018
Deep Learning as a Mixed Convex-Combinatorial Optimization Problem
A. Friesen
Pedro M. Domingos
18
20
0
31 Oct 2017
Network Sketching: Exploiting Binary Structure in Deep CNNs
Yiwen Guo
Anbang Yao
Hao Zhao
Yurong Chen
MQ
23
95
0
07 Jun 2017
Bayesian Compression for Deep Learning
Christos Louizos
Karen Ullrich
Max Welling
UQCV
BDL
15
479
0
24 May 2017
Binarized Convolutional Landmark Localizers for Human Pose Estimation and Face Alignment with Limited Resources
Adrian Bulat
Georgios Tzimiropoulos
CVBM
3DV
24
190
0
02 Mar 2017
Fixed-point optimization of deep neural networks with adaptive step size retraining
Sungho Shin
Yoonho Boo
Wonyong Sung
MQ
24
34
0
27 Feb 2017
Deep Learning with Low Precision by Half-wave Gaussian Quantization
Zhaowei Cai
Xiaodong He
Jian Sun
Nuno Vasconcelos
MQ
19
502
0
03 Feb 2017
Towards the Limit of Network Quantization
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
MQ
16
191
0
05 Dec 2016
Scalable Compression of Deep Neural Networks
Xing Wang
Jie Liang
16
4
0
26 Aug 2016
Overcoming Challenges in Fixed Point Training of Deep Convolutional Networks
D. Lin
S. Talathi
23
45
0
08 Jul 2016
Ristretto: Hardware-Oriented Approximation of Convolutional Neural Networks
Philipp Gysel
24
127
0
20 May 2016
Computational Cost Reduction in Learned Transform Classifications
E. Machado
C. Miosso
R. V. Borries
Murilo Coutinho
P. Berger
Thiago Marques
R. Jacobi
30
3
0
26 Apr 2015