ResearchTrend.AI
arXiv:1612.01064
Trained Ternary Quantization

4 December 2016
Chenzhuo Zhu
Song Han
Huizi Mao
W. Dally
    MQ

Papers citing "Trained Ternary Quantization"

50 / 509 papers shown
DeepTwist: Learning Model Compression via Occasional Weight Distortion
Dongsoo Lee
Parichay Kapoor
Byeongwook Kim
22
19
0
30 Oct 2018
Discrimination-aware Channel Pruning for Deep Neural Networks
Zhuangwei Zhuang
Mingkui Tan
Bohan Zhuang
Jing Liu
Yong Guo
Qingyao Wu
Junzhou Huang
Jin-Hui Zhu
9
593
0
28 Oct 2018
Progressive Weight Pruning of Deep Neural Networks using ADMM
Shaokai Ye
Tianyun Zhang
Kaiqi Zhang
Jiayu Li
Kaidi Xu
...
M. Fardad
Sijia Liu
Xiang Chen
X. Lin
Yanzhi Wang
AI4CE
26
38
0
17 Oct 2018
Training Deep Neural Network in Limited Precision
Hyunsun Park
J. Lee
Youngmin Oh
Sangwon Ha
Seungwon Lee
16
8
0
12 Oct 2018
Towards Fast and Energy-Efficient Binarized Neural Network Inference on FPGA
Cheng Fu
Shilin Zhu
Hao Su
Ching-En Lee
Jishen Zhao
MQ
15
31
0
04 Oct 2018
Relaxed Quantization for Discretized Neural Networks
Christos Louizos
M. Reisser
Tijmen Blankevoort
E. Gavves
Max Welling
MQ
25
131
0
03 Oct 2018
LIT: Block-wise Intermediate Representation Training for Model Compression
Animesh Koratana
Daniel Kang
Peter Bailis
Matei A. Zaharia
6
12
0
02 Oct 2018
Simultaneously Optimizing Weight and Quantizer of Ternary Neural Network using Truncated Gaussian Approximation
Zhezhi He
Deliang Fan
MQ
13
66
0
02 Oct 2018
ProxQuant: Quantized Neural Networks via Proximal Operators
Yu Bai
Yu-Xiang Wang
Edo Liberty
MQ
11
117
0
01 Oct 2018
NICE: Noise Injection and Clamping Estimation for Neural Network Quantization
Chaim Baskin
Natan Liss
Yoav Chai
Evgenii Zheltonozhskii
Eli Schwartz
Raja Giryes
A. Mendelson
A. Bronstein
MQ
9
60
0
29 Sep 2018
Learning Recurrent Binary/Ternary Weights
A. Ardakani
Zhengyun Ji
S. C. Smithson
B. Meyer
W. Gross
MQ
4
27
0
28 Sep 2018
Scalar Arithmetic Multiple Data: Customizable Precision for Deep Neural Networks
Andrew Anderson
David Gregg
MQ
11
1
0
27 Sep 2018
Characterising Across-Stack Optimisations for Deep Convolutional Neural Networks
Jack Turner
José Cano
Valentin Radu
Elliot J. Crowley
Michael F. P. O'Boyle
Amos Storkey
16
40
0
19 Sep 2018
DeepHunter: Hunting Deep Neural Network Defects via Coverage-Guided Fuzzing
Xiaofei Xie
L. Ma
Felix Juefei Xu
Hongxu Chen
Minhui Xue
Bo-wen Li
Yang Liu
Jianjun Zhao
Jianxiong Yin
Simon See
29
40
0
04 Sep 2018
Learning Sparse Low-Precision Neural Networks With Learnable Regularization
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
MQ
14
31
0
01 Sep 2018
Asymptotic Soft Filter Pruning for Deep Convolutional Neural Networks
Yang He
Xuanyi Dong
Guoliang Kang
Yanwei Fu
C. Yan
Yi Yang
35
134
0
22 Aug 2018
Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks
Yang He
Guoliang Kang
Xuanyi Dong
Yanwei Fu
Yi Yang
AAML
VLM
11
954
0
21 Aug 2018
Learning to Quantize Deep Networks by Optimizing Quantization Intervals with Task Loss
S. Jung
Changyong Son
Seohyung Lee
JinWoo Son
Youngjun Kwak
Jae-Joon Han
Sung Ju Hwang
Changkyu Choi
MQ
14
372
0
17 Aug 2018
A Survey on Methods and Theories of Quantized Neural Networks
Yunhui Guo
MQ
27
230
0
13 Aug 2018
Training Compact Neural Networks with Binary Weights and Low Precision Activations
Bohan Zhuang
Chunhua Shen
Ian Reid
MQ
13
14
0
08 Aug 2018
Design Flow of Accelerating Hybrid Extremely Low Bit-width Neural Network in Embedded FPGA
Junsong Wang
Qiuwen Lou
Xiaofan Zhang
Chao Zhu
Yonghua Lin
Deming Chen
MQ
23
93
0
31 Jul 2018
LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks
Dongqing Zhang
Jiaolong Yang
Dongqiangzi Ye
G. Hua
MQ
9
696
0
26 Jul 2018
Optimize Deep Convolutional Neural Network with Ternarized Weights and High Accuracy
Zhezhi He
Boqing Gong
Deliang Fan
11
22
0
20 Jul 2018
Bridging the Accuracy Gap for 2-bit Quantized Neural Networks (QNN)
Jungwook Choi
P. Chuang
Zhuo Wang
Swagath Venkataramani
Vijayalakshmi Srinivasan
K. Gopalakrishnan
MQ
11
75
0
17 Jul 2018
FINN-L: Library Extensions and Design Trade-off Analysis for Variable Precision LSTM Networks on FPGAs
Vladimir Rybalkin
Alessandro Pappalardo
M. M. Ghaffar
Giulio Gambardella
Norbert Wehn
Michaela Blott
11
72
0
11 Jul 2018
Auto Deep Compression by Reinforcement Learning Based Actor-Critic Structure
Hamed Hakkak
OffRL
AI4CE
13
1
0
08 Jul 2018
Stochastic Layer-Wise Precision in Deep Neural Networks
Griffin Lacey
Graham W. Taylor
S. Areibi
13
18
0
03 Jul 2018
SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks
Julian Faraone
Nicholas J. Fraser
Michaela Blott
Philip H. W. Leong
MQ
14
133
0
01 Jul 2018
Binary Ensemble Neural Network: More Bits per Network or More Networks per Bit?
Shilin Zhu
Xin Dong
Hao Su
MQ
14
135
0
20 Jun 2018
Exploration of Low Numeric Precision Deep Learning Inference Using Intel FPGAs
Philip Colangelo
Nasibeh Nasiri
Asit K. Mishra
Eriko Nurvitadhi
M. Margala
Kevin Nealis
MQ
11
1
0
12 Jun 2018
TAPAS: Tricks to Accelerate (encrypted) Prediction As a Service
Amartya Sanyal
Matt J. Kusner
Adria Gascon
Varun Kanade
FedML
11
125
0
09 Jun 2018
IGCV3: Interleaved Low-Rank Group Convolutions for Efficient Deep Neural Networks
Ke Sun
Mingjie Li
Dong Liu
Jingdong Wang
29
126
0
01 Jun 2018
MPDCompress - Matrix Permutation Decomposition Algorithm for Deep Neural Network Compression
Lazar Supic
R. Naous
Ranko Sredojevic
Aleksandra Faust
Vladimir M. Stojanović
17
4
0
30 May 2018
Retraining-Based Iterative Weight Quantization for Deep Neural Networks
Dongsoo Lee
Byeongwook Kim
MQ
20
16
0
29 May 2018
Accelerating CNN inference on FPGAs: A Survey
K. Abdelouahab
Maxime Pelcat
Jocelyn Serot
F. Berry
AI4CE
19
147
0
26 May 2018
Tensorial Neural Networks: Generalization of Neural Networks and Application to Model Compression
Jiahao Su
Jingling Li
Bobby Bhattacharjee
Furong Huang
14
20
0
25 May 2018
DEEPEYE: A Compact and Accurate Video Comprehension at Terminal Devices Compressed with Quantization and Tensorization
Yuan-Chia Cheng
Guangya Li
Hai-Bao Chen
S. Tan
Hao Yu
4
3
0
21 May 2018
PACT: Parameterized Clipping Activation for Quantized Neural Networks
Jungwook Choi
Zhuo Wang
Swagath Venkataramani
P. Chuang
Vijayalakshmi Srinivasan
K. Gopalakrishnan
MQ
11
936
0
16 May 2018
UNIQ: Uniform Noise Injection for Non-Uniform Quantization of Neural Networks
Chaim Baskin
Eli Schwartz
Evgenii Zheltonozhskii
Natan Liss
Raja Giryes
A. Bronstein
A. Mendelson
MQ
4
45
0
29 Apr 2018
Low-memory convolutional neural networks through incremental depth-first processing
Jonathan Binas
Yoshua Bengio
SupR
12
3
0
28 Apr 2018
Value-aware Quantization for Training and Inference of Neural Networks
Eunhyeok Park
S. Yoo
Peter Vajda
MQ
6
157
0
20 Apr 2018
UCNN: Exploiting Computational Reuse in Deep Neural Networks via Weight Repetition
Kartik Hegde
Jiyong Yu
R. Agrawal
Mengjia Yan
Michael Pellauer
Christopher W. Fletcher
9
165
0
18 Apr 2018
IGCV2: Interleaved Structured Sparse Convolutional Neural Networks
Guotian Xie
Jingdong Wang
Ting Zhang
Jianhuang Lai
Richang Hong
Guo-Jun Qi
8
104
0
17 Apr 2018
Fast inference of deep neural networks in FPGAs for particle physics
Javier Mauricio Duarte
Song Han
Philip C. Harris
S. Jindariani
E. Kreinar
...
J. Ngadiuba
M. Pierini
R. Rivera
N. Tran
Zhenbin Wu
AI4CE
75
386
0
16 Apr 2018
Hybrid Binary Networks: Optimizing for Accuracy, Efficiency and Memory
Ameya Prabhu
Vishal Batchu
Rohit Gajawada
Sri Aurobindo Munagala
A. Namboodiri
MQ
25
18
0
11 Apr 2018
Distribution-Aware Binarization of Neural Networks for Sketch Recognition
Ameya Prabhu
Vishal Batchu
Sri Aurobindo Munagala
Rohit Gajawada
A. Namboodiri
MQ
11
5
0
09 Apr 2018
Training DNNs with Hybrid Block Floating Point
M. Drumond
Tao R. Lin
Martin Jaggi
Babak Falsafi
17
94
0
04 Apr 2018
Adversarial Network Compression
Vasileios Belagiannis
Azade Farshad
Fabio Galasso
GAN
AAML
6
58
0
28 Mar 2018
SqueezeNext: Hardware-Aware Neural Network Design
A. Gholami
K. Kwon
Bichen Wu
Zizheng Tai
Xiangyu Yue
Peter H. Jin
Sicheng Zhao
Kurt Keutzer
14
295
0
23 Mar 2018
EVA²: Exploiting Temporal Redundancy in Live Computer Vision
Mark Buckler
Philip Bedoukian
Suren Jayasuriya
Adrian Sampson
33
75
0
16 Mar 2018