1612.01064
Cited By
Trained Ternary Quantization
4 December 2016
Chenzhuo Zhu
Song Han
Huizi Mao
W. Dally
MQ
Papers citing
"Trained Ternary Quantization"
50 / 509 papers shown
Title
HAWQ: Hessian AWare Quantization of Neural Networks with Mixed-Precision
Zhen Dong
Z. Yao
A. Gholami
Michael W. Mahoney
Kurt Keutzer
MQ
13
513
0
29 Apr 2019
Towards Efficient Model Compression via Learned Global Ranking
Ting-Wu Chin
Ruizhou Ding
Cha Zhang
Diana Marculescu
10
170
0
28 Apr 2019
SWALP: Stochastic Weight Averaging in Low-Precision Training
Guandao Yang
Tianyi Zhang
Polina Kirichenko
Junwen Bai
A. Wilson
Christopher De Sa
11
94
0
26 Apr 2019
$S^{2}$-LBI: Stochastic Split Linearized Bregman Iterations for Parsimonious Deep Learning
Yanwei Fu
Donghao Li
Xinwei Sun
Shun Zhang
Yizhou Wang
Y. Yao
17
0
0
24 Apr 2019
Low-Memory Neural Network Training: A Technical Report
N. Sohoni
Christopher R. Aberger
Megan Leszczynski
Jian Zhang
Christopher Ré
9
99
0
24 Apr 2019
SCANN: Synthesis of Compact and Accurate Neural Networks
Shayan Hassantabar
Zeyu Wang
N. Jha
4
37
0
19 Apr 2019
Defensive Quantization: When Efficiency Meets Robustness
Ji Lin
Chuang Gan
Song Han
MQ
21
201
0
17 Apr 2019
Knowledge Squeezed Adversarial Network Compression
Changyong Shu
Li Peng
Xie Yuan
Yanyun Qu
Longquan Dai
Lizhuang Ma
GAN
29
11
0
10 Apr 2019
C2S2: Cost-aware Channel Sparse Selection for Progressive Network Pruning
Chih-Yao Chiu
Hwann-Tzong Chen
Tyng-Luh Liu
6
0
0
06 Apr 2019
3DQ: Compact Quantized Neural Networks for Volumetric Whole Brain Segmentation
Magdalini Paschali
Stefano Gasperini
Abhijit Guha Roy
Michael Y.-S. Fang
Nassir Navab
14
19
0
05 Apr 2019
Progressive Stochastic Binarization of Deep Networks
David Hartmann
Michael Wand
MQ
12
1
0
03 Apr 2019
LUTNet: Rethinking Inference in FPGA Soft Logic
Erwei Wang
James J. Davis
P. Cheung
G. Constantinides
14
61
0
01 Apr 2019
Training Quantized Neural Networks with a Full-precision Auxiliary Module
Bohan Zhuang
Lingqiao Liu
Mingkui Tan
Chunhua Shen
Ian Reid
MQ
24
62
0
27 Mar 2019
Trained Quantization Thresholds for Accurate and Efficient Fixed-Point Inference of Deep Neural Networks
Sambhav R. Jain
Albert Gural
Michael Wu
Chris Dick
MQ
8
147
0
19 Mar 2019
Understanding Straight-Through Estimator in Training Activation Quantized Neural Nets
Penghang Yin
J. Lyu
Shuai Zhang
Stanley Osher
Y. Qi
Jack Xin
MQ
LLMSV
19
305
0
13 Mar 2019
Focused Quantization for Sparse CNNs
Yiren Zhao
Xitong Gao
Daniel Bates
Robert D. Mullins
Chengzhong Xu
MQ
15
26
0
07 Mar 2019
Ternary Hybrid Neural-Tree Networks for Highly Constrained IoT Applications
Dibakar Gope
Ganesh S. Dasika
Matthew Mattina
17
23
0
04 Mar 2019
Cluster Regularized Quantization for Deep Networks Compression
Yiming Hu
Jianquan Li
Xianlei Long
Shenhua Hu
Jiagang Zhu
Xingang Wang
Qingyi Gu
MQ
9
6
0
27 Feb 2019
Learned Step Size Quantization
S. K. Esser
J. McKinstry
Deepika Bablani
R. Appuswamy
D. Modha
MQ
9
774
0
21 Feb 2019
Mockingbird: Defending Against Deep-Learning-Based Website Fingerprinting Attacks with Adversarial Traces
Mohammad Saidur Rahman
Mohsen Imani
Nate Mathews
M. Wright
AAML
6
80
0
18 Feb 2019
Self-Binarizing Networks
Fayez Lahoud
R. Achanta
Pablo Márquez-Neila
Sabine Süsstrunk
MQ
11
23
0
02 Feb 2019
Hardware-Guided Symbiotic Training for Compact, Accurate, yet Execution-Efficient LSTM
Hongxu Yin
Guoyang Chen
Yingmin Li
Shuai Che
Weifeng Zhang
N. Jha
22
10
0
30 Jan 2019
Information-Theoretic Understanding of Population Risk Improvement with Model Compression
Yuheng Bu
Weihao Gao
Shaofeng Zou
V. Veeravalli
MedIm
6
15
0
27 Jan 2019
Really should we pruning after model be totally trained? Pruning based on a small amount of training
Li Yue
Zhao Weibin
Shang-Te Lin
VLM
9
5
0
24 Jan 2019
QGAN: Quantized Generative Adversarial Networks
Peiqi Wang
Dongsheng Wang
Yu Ji
Xinfeng Xie
Haoxuan Song
XuXin Liu
Yongqiang Lyu
Yuan Xie
GAN
MQ
11
32
0
24 Jan 2019
Deep Neural Network Approximation for Custom Hardware: Where We've Been, Where We're Going
Erwei Wang
James J. Davis
Ruizhe Zhao
Ho-Cheung Ng
Xinyu Niu
Wayne Luk
P. Cheung
G. Constantinides
11
59
0
21 Jan 2019
Accumulation Bit-Width Scaling For Ultra-Low Precision Training Of Deep Networks
Charbel Sakr
Naigang Wang
Chia-Yu Chen
Jungwook Choi
A. Agrawal
Naresh R Shanbhag
K. Gopalakrishnan
MQ
14
34
0
19 Jan 2019
DSConv: Efficient Convolution Operator
Marcelo Gennari
Roger Fawcett
V. Prisacariu
MQ
24
62
0
07 Jan 2019
Dataflow-based Joint Quantization of Weights and Activations for Deep Neural Networks
Xue Geng
Jie Fu
Bin Zhao
Jie Lin
M. Aly
C. Pal
V. Chandrasekhar
MQ
14
5
0
04 Jan 2019
Regularized Binary Network Training
Sajad Darabi
Mouloud Belbahri
Matthieu Courbariaux
V. Nia
MQ
21
32
0
31 Dec 2018
ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers
Ao Ren
Tianyun Zhang
Shaokai Ye
Jiayu Li
Wenyao Xu
Xuehai Qian
X. Lin
Yanzhi Wang
MQ
24
162
0
31 Dec 2018
A Survey of FPGA Based Deep Learning Accelerators: Challenges and Opportunities
Teng Wang
Chao Wang
Xuehai Zhou
Hua-ping Chen
17
34
0
25 Dec 2018
Precision Highway for Ultra Low-Precision Quantization
Eunhyeok Park
Dongyoung Kim
S. Yoo
Peter Vajda
MQ
AI4TS
8
12
0
24 Dec 2018
ChamNet: Towards Efficient Network Design through Platform-Aware Model Adaptation
Xiaoliang Dai
Peizhao Zhang
Bichen Wu
Hongxu Yin
Fei Sun
...
Yiming Wu
Yangqing Jia
Peter Vajda
M. Uyttendaele
N. Jha
11
272
0
21 Dec 2018
SQuantizer: Simultaneous Learning for Both Sparse and Low-precision Neural Networks
M. Park
Xiaofang Xu
C. Brick
MQ
17
8
0
20 Dec 2018
A Main/Subsidiary Network Framework for Simplifying Binary Neural Network
Yinghao Xu
Xin Dong
Yudian Li
Hao Su
14
27
0
11 Dec 2018
DNQ: Dynamic Network Quantization
Yuhui Xu
Shuai Zhang
Y. Qi
Jiaxian Guo
Weiyao Lin
H. Xiong
MQ
14
6
0
06 Dec 2018
Efficient and Robust Machine Learning for Real-World Systems
Franz Pernkopf
Wolfgang Roth
Matthias Zöhrer
Lukas Pfeifenberger
Günther Schindler
Holger Froening
Sebastian Tschiatschek
Robert Peharz
Matthew Mattina
Zoubin Ghahramani
OOD
13
1
0
05 Dec 2018
On Compressing U-net Using Knowledge Distillation
K. Mangalam
Mathieu Salzmann
AI4CE
9
16
0
01 Dec 2018
Mixed Precision Quantization of ConvNets via Differentiable Neural Architecture Search
Bichen Wu
Yanghan Wang
Peizhao Zhang
Yuandong Tian
Peter Vajda
Kurt Keutzer
MQ
20
272
0
30 Nov 2018
Dense xUnit Networks
I. Kligvasser
T. Michaeli
13
3
0
27 Nov 2018
Efficient non-uniform quantizer for quantized neural network targeting reconfigurable hardware
Natan Liss
Chaim Baskin
A. Mendelson
A. Bronstein
Raja Giryes
MQ
19
5
0
27 Nov 2018
A Survey of Mobile Computing for the Visually Impaired
Martin Weiss
Margaux Luck
Roger Girgis
C. Pal
Joseph Paul Cohen
20
10
0
25 Nov 2018
Joint Neural Architecture Search and Quantization
Yukang Chen
Gaofeng Meng
Qian Zhang
Xinbang Zhang
Liangchen Song
Shiming Xiang
Chunhong Pan
MQ
24
29
0
23 Nov 2018
Structured Binary Neural Networks for Accurate Image Classification and Semantic Segmentation
Bohan Zhuang
Chunhua Shen
Mingkui Tan
Lingqiao Liu
Ian Reid
MQ
27
152
0
22 Nov 2018
HAQ: Hardware-Aware Automated Quantization with Mixed Precision
Kuan-Chieh Jackson Wang
Zhijian Liu
Yujun Lin
Ji Lin
Song Han
MQ
21
872
0
21 Nov 2018
Synetgy: Algorithm-hardware Co-design for ConvNet Accelerators on Embedded FPGAs
Yifan Yang
Qijing Huang
Bichen Wu
Tianjun Zhang
Liang Ma
...
Michaela Blott
Luciano Lavagno
K. Vissers
J. Wawrzynek
Kurt Keutzer
11
113
0
21 Nov 2018
TSM: Temporal Shift Module for Efficient Video Understanding
Ji Lin
Chuang Gan
Song Han
27
1,669
0
20 Nov 2018
ReLeQ: A Reinforcement Learning Approach for Deep Quantization of Neural Networks
Ahmed T. Elthakeb
Prannoy Pilligundla
Fatemehsadat Mireshghallah
Amir Yazdanbakhsh
H. Esmaeilzadeh
MQ
55
68
0
05 Nov 2018
Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration
Yang He
Ping Liu
Ziwei Wang
Zhilan Hu
Yi Yang
AAML
3DPC
11
1,037
0
01 Nov 2018