Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1702.00953
Cited By
Deep Learning with Low Precision by Half-wave Gaussian Quantization
3 February 2017
Zhaowei Cai
Xiaodong He
Jian Sun
Nuno Vasconcelos
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Learning with Low Precision by Half-wave Gaussian Quantization"
50 / 87 papers shown
Title
Dedicated Inference Engine and Binary-Weight Neural Networks for Lightweight Instance Segmentation
Tse-Wei Chen
Wei Tao
Dongyue Zhao
Kazuhiro Mima
Tadayuki Ito
Kinya Osa
Masami Kato
MQ
31
0
0
03 Jan 2025
MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity
Kanghyun Choi
Hyeyoon Lee
Dain Kwon
Sunjong Park
Kyuyeun Kim
Noseong Park
Jinho Lee
Jinho Lee
MQ
40
1
0
29 Jul 2024
Characterizing Disparity Between Edge Models and High-Accuracy Base Models for Vision Tasks
Zhenyu Wang
S. Nirjon
24
0
0
13 Jul 2024
Designed Dithering Sign Activation for Binary Neural Networks
Brayan Monroy
Juan Estupiñán
T. Gelvez-Barrera
Jorge Bacca
Henry Arguello
MQ
35
1
0
03 May 2024
Investigation of Energy-efficient AI Model Architectures and Compression Techniques for "Green" Fetal Brain Segmentation
Szymon Mazurek
M. Pytlarz
Sylwia Malec
A. Crimi
31
0
0
03 Apr 2024
LitE-SNN: Designing Lightweight and Efficient Spiking Neural Network through Spatial-Temporal Compressive Network Search and Joint Optimization
Qianhui Liu
Jiaqi Yan
Malu Zhang
Gang Pan
Haizhou Li
32
4
0
26 Jan 2024
Overcoming Distribution Mismatch in Quantizing Image Super-Resolution Networks
Chee Hong
Kyoung Mu Lee
SupR
MQ
19
1
0
25 Jul 2023
Quantized Feature Distillation for Network Quantization
Kevin Zhu
Yin He
Jianxin Wu
MQ
24
9
0
20 Jul 2023
A Framework for Benchmarking Real-Time Embedded Object Detection
M. Schlosser
D. König
Michael Teutsch
ObjD
11
0
0
23 Apr 2023
CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network with Large Input
Senmao Tian
Ming Lu
Jiaming Liu
Yandong Guo
Yurong Chen
Shunli Zhang
SupR
MQ
20
11
0
13 Apr 2023
Learning Discretized Neural Networks under Ricci Flow
Jun Chen
Han Chen
Mengmeng Wang
Guang Dai
Ivor W. Tsang
Y. Liu
15
2
0
07 Feb 2023
RedBit: An End-to-End Flexible Framework for Evaluating the Accuracy of Quantized CNNs
A. M. Ribeiro-dos-Santos
João Dinis Ferreira
O. Mutlu
G. Falcão
MQ
13
1
0
15 Jan 2023
Redistribution of Weights and Activations for AdderNet Quantization
Ying Nie
Kai Han
Haikang Diao
Chuanjian Liu
Enhua Wu
Yunhe Wang
MQ
44
6
0
20 Dec 2022
PD-Quant: Post-Training Quantization based on Prediction Difference Metric
Jiawei Liu
Lin Niu
Zhihang Yuan
Dawei Yang
Xinggang Wang
Wenyu Liu
MQ
96
68
0
14 Dec 2022
Vertical Layering of Quantized Neural Networks for Heterogeneous Inference
Hai Wu
Ruifei He
Hao Hao Tan
Xiaojuan Qi
Kaibin Huang
MQ
19
2
0
10 Dec 2022
BiFSMNv2: Pushing Binary Neural Networks for Keyword Spotting to Real-Network Performance
Haotong Qin
Xudong Ma
Yifu Ding
X. Li
Yang Zhang
Zejun Ma
Jiakai Wang
Jie Luo
Xianglong Liu
MQ
38
20
0
13 Nov 2022
Fast and Low-Memory Deep Neural Networks Using Binary Matrix Factorization
Alireza Bordbar
M. Kahaei
MQ
23
0
0
24 Oct 2022
AttTrack: Online Deep Attention Transfer for Multi-object Tracking
Keivan Nalaie
Rong Zheng
VOT
11
4
0
16 Oct 2022
Convolutional Neural Networks Quantization with Attention
Binyi Wu
Bernd Waschneck
Christian Mayr
MQ
13
1
0
30 Sep 2022
How important are activation functions in regression and classification? A survey, performance comparison, and future directions
Ameya Dilip Jagtap
George Karniadakis
AI4CE
29
71
0
06 Sep 2022
AdaBin: Improving Binary Neural Networks with Adaptive Binary Sets
Zhaopeng Tu
Xinghao Chen
Pengju Ren
Yunhe Wang
MQ
34
54
0
17 Aug 2022
Mixed-Precision Neural Networks: A Survey
M. Rakka
M. Fouda
Pramod P. Khargonekar
Fadi J. Kurdahi
MQ
18
11
0
11 Aug 2022
CADyQ: Content-Aware Dynamic Quantization for Image Super-Resolution
Chee Hong
Sungyong Baik
Heewon Kim
Seungjun Nah
Kyoung Mu Lee
SupR
MQ
23
32
0
21 Jul 2022
Towards Semantic Communication Protocols: A Probabilistic Logic Perspective
Sejin Seo
Jihong Park
Seung-Woo Ko
Jinho D. Choi
M. Bennis
Seong-Lyun Kim
22
22
0
08 Jul 2022
Combinatorial optimization for low bit-width neural networks
Hanxu Zhou
Aida Ashrafi
Matthew B. Blaschko
MQ
19
0
0
04 Jun 2022
Standard Deviation-Based Quantization for Deep Neural Networks
Amir Ardakani
A. Ardakani
B. Meyer
J. Clark
W. Gross
MQ
41
1
0
24 Feb 2022
Distilled Neural Networks for Efficient Learning to Rank
F. M. Nardini
Cosimo Rulli
Salvatore Trani
Rossano Venturini
FedML
21
16
0
22 Feb 2022
Lightweight Jet Reconstruction and Identification as an Object Detection Task
Adrian Alan Pol
T. Aarrestad
E. Govorkova
Roi Halily
Anat Klempner
...
Vladimir Loncar
J. Ngadiuba
M. Pierini
Olya Sirkin
S. Summers
17
2
0
09 Feb 2022
Elastic-Link for Binarized Neural Network
Jie Hu
Ziheng Wu
Vince Tan
Zhilin Lu
Mengze Zeng
Enhua Wu
MQ
26
6
0
19 Dec 2021
Sharpness-aware Quantization for Deep Neural Networks
Jing Liu
Jianfei Cai
Bohan Zhuang
MQ
27
24
0
24 Nov 2021
Arch-Net: Model Distillation for Architecture Agnostic Model Deployment
Weixin Xu
Zipeng Feng
Shuangkang Fang
Song Yuan
Yi Yang
Shuchang Zhou
MQ
16
1
0
01 Nov 2021
Towards Mixed-Precision Quantization of Neural Networks via Constrained Optimization
Weihan Chen
Peisong Wang
Jian Cheng
MQ
33
61
0
13 Oct 2021
VC dimension of partially quantized neural networks in the overparametrized regime
Yutong Wang
Clayton D. Scott
12
1
0
06 Oct 2021
APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores
Boyuan Feng
Yuke Wang
Tong Geng
Ang Li
Yufei Ding
MQ
11
37
0
23 Jun 2021
Continual 3D Convolutional Neural Networks for Real-time Processing of Videos
Lukas Hedegaard
Alexandros Iosifidis
3DPC
18
14
0
31 May 2021
"BNN - BN = ?": Training Binary Neural Networks without Batch Normalization
Tianlong Chen
Zhenyu (Allen) Zhang
Xu Ouyang
Zechun Liu
Zhiqiang Shen
Zhangyang Wang
MQ
31
36
0
16 Apr 2021
Compacting Deep Neural Networks for Internet of Things: Methods and Applications
Ke Zhang
Hanbo Ying
Hongning Dai
Lin Li
Yuangyuang Peng
Keyi Guo
Hongfang Yu
16
38
0
20 Mar 2021
Adaptive Precision Training for Resource Constrained Devices
Tian Huang
Tao Luo
Joey Tianyi Zhou
28
5
0
23 Dec 2020
Training Binary Neural Networks through Learning with Noisy Supervision
Kai Han
Yunhe Wang
Yixing Xu
Chunjing Xu
Enhua Wu
Chang Xu
MQ
8
55
0
10 Oct 2020
High-Capacity Expert Binary Networks
Adrian Bulat
Brais Martínez
Georgios Tzimiropoulos
MQ
18
57
0
07 Oct 2020
Stochastic Precision Ensemble: Self-Knowledge Distillation for Quantized Deep Neural Networks
Yoonho Boo
Sungho Shin
Jungwook Choi
Wonyong Sung
MQ
14
29
0
30 Sep 2020
Kernel Based Progressive Distillation for Adder Neural Networks
Yixing Xu
Chang Xu
Xinghao Chen
Wei Zhang
Chunjing Xu
Yunhe Wang
30
47
0
28 Sep 2020
NASB: Neural Architecture Search for Binary Convolutional Neural Networks
Baozhou Zhu
Zaid Al-Ars
P. Hofstee
MQ
15
23
0
08 Aug 2020
Search What You Want: Barrier Panelty NAS for Mixed Precision Quantization
Haibao Yu
Qi Han
Jianbo Li
Jianping Shi
Guangliang Cheng
Bin Fan
MQ
16
61
0
20 Jul 2020
AQD: Towards Accurate Fully-Quantized Object Detection
Peng Chen
Jing Liu
Bohan Zhuang
Mingkui Tan
Chunhua Shen
MQ
21
10
0
14 Jul 2020
Binary Neural Networks: A Survey
Haotong Qin
Ruihao Gong
Xianglong Liu
Xiao Bai
Jingkuan Song
N. Sebe
MQ
36
457
0
31 Mar 2020
Training Binary Neural Networks with Real-to-Binary Convolutions
Brais Martínez
Jing Yang
Adrian Bulat
Georgios Tzimiropoulos
MQ
9
226
0
25 Mar 2020
Evolutionary Bin Packing for Memory-Efficient Dataflow Inference Acceleration on FPGA
M. Kroes
L. Petrica
S. Cotofana
Michaela Blott
8
7
0
24 Mar 2020
Efficient Crowd Counting via Structured Knowledge Transfer
Lingbo Liu
Jiaqi Chen
Hefeng Wu
Tianshui Chen
Guanbin Li
Liang Lin
24
64
0
23 Mar 2020
Accelerating Deep Learning Inference via Freezing
Adarsh Kumar
Arjun Balasubramanian
Shivaram Venkataraman
Aditya Akella
AI4CE
21
22
0
07 Feb 2020
1
2
Next