Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1511.06393
Cited By
Fixed Point Quantization of Deep Convolutional Networks
19 November 2015
D. Lin
S. Talathi
V. Annapureddy
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Fixed Point Quantization of Deep Convolutional Networks"
50 / 99 papers shown
Title
Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks
Erin Carson
Xinye Chen
49
0
0
10 Apr 2025
Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning
Sanghwan Bae
Jiwoo Hong
Min Young Lee
Hanbyul Kim
Jeongyeon Nam
Donghyun Kwak
OffRL
LRM
48
3
0
04 Apr 2025
On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance
Jaskirat Singh
Bram Adams
Ahmed E. Hassan
VLM
34
0
0
01 Nov 2024
MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity
Kanghyun Choi
Hyeyoon Lee
Dain Kwon
Sunjong Park
Kyuyeun Kim
Noseong Park
Jinho Lee
Jinho Lee
MQ
40
1
0
29 Jul 2024
DE
3
^3
3
-BERT: Distance-Enhanced Early Exiting for BERT based on Prototypical Networks
Jianing He
Qi Zhang
Weiping Ding
Duoqian Miao
Jun Zhao
Liang Hu
LongBing Cao
34
3
0
03 Feb 2024
In-Ear-Voice: Towards Milli-Watt Audio Enhancement With Bone-Conduction Microphones for In-Ear Sensing Platforms
Philipp Schilk
Niccolò Polvani
Andrea Ronco
Milos Cernak
Michele Magno
27
12
0
05 Sep 2023
Learning Discrete Weights and Activations Using the Local Reparameterization Trick
G. Berger
Aviv Navon
Ethan Fetaya
MQ
20
0
0
04 Jul 2023
Patch-wise Mixed-Precision Quantization of Vision Transformer
Junrui Xiao
Zhikai Li
Lianwei Yang
Qingyi Gu
MQ
22
12
0
11 May 2023
Are Visual Recognition Models Robust to Image Compression?
Joao Maria Janeiro
Stanislav Frolov
Alaaeldin El-Nouby
Jakob Verbeek
VLM
13
4
0
10 Apr 2023
AutoQNN: An End-to-End Framework for Automatically Quantizing Neural Networks
Cheng Gong
Ye Lu
Surong Dai
Deng Qian
Chenkun Du
Tao Li
MQ
27
0
0
07 Apr 2023
A Heterogeneous Parallel Non-von Neumann Architecture System for Accurate and Efficient Machine Learning Molecular Dynamics
Zhuoying Zhao
Ziling Tan
Pinghui Mo
Xiaonan Wang
Dan Zhao
Xin Zhang
Ming Tao
Jie Liu
19
1
0
26 Mar 2023
Rethinking Data-Free Quantization as a Zero-Sum Game
Biao Qian
Yang Wang
Richang Hong
Meng Wang
MQ
16
17
0
19 Feb 2023
Towards Implementing Energy-aware Data-driven Intelligence for Smart Health Applications on Mobile Platforms
G. D. Samaraweera
Hung Nguyen
Hadi Zanddizari
Behnam Zeinali
Jerome Chang
17
0
0
01 Feb 2023
The Hidden Power of Pure 16-bit Floating-Point Neural Networks
Juyoung Yun
Byungkon Kang
Zhoulai Fu
MQ
21
1
0
30 Jan 2023
OLLA: Optimizing the Lifetime and Location of Arrays to Reduce the Memory Usage of Neural Networks
Benoit Steiner
Mostafa Elhoushi
Jacob Kahn
James Hegarty
29
8
0
24 Oct 2022
MotionDeltaCNN: Sparse CNN Inference of Frame Differences in Moving Camera Videos
Mathias Parger
Chengcheng Tang
Thomas Neff
Christopher D. Twigg
Cem Keskin
Robert Y. Wang
M. Steinberger
19
6
0
18 Oct 2022
Neural Network Compression by Joint Sparsity Promotion and Redundancy Reduction
T. M. Khan
Syed S. Naqvi
A. Robles-Kelly
Erik H. W. Meijering
36
7
0
14 Oct 2022
Design Automation for Fast, Lightweight, and Effective Deep Learning Models: A Survey
Dalin Zhang
Kaixuan Chen
Yan Zhao
B. Yang
Li-Ping Yao
Christian S. Jensen
38
3
0
22 Aug 2022
FP8 Quantization: The Power of the Exponent
Andrey Kuzmin
M. V. Baalen
Yuwei Ren
Markus Nagel
Jorn W. T. Peters
Tijmen Blankevoort
MQ
19
78
0
19 Aug 2022
Mixed-Precision Neural Networks: A Survey
M. Rakka
M. Fouda
Pramod P. Khargonekar
Fadi J. Kurdahi
MQ
18
11
0
11 Aug 2022
Design of High-Throughput Mixed-Precision CNN Accelerators on FPGA
Cecilia Latotzke
Tim Ciesielski
T. Gemmeke
MQ
11
7
0
09 Aug 2022
CoNLoCNN: Exploiting Correlation and Non-Uniform Quantization for Energy-Efficient Low-precision Deep Convolutional Neural Networks
Muhammad Abdullah Hanif
G. M. Sarda
Alberto Marchisio
Guido Masera
Maurizio Martina
Muhammad Shafique
MQ
8
4
0
31 Jul 2022
DarKnight: An Accelerated Framework for Privacy and Integrity Preserving Deep Learning Using Trusted Hardware
H. Hashemi
Yongqin Wang
M. Annavaram
FedML
18
58
0
30 Jun 2022
FlexBlock: A Flexible DNN Training Accelerator with Multi-Mode Block Floating Point Support
Seock-Hwan Noh
Jahyun Koo
Seunghyun Lee
Jongse Park
Jaeha Kung
AI4CE
16
17
0
13 Mar 2022
Engineering the Neural Automatic Passenger Counter
Nico Jahn
Michael Siebert
11
2
0
02 Mar 2022
Multi-task Learning Approach for Modulation and Wireless Signal Classification for 5G and Beyond: Edge Deployment via Model Compression
Anu Jagannath
Jithin Jagannath
14
26
0
26 Feb 2022
Quantune: Post-training Quantization of Convolutional Neural Networks using Extreme Gradient Boosting for Fast Deployment
Jemin Lee
Misun Yu
Yongin Kwon
Teaho Kim
MQ
14
17
0
10 Feb 2022
Accelerating DNN Training with Structured Data Gradient Pruning
Bradley McDanel
Helia Dinh
J. Magallanes
4
7
0
01 Feb 2022
Sub-mW Keyword Spotting on an MCU: Analog Binary Feature Extraction and Binary Neural Networks
G. Cerutti
Lukas Cavigelli
Renzo Andri
Michele Magno
Elisabetta Farella
Luca Benini
26
14
0
10 Jan 2022
Speedup deep learning models on GPU by taking advantage of efficient unstructured pruning and bit-width reduction
Marcin Pietroñ
Dominik Zurek
14
13
0
28 Dec 2021
Neural Network Quantization for Efficient Inference: A Survey
Olivia Weng
MQ
17
22
0
08 Dec 2021
Phantom: A High-Performance Computational Core for Sparse Convolutional Neural Networks
Mahmood Azhar Qureshi
Arslan Munir
14
0
0
09 Nov 2021
BitTrain: Sparse Bitmap Compression for Memory-Efficient Training on the Edge
Abdelrahman I. Hosny
Marina Neseem
Sherief Reda
MQ
25
4
0
29 Oct 2021
Revisiting Discriminator in GAN Compression: A Generator-discriminator Cooperative Compression Scheme
Shaojie Li
Jie Wu
Xuefeng Xiao
Fei Chao
Xudong Mao
Rongrong Ji
23
35
0
27 Oct 2021
Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks
Yikai Wang
Yi Yang
Fuchun Sun
Anbang Yao
MQ
18
15
0
18 Oct 2021
MAFAT: Memory-Aware Fusing and Tiling of Neural Networks for Accelerated Edge Inference
J. Farley
A. Gerstlauer
FedML
21
5
0
14 Jul 2021
Smoothed Differential Privacy
Ao Liu
Yu-Xiang Wang
Lirong Xia
17
0
0
04 Jul 2021
Knowledge distillation: A good teacher is patient and consistent
Lucas Beyer
Xiaohua Zhai
Amelie Royer
L. Markeeva
Rohan Anil
Alexander Kolesnikov
VLM
15
287
0
09 Jun 2021
Measuring what Really Matters: Optimizing Neural Networks for TinyML
Lennart Heim
Andreas Biri
Zhongnan Qu
Lothar Thiele
38
30
0
21 Apr 2021
Random and Adversarial Bit Error Robustness: Energy-Efficient and Secure DNN Accelerators
David Stutz
Nandhini Chandramoorthy
Matthias Hein
Bernt Schiele
AAML
MQ
20
18
0
16 Apr 2021
Efficient Video Compression via Content-Adaptive Super-Resolution
Mehrdad Khani Shirkoohi
Vibhaalakshmi Sivaraman
Mohammad Alizadeh
SupR
26
49
0
06 Apr 2021
Training Multi-bit Quantized and Binarized Networks with A Learnable Symmetric Quantizer
Phuoc Pham
J. Abraham
Jaeyong Chung
MQ
33
11
0
01 Apr 2021
Toward Compact Deep Neural Networks via Energy-Aware Pruning
Seul-Ki Yeom
Kyung-Hwan Shim
Jee-Hyun Hwang
CVBM
22
12
0
19 Mar 2021
Reduced-Order Neural Network Synthesis with Robustness Guarantees
R. Drummond
M. Turner
S. Duncan
13
9
0
18 Feb 2021
Dynamic Precision Analog Computing for Neural Networks
Sahaj Garg
Joe Lou
Anirudh Jain
Mitchell Nahmias
37
32
0
12 Feb 2021
Fixed-point Quantization of Convolutional Neural Networks for Quantized Inference on Embedded Platforms
Rishabh Goyal
Joaquin Vanschoren
V. V. Acht
S. Nijssen
MQ
12
23
0
03 Feb 2021
Sound Event Detection with Binary Neural Networks on Tightly Power-Constrained IoT Devices
G. Cerutti
Renzo Andri
Lukas Cavigelli
Michele Magno
Elisabetta Farella
Luca Benini
MQ
13
37
0
12 Jan 2021
Design and Analysis of Uplink and Downlink Communications for Federated Learning
Sihui Zheng
Cong Shen
Xiang Chen
23
139
0
07 Dec 2020
An SMT-Based Approach for Verifying Binarized Neural Networks
Guy Amir
Haoze Wu
Clark W. Barrett
Guy Katz
11
58
0
05 Nov 2020
Softmax Tempering for Training Neural Machine Translation Models
Raj Dabre
Atsushi Fujita
23
10
0
20 Sep 2020
1
2
Next