1811.01704
ReLeQ: A Reinforcement Learning Approach for Deep Quantization of Neural Networks
5 November 2018
Ahmed T. Elthakeb
Prannoy Pilligundla
Fatemehsadat Mireshghallah
Amir Yazdanbakhsh
H. Esmaeilzadeh
MQ
Papers citing
"ReLeQ: A Reinforcement Learning Approach for Deep Quantization of Neural Networks"
36 / 36 papers shown
Title
ARQ: A Mixed-Precision Quantization Framework for Accurate and Certifiably Robust DNNs
Yuchen Yang
Shubham Ugare
Yifan Zhao
Gagandeep Singh
Sasa Misailovic
MQ
311
1
0
31 Oct 2024
From Algorithm to Hardware: A Survey on Efficient and Safe Deployment of Deep Neural Networks
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Xue Geng
Zhe Wang
Chunyun Chen
Qing Xu
Kaixin Xu
...
Zhenghua Chen
M. Aly
Jie Lin
Ruibing Jin
Xiaoli Li
307
8
0
09 May 2024
SimQ-NAS: Simultaneous Quantization Policy and Neural Architecture Search
S. N. Sridhar
Maciej Szankin
Fang Chen
Sairam Sundaresan
Anthony Sarah
MQ
211
1
0
19 Dec 2023
MetaMix: Meta-state Precision Searcher for Mixed-precision Activation Quantization
AAAI Conference on Artificial Intelligence (AAAI), 2023
Han-Byul Kim
Joo Hyung Lee
Sungjoo Yoo
Hong-Seok Kim
MQ
211
9
0
12 Nov 2023
FLIQS: One-Shot Mixed-Precision Floating-Point and Integer Quantization Search
Jordan Dotzel
Gang Wu
Andrew Li
M. Umar
Yun Ni
...
Liqun Cheng
Martin G. Dixon
N. Jouppi
Quoc V. Le
Sheng Li
MQ
285
5
0
07 Aug 2023
ArchGym: An Open-Source Gymnasium for Machine Learning Assisted Architecture Design
International Symposium on Computer Architecture (ISCA), 2023
Srivatsan Krishnan
Amir Yazdanbakhsh
Yasmine Omri
Jason J. Jabbour
Ikechukwu Uchendu
...
Behzad Boroujerdian
Daniel Richins
Devashree Tripathy
Aleksandra Faust
Vijay Janapa Reddi
233
17
0
15 Jun 2023
Towards Optimal Compression: Joint Pruning and Quantization
Ben Zandonati
Glenn Bucagu
Adrian Alan Pol
M. Pierini
Olya Sirkin
Tal Kopetz
MQ
313
5
0
15 Feb 2023
CSMPQ: Class Separability Based Mixed-Precision Quantization
International Conference on Intelligent Computing (ICIC), 2022
Ming-Yu Wang
Taisong Jin
Miaohui Zhang
Zhengtao Yu
MQ
115
1
0
20 Dec 2022
Mixed-Precision Neural Networks: A Survey
M. Rakka
M. Fouda
Pramod P. Khargonekar
Fadi J. Kurdahi
MQ
280
19
0
11 Aug 2022
Learnable Mixed-precision and Dimension Reduction Co-design for Low-storage Activation
IEEE Workshop on Signal Processing Systems (SiPS), 2022
Yu-Shan Tai
Cheng-Yang Chang
Chieh-Fang Teng
An-Yeu (Andy) Wu
196
5
0
16 Jul 2022
Edge Inference with Fully Differentiable Quantized Mixed Precision Neural Networks
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Clemens J. S. Schaefer
Siddharth Joshi
Shane Li
Raul Blazquez
MQ
162
14
0
15 Jun 2022
Modulating Regularization Frequency for Efficient Compression-Aware Model Training
Dongsoo Lee
S. Kwon
Byeongwook Kim
Jeongin Yun
Baeseong Park
Yongkweon Jeon
101
0
0
05 May 2021
Exploring Neural Networks Quantization via Layer-Wise Quantization Analysis
Shachar Gluska
Mark Grobman
MQ
93
7
0
15 Dec 2020
Automated Model Compression by Jointly Applied Pruning and Quantization
Wenting Tang
Xingxing Wei
Yue Liu
MQ
99
7
0
12 Nov 2020
Differentiable Joint Pruning and Quantization for Hardware Efficiency
Ying Wang
Yadong Lu
Tijmen Blankevoort
MQ
282
85
0
20 Jul 2020
FracBits: Mixed Precision Quantization via Fractional Bit-Widths
Linjie Yang
Qing Jin
MQ
246
92
0
04 Jul 2020
An Overview of Neural Network Compression
James O'Neill
AI4CE
332
112
0
05 Jun 2020
Modeling Survival in model-based Reinforcement Learning
Saeed Moazami
P. Doerschuk
OffRL
85
1
0
18 Apr 2020
Rethinking Differentiable Search for Mixed-Precision Neural Networks
Computer Vision and Pattern Recognition (CVPR), 2020
Zhaowei Cai
Nuno Vasconcelos
MQ
136
138
0
13 Apr 2020
GeneCAI: Genetic Evolution for Acquiring Compact AI
Annual Conference on Genetic and Evolutionary Computation (GECCO), 2020
Mojan Javaheripi
Mohammad Samragh
T. Javidi
F. Koushanfar
240
9
0
08 Apr 2020
CNN2Gate: Toward Designing a General Framework for Implementation of Convolutional Neural Networks on FPGA
Alireza Ghaffari
Yvon Savaria
156
10
0
06 Apr 2020
Efficient Bitwidth Search for Practical Mixed Precision Neural Network
Yuhang Li
Wei Wang
Haoli Bai
Yazhe Niu
Xin Dong
F. Yu
MQ
147
23
0
17 Mar 2020
Ordering Chaos: Memory-Aware Scheduling of Irregularly Wired Neural Networks for Edge Devices
Conference on Machine Learning and Systems (MLSys), 2020
Byung Hoon Ahn
Jinwon Lee
J. Lin
Hsin-Pai Cheng
Jilei Hou
H. Esmaeilzadeh
177
57
0
04 Mar 2020
WaveQ: Gradient-Based Deep Quantization of Neural Networks through Sinusoidal Adaptive Regularization
Ahmed T. Elthakeb
Prannoy Pilligundla
Fatemehsadat Mireshghallah
T. Elgindi
Charles-Alban Deledalle
H. Esmaeilzadeh
MQ
185
10
0
29 Feb 2020
BitPruning: Learning Bitlengths for Aggressive and Accurate Quantization
International Symposium on Circuits and Systems (ISCAS), 2020
Miloš Nikolić
G. B. Hacene
Ciaran Bannon
Alberto Delmas Lascorz
Matthieu Courbariaux
Yoshua Bengio
Vincent Gripon
Andreas Moshovos
MQ
164
25
0
08 Feb 2020
Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation
International Conference on Learning Representations (ICLR), 2020
Byung Hoon Ahn
Prannoy Pilligundla
Amir Yazdanbakhsh
H. Esmaeilzadeh
ODL
203
88
0
23 Jan 2020
Towards Efficient Training for Neural Network Quantization
Qing Jin
Linjie Yang
Zhenyu A. Liao
MQ
227
42
0
21 Dec 2019
AdaBits: Neural Network Quantization with Adaptive Bit-Widths
Computer Vision and Pattern Recognition (CVPR), 2020
Qing Jin
Linjie Yang
Zhenyu A. Liao
MQ
209
143
0
20 Dec 2019
GroSS: Group-Size Series Decomposition for Grouped Architecture Search
Henry Howard-Jenkins
Yiwen Li
V. Prisacariu
202
0
0
02 Dec 2019
Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks
IEEE International Conference on Computer Vision (ICCV), 2019
Yazhe Niu
Xianglong Liu
Shenghu Jiang
Tian-Hao Li
Peng Hu
Jiazhen Lin
F. Yu
Junjie Yan
MQ
254
511
0
14 Aug 2019
Divide and Conquer: Leveraging Intermediate Feature Representations for Quantized Training of Neural Networks
International Conference on Machine Learning (ICML), 2019
Ahmed T. Elthakeb
Prannoy Pilligundla
Alex Cloninger
H. Esmaeilzadeh
MQ
251
9
0
14 Jun 2019
Reinforcement Learning and Adaptive Sampling for Optimized DNN Compilation
Byung Hoon Ahn
Prannoy Pilligundla
H. Esmaeilzadeh
133
21
0
30 May 2019
Mixed Precision DNNs: All you need is a good parametrization
Stefan Uhlich
Lukas Mauch
Fabien Cardinaux
K. Yoshiyama
Javier Alonso García
Stephen Tiedemann
Thomas Kemp
Akira Nakamura
MQ
274
39
0
27 May 2019
SinReQ: Generalized Sinusoidal Regularization for Low-Bitwidth Deep Quantized Training
Ahmed T. Elthakeb
Prannoy Pilligundla
H. Esmaeilzadeh
MQ
191
9
0
04 May 2019
Towards Learning of Filter-Level Heterogeneous Compression of Convolutional Neural Networks
Y. Zur
Chaim Baskin
Evgenii Zheltonozhskii
Brian Chmiel
Itay Evron
A. Bronstein
A. Mendelson
MQ
178
7
0
22 Apr 2019
AutoQ: Automated Kernel-Wise Neural Network Quantization
Qian Lou
Feng Guo
Lantao Liu
Minje Kim
Lei Jiang
MQ
254
111
0
15 Feb 2019