Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1811.01704
Cited By
v1
v2
v3
v4 (latest)
ReLeQ: A Reinforcement Learning Approach for Deep Quantization of Neural Networks
5 November 2018
Ahmed T. Elthakeb
Prannoy Pilligundla
Fatemehsadat Mireshghallah
Amir Yazdanbakhsh
H. Esmaeilzadeh
MQ
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"ReLeQ: A Reinforcement Learning Approach for Deep Quantization of Neural Networks"
36 / 36 papers shown
Title
ARQ: A Mixed-Precision Quantization Framework for Accurate and Certifiably Robust DNNs
Yuchen Yang
Shubham Ugare
Yifan Zhao
Gagandeep Singh
Sasa Misailovic
MQ
291
1
0
31 Oct 2024
From Algorithm to Hardware: A Survey on Efficient and Safe Deployment of Deep Neural Networks
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Xue Geng
Zhe Wang
Chunyun Chen
Qing Xu
Kaixin Xu
...
Zhenghua Chen
M. Aly
Jie Lin
Ruibing Jin
Xiaoli Li
241
7
0
09 May 2024
SimQ-NAS: Simultaneous Quantization Policy and Neural Architecture Search
S. N. Sridhar
Maciej Szankin
Fang Chen
Sairam Sundaresan
Anthony Sarah
MQ
199
0
0
19 Dec 2023
MetaMix: Meta-state Precision Searcher for Mixed-precision Activation Quantization
AAAI Conference on Artificial Intelligence (AAAI), 2023
Han-Byul Kim
Joo Hyung Lee
Sungjoo Yoo
Hong-Seok Kim
MQ
163
9
0
12 Nov 2023
FLIQS: One-Shot Mixed-Precision Floating-Point and Integer Quantization Search
Jordan Dotzel
Gang Wu
Andrew Li
M. Umar
Yun Ni
...
Liqun Cheng
Martin G. Dixon
N. Jouppi
Quoc V. Le
Sheng Li
MQ
273
5
0
07 Aug 2023
ArchGym: An Open-Source Gymnasium for Machine Learning Assisted Architecture Design
International Symposium on Computer Architecture (ISCA), 2023
Srivatsan Krishnan
Amir Yazdanbaksh
Yasmine Omri
Jason J. Jabbour
Ikechukwu Uchendu
...
Behzad Boroujerdian
Daniel Richins
Devashree Tripathy
Aleksandra Faust
Vijay Janapa Reddi
221
17
0
15 Jun 2023
Towards Optimal Compression: Joint Pruning and Quantization
Ben Zandonati
Glenn Bucagu
Adrian Alan Pol
M. Pierini
Olya Sirkin
Tal Kopetz
MQ
249
5
0
15 Feb 2023
CSMPQ:Class Separability Based Mixed-Precision Quantization
International Conference on Intelligent Computing (ICIC), 2022
Ming-Yu Wang
Taisong Jin
Miaohui Zhang
Zhengtao Yu
MQ
103
1
0
20 Dec 2022
Mixed-Precision Neural Networks: A Survey
M. Rakka
M. Fouda
Pramod P. Khargonekar
Fadi J. Kurdahi
MQ
268
19
0
11 Aug 2022
Learnable Mixed-precision and Dimension Reduction Co-design for Low-storage Activation
IEEE Workshop on Signal Processing Systems (SiPS), 2022
Yu-Shan Tai
Cheng-Yang Chang
Chieh-Fang Teng
AnYeu
A. Wu
164
5
0
16 Jul 2022
Edge Inference with Fully Differentiable Quantized Mixed Precision Neural Networks
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Clemens J. S. Schaefer
Siddharth Joshi
Shane Li
Raul Blazquez
MQ
150
13
0
15 Jun 2022
Modulating Regularization Frequency for Efficient Compression-Aware Model Training
Dongsoo Lee
S. Kwon
Byeongwook Kim
Jeongin Yun
Baeseong Park
Yongkweon Jeon
97
0
0
05 May 2021
Exploring Neural Networks Quantization via Layer-Wise Quantization Analysis
Shachar Gluska
Mark Grobman
MQ
93
7
0
15 Dec 2020
Automated Model Compression by Jointly Applied Pruning and Quantization
Wenting Tang
Xingxing Wei
Yue Liu
MQ
91
7
0
12 Nov 2020
Differentiable Joint Pruning and Quantization for Hardware Efficiency
Ying Wang
Yadong Lu
Tijmen Blankevoort
MQ
249
82
0
20 Jul 2020
FracBits: Mixed Precision Quantization via Fractional Bit-Widths
Linjie Yang
Qing Jin
MQ
194
91
0
04 Jul 2020
An Overview of Neural Network Compression
James OÑeill
AI4CE
280
110
0
05 Jun 2020
Modeling Survival in model-based Reinforcement Learning
Saeed Moazami
P. Doerschuk
OffRL
81
1
0
18 Apr 2020
Rethinking Differentiable Search for Mixed-Precision Neural Networks
Computer Vision and Pattern Recognition (CVPR), 2020
Zhaowei Cai
Nuno Vasconcelos
MQ
136
138
0
13 Apr 2020
GeneCAI: Genetic Evolution for Acquiring Compact AI
Annual Conference on Genetic and Evolutionary Computation (GECCO), 2020
Mojan Javaheripi
Mohammad Samragh
T. Javidi
F. Koushanfar
228
9
0
08 Apr 2020
CNN2Gate: Toward Designing a General Framework for Implementation of Convolutional Neural Networks on FPGA
Alireza Ghaffari
Yvon Savaria
136
1
0
06 Apr 2020
Efficient Bitwidth Search for Practical Mixed Precision Neural Network
Yuhang Li
Wei Wang
Haoli Bai
Yazhe Niu
Xin Dong
F. Yu
MQ
131
23
0
17 Mar 2020
Ordering Chaos: Memory-Aware Scheduling of Irregularly Wired Neural Networks for Edge Devices
Conference on Machine Learning and Systems (MLSys), 2020
Byung Hoon Ahn
Jinwon Lee
J. Lin
Hsin-Pai Cheng
Jilei Hou
H. Esmaeilzadeh
173
57
0
04 Mar 2020
WaveQ: Gradient-Based Deep Quantization of Neural Networks through Sinusoidal Adaptive Regularization
Ahmed T. Elthakeb
Prannoy Pilligundla
Fatemehsadat Mireshghallah
T. Elgindi
Charles-Alban Deledalle
H. Esmaeilzadeh
MQ
173
10
0
29 Feb 2020
BitPruning: Learning Bitlengths for Aggressive and Accurate Quantization
International Symposium on Circuits and Systems (ISCAS), 2020
Milovs Nikolić
G. B. Hacene
Ciaran Bannon
Alberto Delmas Lascorz
Matthieu Courbariaux
Yoshua Bengio
Vincent Gripon
Andreas Moshovos
MQ
148
25
0
08 Feb 2020
Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation
International Conference on Learning Representations (ICLR), 2020
Byung Hoon Ahn
Prannoy Pilligundla
Amir Yazdanbakhsh
H. Esmaeilzadeh
ODL
191
87
0
23 Jan 2020
Towards Efficient Training for Neural Network Quantization
Qing Jin
Linjie Yang
Zhenyu A. Liao
MQ
223
42
0
21 Dec 2019
AdaBits: Neural Network Quantization with Adaptive Bit-Widths
Computer Vision and Pattern Recognition (CVPR), 2019
Qing Jin
Linjie Yang
Zhenyu A. Liao
MQ
189
143
0
20 Dec 2019
GroSS: Group-Size Series Decomposition for Grouped Architecture Search
Henry Howard-Jenkins
Yiwen Li
V. Prisacariu
182
0
0
02 Dec 2019
Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks
IEEE International Conference on Computer Vision (ICCV), 2019
Yazhe Niu
Xianglong Liu
Shenghu Jiang
Tian-Hao Li
Peng Hu
Jiazhen Lin
F. Yu
Junjie Yan
MQ
232
507
0
14 Aug 2019
Divide and Conquer: Leveraging Intermediate Feature Representations for Quantized Training of Neural Networks
International Conference on Machine Learning (ICML), 2019
Ahmed T. Elthakeb
Prannoy Pilligundla
Alex Cloninger
H. Esmaeilzadeh
MQ
171
9
0
14 Jun 2019
Reinforcement Learning and Adaptive Sampling for Optimized DNN Compilation
Byung Hoon Ahn
Prannoy Pilligundla
H. Esmaeilzadeh
129
21
0
30 May 2019
Mixed Precision DNNs: All you need is a good parametrization
Stefan Uhlich
Lukas Mauch
Fabien Cardinaux
K. Yoshiyama
Javier Alonso García
Stephen Tiedemann
Thomas Kemp
Akira Nakamura
MQ
234
39
0
27 May 2019
SinReQ: Generalized Sinusoidal Regularization for Low-Bitwidth Deep Quantized Training
Ahmed T. Elthakeb
Prannoy Pilligundla
H. Esmaeilzadeh
MQ
183
9
0
04 May 2019
Towards Learning of Filter-Level Heterogeneous Compression of Convolutional Neural Networks
Y. Zur
Chaim Baskin
Evgenii Zheltonozhskii
Brian Chmiel
Itay Evron
A. Bronstein
A. Mendelson
MQ
169
7
0
22 Apr 2019
AutoQ: Automated Kernel-Wise Neural Network Quantization
Qian Lou
Feng Guo
Lantao Liu
Minje Kim
Lei Jiang
MQ
234
109
0
15 Feb 2019
1