2002.00104
Cited By
v1
v2 (latest)
Post-Training Piecewise Linear Quantization for Deep Neural Networks
European Conference on Computer Vision (ECCV), 2020
31 January 2020
Jun Fang
Ali Shafiee
Hamzah Abdel-Aziz
D. Thorsley
Georgios Georgiadis
Joseph Hassoun
MQ
Papers citing "Post-Training Piecewise Linear Quantization for Deep Neural Networks"
50 / 73 papers shown
Title
MDM: Manhattan Distance Mapping of DNN Weights for Parasitic-Resistance-Resilient Memristive Crossbars
International Conference on Learning Representations (ICLR), 2025
Matheus Farias
Wanghley Martins
H. T. Kung
64
0
0
06 Nov 2025
Outlier-Aware Post-Training Quantization for Image Super-Resolution
Hailing Wang
Jianglin Lu
Yitian Zhang
Y. Fu
MQ
100
0
0
01 Nov 2025
AccuQuant: Simulating Multiple Denoising Steps for Quantizing Diffusion Models
Seunghoon Lee
Jeongwoo Choi
Byunggwan Son
Jaehyeon Moon
Jeimin Jeon
Bumsub Ham
DiffM
MQ
176
0
0
23 Oct 2025
Collaborative Compression for Large-Scale MoE Deployment on Edge
Yixiao Chen
Yanyue Xie
Ruining Yang
Wei Jiang
Wei Wang
Yong He
Yue Chen
Pu Zhao
Y. Wang
MQ
56
0
0
30 Sep 2025
Bi-VLM: Pushing Ultra-Low Precision Post-Training Quantization Boundaries in Vision-Language Models
Xijun Wang
Junyun Huang
Rayyan Abdalla
Chengyuan Zhang
Ruiqi Xian
Wanrong Zhu
MQ
VLM
123
0
0
23 Sep 2025
Enhancing Quantization-Aware Training on Edge Devices via Relative Entropy Coreset Selection and Cascaded Layer Correction
Yujia Tong
Jingling Yuan
Chuang Hu
MQ
154
1
0
17 Jul 2025
Robust Machine Unlearning for Quantized Neural Networks via Adaptive Gradient Reweighting with Similar Labels
Yujia Tong
Yuze Wang
Jingling Yuan
Chuang Hu
NoLa
234
2
0
18 Mar 2025
Task Vector Quantization for Memory-Efficient Model Merging
Youngeun Kim
Seunghwan Lee
Aecheon Jung
Bogon Ryu
Sungeun Hong
MQ
MoMe
224
3
0
10 Mar 2025
On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance
Jaskirat Singh
Bram Adams
Ahmed E. Hassan
VLM
334
1
0
01 Nov 2024
Efficient Reprogramming of Memristive Crossbars for DNNs: Weight Sorting and Bit Stucking
International Symposium on Circuits and Systems (ISCAS), 2024
Matheus Farias
H. T. Kung
MQ
117
2
0
29 Oct 2024
Sorted Weight Sectioning for Energy-Efficient Unstructured Sparse DNNs on Compute-in-Memory Crossbars
International Symposium on Circuits and Systems (ISCAS), 2024
Matheus Farias
H. T. Kung
170
2
0
15 Oct 2024
Post-Training Quantization in Brain-Computer Interfaces based on Event-Related Potential Detection
IEEE International Conference on Systems, Man and Cybernetics (SMC), 2024
H. Cecotti
Dalvir Dhaliwal
Hardip Singh
Y. Meena
MQ
61
0
0
10 Oct 2024
Art and Science of Quantizing Large-Scale Models: A Comprehensive Overview
Yanshu Wang
Tong Yang
Xiyan Liang
Guoan Wang
Hanning Lu
Xu Zhe
Yaoming Li
Li Weitao
MQ
257
5
0
18 Sep 2024
Quantizing YOLOv7: A Comprehensive Study
Mohammadamin Baghbanbashi
Mohsen Raji
B. Ghavami
MQ
142
10
0
06 Jul 2024
Robust Knowledge Distillation Based on Feature Variance Against Backdoored Teacher Model
Jinyin Chen
Xiaoming Zhao
Haibin Zheng
Xiao Li
Sheng Xiang
Haifeng Guo
AAML
124
7
0
01 Jun 2024
EdgeSight: Enabling Modeless and Cost-Efficient Inference at the Edge
ChonLam Lao
Jiaqi Gao
Ganesh Ananthanarayanan
Aditya Akella
Minlan Yu
VLM
181
0
0
29 May 2024
Predicting High-precision Depth on Low-Precision Devices Using 2D Hilbert Curves
Mykhail M. Uss
Ruslan Yermolenko
Olena Kolodiazhna
Ivan Safonov
Volodymyr Savin
Yoonjae Yeo
Seowon Ji
Jaeyun Jeong
MQ
175
0
0
22 May 2024
Investigating the Impact of Quantization on Adversarial Robustness
Qun Li
Yuan Meng
Chen Tang
Jiacheng Jiang
Zhi Wang
144
11
0
08 Apr 2024
DNN Memory Footprint Reduction via Post-Training Intra-Layer Multi-Precision Quantization
IEEE International Symposium on Quality Electronic Design (ISQED), 2024
B. Ghavami
Amin Kamjoo
Lesley Shannon
S. Wilton
MQ
138
0
0
03 Apr 2024
Instance-Aware Group Quantization for Vision Transformers
Jaehyeon Moon
Jeimin Jeon
Junyong Cheon
Bumsub Ham
MQ
ViT
226
15
0
01 Apr 2024
On the Impact of Black-box Deployment Strategies for Edge AI on Latency and Model Performance
Jaskirat Singh
Emad Fallahzadeh
Bram Adams
Ahmed E. Hassan
MQ
339
5
0
25 Mar 2024
Achieving Pareto Optimality using Efficient Parameter Reduction for DNNs in Resource-Constrained Edge Environment
Atah Nuh Mih
Alireza Rahimi
Asfia Kawnine
Francis Palma
Monica Wachowicz
R. Dubay
Hung Cao
218
0
0
14 Mar 2024
QuantTune: Optimizing Model Quantization with Adaptive Outlier-Driven Fine Tuning
Jiun-Man Chen
Yu-Hsuan Chao
Yu-Jie Wang
Ming-Der Shieh
Chih-Chung Hsu
Wei-Fen Lin
MQ
222
2
0
11 Mar 2024
Tiny Reinforcement Learning for Quadruped Locomotion using Decision Transformers
Orhan Eren Akgün
Néstor Cuevas
Matheus Farias
Daniel Garces
173
1
0
20 Feb 2024
BiLLM: Pushing the Limit of Post-Training Quantization for LLMs
International Conference on Machine Learning (ICML), 2024
Wei Huang
Yangdong Liu
Haotong Qin
Ying Li
Shiming Zhang
Xianglong Liu
Michele Magno
Xiaojuan Qi
MQ
259
125
0
06 Feb 2024
LLM-Powered Hierarchical Language Agent for Real-time Human-AI Coordination
Jijia Liu
Chao Yu
Jiaxuan Gao
Yuqing Xie
Qingmin Liao
Yi Wu
Yu Wang
LLMAG
LM&Ro
336
55
0
23 Dec 2023
IDKM: Memory Efficient Neural Network Quantization via Implicit, Differentiable k-Means
Sean Jaffe
Ambuj K. Singh
Francesco Bullo
MQ
175
0
0
12 Dec 2023
GenQ: Quantization in Low Data Regimes with Generative Synthetic Data
European Conference on Computer Vision (ECCV), 2023
Yuhang Li
Youngeun Kim
Donghyun Lee
Souvik Kundu
Priyadarshini Panda
MQ
261
6
0
07 Dec 2023
Efficient Neural Networks for Tiny Machine Learning: A Comprehensive Review
M. Lê
Pierre Wolinski
Julyan Arbel
219
16
0
20 Nov 2023
Exploring Post-Training Quantization of Protein Language Models
IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2023
Shuang Peng
Fei Yang
Ning Sun
Sheng Chen
Yanfeng Jiang
Aimin Pan
MQ
122
0
0
30 Oct 2023
SINF: Semantic Neural Network Inference with Semantic Subgraphs
Sazzad Sayyed
Jonathan D. Ashdown
211
0
0
02 Oct 2023
A Survey on Model Compression for Large Language Models
Transactions of the Association for Computational Linguistics (TACL), 2023
Xunyu Zhu
Jian Li
Yong Liu
Can Ma
Weiping Wang
294
332
0
15 Aug 2023
Digital Modeling on Large Kernel Metamaterial Neural Network
Journal of Imaging Science and Technology (JIST), 2023
Quan Liu
Hanyu Zheng
Brandon T. Swartz
Ho Hin Lee
Zuhayr Asad
I. Kravchenko
Jason G Valentine
Yuankai Huo
132
6
0
21 Jul 2023
Q-YOLO: Efficient Inference for Real-time Object Detection
Asian Conference on Pattern Recognition (ACPR), 2023
Mingze Wang
H. Sun
Jun Shi
Xuhui Liu
Baochang Zhang
Xianbin Cao
ObjD
135
14
0
01 Jul 2023
Efficient Online Processing with Deep Neural Networks
Lukas Hedegaard
167
0
0
23 Jun 2023
Towards Accurate Post-training Quantization for Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2023
Changyuan Wang
Ziwei Wang
Xiuwei Xu
Yansong Tang
Jie Zhou
Jiwen Lu
MQ
246
34
0
30 May 2023
MBQuant: A Novel Multi-Branch Topology Method for Arbitrary Bit-width Network Quantization
Pattern Recognition (Pattern Recogn.), 2023
Mingliang Xu
Yuyao Zhou
Jiayi Ji
Rongrong Ji
MQ
197
7
0
14 May 2023
GSB: Group Superposition Binarization for Vision Transformer with Limited Training Samples
Neural Networks (Neural Netw.), 2023
T. Gao
Chengzhong Xu
Le Zhang
Hui Kong
323
9
0
13 May 2023
CrAFT: Compression-Aware Fine-Tuning for Efficient Visual Task Adaptation
J. Heo
S. Azizi
A. Fayyazi
Massoud Pedram
177
1
0
08 May 2023
Q-DETR: An Efficient Low-Bit Quantized Detection Transformer
Computer Vision and Pattern Recognition (CVPR), 2023
Sheng Xu
Yanjing Li
Mingbao Lin
Penglei Gao
Guodong Guo
Jinhu Lu
Baochang Zhang
MQ
154
33
0
01 Apr 2023
Towards Accurate Post-Training Quantization for Vision Transformer
ACM Multimedia (ACM MM), 2022
Yifu Ding
Haotong Qin
Qing-Yu Yan
Z. Chai
Junjie Liu
Xiaolin K. Wei
Xianglong Liu
MQ
187
89
0
25 Mar 2023
Ultra-low Precision Multiplication-free Training for Deep Neural Networks
Yu Xie
Rui Zhang
Xishan Zhang
Yifan Hao
Zidong Du
Xingui Hu
Ling Li
Qi Guo
MQ
246
2
0
28 Feb 2023
Data Quality-aware Mixed-precision Quantization via Hybrid Reinforcement Learning
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Yingchun Wang
Jingcai Guo
Song Guo
Weizhan Zhang
MQ
153
23
0
09 Feb 2023
PowerQuant: Automorphism Search for Non-Uniform Quantization
International Conference on Learning Representations (ICLR), 2023
Edouard Yvinec
Arnaud Dapogny
Matthieu Cord
Kévin Bailly
MQ
128
22
0
24 Jan 2023
PD-Quant: Post-Training Quantization based on Prediction Difference Metric
Computer Vision and Pattern Recognition (CVPR), 2022
Jiawei Liu
Lin Niu
Zhihang Yuan
Dawei Yang
Xinggang Wang
Wenyu Liu
MQ
433
92
0
14 Dec 2022
Vertical Layering of Quantized Neural Networks for Heterogeneous Inference
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Hai Wu
Ruifei He
Hao Hao Tan
Xiaojuan Qi
Kaibin Huang
MQ
189
6
0
10 Dec 2022
Q-ViT: Accurate and Fully Quantized Low-bit Vision Transformer
Neural Information Processing Systems (NeurIPS), 2022
Yanjing Li
Sheng Xu
Baochang Zhang
Xianbin Cao
Penglei Gao
Guodong Guo
MQ
ViT
193
127
0
13 Oct 2022
Bitwidth-Adaptive Quantization-Aware Neural Network Training: A Meta-Learning Approach
European Conference on Computer Vision (ECCV), 2022
Jiseok Youn
Jaehun Song
Hyung-Sin Kim
S. Bahk
MQ
129
10
0
20 Jul 2022
A Comprehensive Survey on Model Quantization for Deep Neural Networks in Image Classification
ACM Transactions on Intelligent Systems and Technology (ACM TIST), 2022
Babak Rokh
A. Azarpeyvand
Alireza Khanteymoori
MQ
353
163
0
14 May 2022
SPIQ: Data-Free Per-Channel Static Input Quantization
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Edouard Yvinec
Arnaud Dapogny
Matthieu Cord
Kévin Bailly
MQ
97
22
0
28 Mar 2022