Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2001.00281
Cited By
ZeroQ: A Novel Zero Shot Quantization Framework
Computer Vision and Pattern Recognition (CVPR), 2020
1 January 2020
Yaohui Cai
Z. Yao
Zhen Dong
A. Gholami
Michael W. Mahoney
Kurt Keutzer
MQ
Re-assign community
ArXiv (abs)
PDF
HTML
Github (279★)
Papers citing
"ZeroQ: A Novel Zero Shot Quantization Framework"
50 / 249 papers shown
Title
CafeQ: Calibration-free Quantization via Learned Transformations and Adaptive Rounding
Ziteng Sun
Adrian Benton
Samuel Kushnir
Asher Trockman
Vikas Singh
Suhas Diggavi
A. Suresh
MQ
65
0
0
24 Nov 2025
QuantKAN: A Unified Quantization Framework for Kolmogorov Arnold Networks
Kazi Ahmed Asif Fuad
Lizhong Chen
MQ
101
0
0
24 Nov 2025
D4C: Data-free Quantization for Contrastive Language-Image Pre-training Models
Wenlun Zhang
Yunshan Zhong
Zihao Ding
Xinyu Li
Kentaro Yoshioka
MQ
CLIP
VLM
115
0
0
19 Nov 2025
Distribution-Aware Tensor Decomposition for Compression of Convolutional Neural Networks
Alper Kalle
Theo Rudkiewicz
M. Ouerfelli
Mohamed Tamaazousti
220
0
0
06 Nov 2025
FlexiQ: Adaptive Mixed-Precision Quantization for Latency/Accuracy Trade-Offs in Deep Neural Networks
Jaemin Kim
Hongjun Um
Sungkyun Kim
Yongjun Park
Jiwon Seo
MQ
105
0
0
03 Oct 2025
Knowledge Distillation Detection for Open-weights Models
Qin Shi
Amber Yijia Zheng
Qifan Song
Raymond A. Yeh
149
0
0
02 Oct 2025
Cat: Post-Training Quantization Error Reduction via Cluster-based Affine Transformation
Ali Zoljodi
Radu Timofte
Masoud Daneshtalab
MQ
111
0
0
30 Sep 2025
Patch Rebirth: Toward Fast and Transferable Model Inversion of Vision Transformers
Seongsoo Heo
Dong-Wan Choi
189
0
0
27 Sep 2025
Towards Adapting Federated & Quantum Machine Learning for Network Intrusion Detection: A Survey
International Conference on Modern Circuits and Systems Technologies (ICMCST), 2025
Devashish Chaudhary
Sutharshan Rajasegarar
Mengyue Deng
FedML
AI4CE
363
1
0
24 Sep 2025
Q-Palette: Fractional-Bit Quantizers Toward Optimal Bit Allocation for Efficient LLM Deployment
Deokjae Lee
Hyun Oh Song
MQ
117
0
0
24 Sep 2025
Interpreting the Effects of Quantization on LLMs
Manpreet Singh
Hassan Sajjad
MQ
MILM
229
0
0
22 Aug 2025
Quantized Neural Networks for Microcontrollers: A Comprehensive Review of Methods, Platforms, and Applications
Hamza A. Abushahla
Dara Varam
Ariel J. N. Panopio
Mohamed I. AlHajri
MQ
271
0
0
20 Aug 2025
Error Propagation Mechanisms and Compensation Strategies for Quantized Diffusion
Songwei Liu
Hong Liu
Fangmin Chen
Xurui Peng
Chenqian Yan
Lean Fu
Xing Mei
MQ
135
0
0
16 Aug 2025
MiCo: End-to-End Mixed Precision Neural Network Co-Exploration Framework for Edge AI
Zijun Jiang
Yangdi Lyu
MQ
67
0
0
13 Aug 2025
Neutralizing Token Aggregation via Information Augmentation for Efficient Test-Time Adaptation
Yizhe Xiong
Zihan Zhou
Yiwen Liang
H. Chen
Zijia Lin
Tianxiang Hao
Fan Zhang
Jungong Han
Guiguang Ding
185
0
0
05 Aug 2025
DFQ-ViT: Data-Free Quantization for Vision Transformers without Fine-tuning
Yujia Tong
Jingling Yuan
Tian Zhang
Jianquan Liu
Chuang Hu
MQ
174
1
0
19 Jul 2025
ReStNet: A Reusable & Stitchable Network for Dynamic Adaptation on IoT Devices
Maoyu Wang
Yao Lu
Jiaqi Nie
Zeyu Wang
Yun Lin
Qi Xuan
Guan Gui
127
0
0
08 Jun 2025
FPTQuant: Function-Preserving Transforms for LLM Quantization
Boris van Breugel
Yelysei Bondarenko
Paul N. Whatmough
Markus Nagel
MQ
218
3
0
05 Jun 2025
TuneComp: Joint Fine-tuning and Compression for Large Foundation Models
Xiangyu Chen
Jing Liu
Ye Wang
Matthew Brand
Wang
T. Koike-Akino
235
0
0
27 May 2025
Zero-shot Quantization: A Comprehensive Survey
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Minjun Kim
Jaehyeon Choi
Jongkeun Lee
Wonjin Cho
U. Kang
MQ
283
6
0
14 May 2025
PDE: Gene Effect Inspired Parameter Dynamic Evolution for Low-light Image Enhancement
Tong Li
Lizhi Wang
Hansen Feng
Lin Zhu
Hua Huang
DiffM
294
0
0
14 May 2025
Quantitative Analysis of Performance Drop in DeepSeek Model Quantization
Enbo Zhao
Yi Shen
Shuming Shi
Jieyun Huang
Z. Chen
Rongjia Du
Siqi Xiao
Jing Zhang
Ning Wang
Shiguo Lian
MQ
490
0
0
05 May 2025
StableQuant: Layer Adaptive Post-Training Quantization for Speech Foundation Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Yeona Hong
Hyewon Han
Woo-Jin Chung
Hong-Goo Kang
MQ
287
0
0
21 Apr 2025
PARQ: Piecewise-Affine Regularized Quantization
Lisa Jin
Jianhao Ma
Zechun Liu
Andrey Gromov
Aaron Defazio
Lin Xiao
MQ
188
3
0
19 Mar 2025
Robust Machine Unlearning for Quantized Neural Networks via Adaptive Gradient Reweighting with Similar Labels
Yujia Tong
Yuze Wang
Jingling Yuan
Chuang Hu
NoLa
234
2
0
18 Mar 2025
A General Error-Theoretical Analysis Framework for Constructing Compression Strategies
Yunquan Zhang
Daning Cheng
Yunquan Zhang
Meiqi Tu
Fangmin Liu
Jiake Tian
170
2
0
19 Feb 2025
DecDEC: A Systems Approach to Advancing Low-Bit LLM Quantization
USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2024
Y. Park
Jake Hyun
Hojoon Kim
Jae W. Lee
MQ
377
0
0
28 Dec 2024
Rethinking Model Redundancy for Low-light Image Enhancement
Tong Li
Lizhi Wang
Hansen Feng
Lin Zhu
Wanxuan Lu
Hua Huang
274
0
0
21 Dec 2024
Semantic Alignment and Reinforcement for Data-Free Quantization of Vision Transformers
Mingliang Xu
Yuyao Zhou
Yuxin Zhang
Shen Li
Shen Li
Jiayi Ji
Zhanpeng Zeng
Rongrong Ji
MQ
695
0
0
21 Dec 2024
Adaptive Calibration: A Unified Conversion Framework of Spiking Neural Network
AAAI Conference on Artificial Intelligence (AAAI), 2023
Xiping Hu
Yuetong Fang
Jiahang Cao
Hongwei Ren
Zhanchen Zhu
261
5
0
18 Dec 2024
Relation-Guided Adversarial Learning for Data-free Knowledge Transfer
International Journal of Computer Vision (IJCV), 2024
Yingping Liang
Ying Fu
205
3
0
16 Dec 2024
SKIM: Any-bit Quantization Pushing The Limits of Post-Training Quantization
Runsheng Bai
Qiang Liu
B. Liu
MQ
300
2
0
05 Dec 2024
On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance
Jaskirat Singh
Bram Adams
Ahmed E. Hassan
VLM
330
1
0
01 Nov 2024
Data Generation for Hardware-Friendly Post-Training Quantization
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Lior Dikstein
Ariel Lapid
Arnon Netzer
H. Habi
MQ
871
1
0
29 Oct 2024
Self-calibration for Language Model Quantization and Pruning
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Miles Williams
G. Chrysostomou
Nikolaos Aletras
MQ
901
1
0
22 Oct 2024
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers
Enze Xie
Junsong Chen
Junyu Chen
Han Cai
Haotian Tang
...
Zhekai Zhang
Zhekai Zhang
Ligeng Zhu
Yaojie Lu
Song Han
VLM
255
177
0
14 Oct 2024
Q-VLM: Post-training Quantization for Large Vision-Language Models
Neural Information Processing Systems (NeurIPS), 2024
Changyuan Wang
Ziwei Wang
Xiuwei Xu
Yansong Tang
Jie Zhou
Jiwen Lu
MQ
336
11
0
10 Oct 2024
QT-DoG: Quantization-aware Training for Domain Generalization
Saqib Javed
Hieu Le
Mathieu Salzmann
OOD
MQ
268
6
0
08 Oct 2024
Constraint Guided Model Quantization of Neural Networks
Quinten Van Baelen
P. Karsmakers
MQ
250
0
0
30 Sep 2024
SPAQ-DL-SLAM: Towards Optimizing Deep Learning-based SLAM for Resource-Constrained Embedded Platforms
International Conference on Control, Automation, Robotics and Vision (ICARCV), 2024
Niraj Pudasaini
Muhammad Abdullah Hanif
Mohamed Bennai
161
4
0
22 Sep 2024
Art and Science of Quantizing Large-Scale Models: A Comprehensive Overview
Yanshu Wang
Tong Yang
Xiyan Liang
Guoan Wang
Hanning Lu
Xu Zhe
Yaoming Li
Li Weitao
MQ
257
5
0
18 Sep 2024
Privacy-Preserving SAM Quantization for Efficient Edge Intelligence in Healthcare
Zhikai Li
Jing Zhang
Qingyi Gu
MedIm
231
3
0
14 Sep 2024
Infrared Domain Adaptation with Zero-Shot Quantization
International Conference on Machine Vision (ICMV), 2024
Burak Sevsay
Erdem Akagündüz
VLM
MQ
296
1
0
25 Aug 2024
Computer Vision Model Compression Techniques for Embedded Systems: A Survey
Computers & graphics (CG), 2024
Alexandre Lopes
Fernando Pereira dos Santos
D. Oliveira
Mauricio Schiezaro
Hélio Pedrini
246
14
0
15 Aug 2024
Layer-Specific Optimization: Sensitivity Based Convolution Layers Basis Search
V. Alekseev
Ilya Lukashevich
Ilia Zharikov
Ilya Vasiliev
217
0
0
12 Aug 2024
MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity
AAAI Conference on Artificial Intelligence (AAAI), 2024
Kanghyun Choi
Hyeyoon Lee
Dain Kwon
Sunjong Park
Kyuyeun Kim
Noseong Park
Jinho Lee
Jinho Lee
MQ
505
6
0
29 Jul 2024
MetaAug: Meta-Data Augmentation for Post-Training Quantization
Cuong Pham
Hoang Anh Dung
Cuong C. Nguyen
Trung Le
Dinh Q. Phung
Gustavo Carneiro
Thanh-Toan Do
MQ
184
1
0
20 Jul 2024
MCU-MixQ: A HW/SW Co-optimized Mixed-precision Neural Network Design Framework for MCUs
Junfeng Gong
Cheng Liu
Long Cheng
Huawei Li
Xiaowei Li
222
2
0
17 Jul 2024
CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs
Akshat Ramachandran
Souvik Kundu
Tushar Krishna
MQ
246
17
0
07 Jul 2024
DataFreeShield: Defending Adversarial Attacks without Training Data
Hyeyoon Lee
Kanghyun Choi
Dain Kwon
Sunjong Park
Mayoore S. Jaiswal
Noseong Park
Jonghyun Choi
Jinho Lee
217
0
0
21 Jun 2024
1
2
3
4
5
Next