Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2203.05740
Cited By
v1
v2 (latest)
QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quantization
International Conference on Learning Representations (ICLR), 2022
11 March 2022
Xiuying Wei
Yazhe Niu
Yuhang Li
Xianglong Liu
F. Yu
MQ
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Github (122★)
Papers citing
"QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quantization"
50 / 124 papers shown
Title
Layer-Wise High-Impact Parameter Ratio Optimization in Post-Training Quantization for Large Language Models
Cuong Pham
Hoang Anh Dung
Cuong C. Nguyen
Trung Le
G. Carneiro
Thanh-Toan Do
MQ
125
0
0
21 Nov 2025
D4C: Data-free Quantization for Contrastive Language-Image Pre-training Models
Wenlun Zhang
Yunshan Zhong
Zihao Ding
Xinyu Li
Kentaro Yoshioka
MQ
CLIP
VLM
147
0
0
19 Nov 2025
DynaQuant: Dynamic Mixed-Precision Quantization for Learned Image Compression
Youneng Bao
Yulong Cheng
Y. Liu
Yichen Yang
Peng Qin
Mu Li
Yongsheng Liang
MQ
66
0
0
11 Nov 2025
GABFusion: Rethinking Feature Fusion for Low-Bit Quantization of Multi-Task Networks
Zhaoyang Wang
Dong Wang
MQ
80
0
0
08 Nov 2025
Efficiently Training A Flat Neural Network Before It has been Quantizated
Peng Xia
Junbiao Pang
Tianyang Cai
MQ
100
0
0
03 Nov 2025
AccuQuant: Simulating Multiple Denoising Steps for Quantizing Diffusion Models
Seunghoon Lee
Jeongwoo Choi
Byunggwan Son
Jaehyeon Moon
Jeimin Jeon
Bumsub Ham
DiffM
MQ
196
0
0
23 Oct 2025
Adaptively Sampling-Reusing-Mixing Decomposed Gradients to Speed Up Sharpness Aware Minimization
Jiaxin Deng
Junbiao Pang
132
0
0
04 Oct 2025
Cat: Post-Training Quantization Error Reduction via Cluster-based Affine Transformation
Ali Zoljodi
Radu Timofte
Masoud Daneshtalab
MQ
119
0
0
30 Sep 2025
Quantized Visual Geometry Grounded Transformer
Weilun Feng
Haotong Qin
Mingqiang Wu
Chuanguang Yang
Yuqi Li
...
Zhulin An
Libo Huang
Yulun Zhang
Michele Magno
Yongjun Xu
MQ
3DGS
189
1
0
25 Sep 2025
ProfilingAgent: Profiling-Guided Agentic Reasoning for Adaptive Model Optimization
Sadegh Jafari
Aishwarya Sarkar
Mohiuddin Bilwal
Ali Jannesari
MQ
68
0
0
06 Sep 2025
PTQAT: A Hybrid Parameter-Efficient Quantization Algorithm for 3D Perception Tasks
Xinhao Wang
Zhiwei Lin
Zhongyu Xia
Yongtao Wang
MQ
141
1
0
14 Aug 2025
Test-Time Model Adaptation for Quantized Neural Networks
Zeshuai Deng
Guohao Chen
Shuaicheng Niu
Hui Luo
Shuhai Zhang
Yifan Yang
Renjie Chen
Wei Luo
Mingkui Tan
MQ
139
1
0
04 Aug 2025
Enhancing Generalization in Data-free Quantization via Mixup-class Prompting
Jiwoong Park
Chaeun Lee
Yongseok Choi
Sein Park
Deokki Hong
Jungwook Choi
MQ
154
0
0
29 Jul 2025
First-Order Error Matters: Accurate Compensation for Quantized Large Language Models
Xingyu Zheng
Haotong Qin
Yuye Li
Haoran Chu
Jiakai Wang
Jinyang Guo
Michele Magno
Xianglong Liu
MQ
259
0
0
15 Jul 2025
PQCAD-DM: Progressive Quantization and Calibration-Assisted Distillation for Extremely Efficient Diffusion Model
Beomseok Ko
Hyeryung Jang
MQ
156
0
0
20 Jun 2025
FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation
Computer Vision and Pattern Recognition (CVPR), 2025
Zhuguanyu Wu
Shihe Wang
Jiayi Zhang
Jiaxin Chen
Yunhong Wang
MQ
135
3
0
13 Jun 2025
Unifying Block-wise PTQ and Distillation-based QAT for Progressive Quantization toward 2-bit Instruction-Tuned LLMs
Jung Hyun Lee
Seungjae Shin
Vinnam Kim
Jaeseong You
An Chen
MQ
161
2
0
10 Jun 2025
Flexible Mixed Precision Quantization for Learned Image Compression
IEEE International Conference on Multimedia and Expo (ICME), 2024
Md Adnan Faisal Hossain
Z. Duan
Fengqing Zhu
MQ
172
1
0
02 Jun 2025
Merge-Friendly Post-Training Quantization for Multi-Target Domain Adaptation
Juncheol Shin
Minsang Seok
Seonggon Kim
Eunhyeok Park
MQ
MoMe
156
0
0
29 May 2025
QwT-v2: Practical, Effective and Efficient Post-Training Quantization
Ningyuan Tang
Minghao Fu
Hao Yu
Jianxin Wu
MQ
201
2
0
27 May 2025
NQKV: A KV Cache Quantization Scheme Based on Normal Distribution Characteristics
Zhihang Cai
Xingjun Zhang
Zhendong Tan
Zheng Wei
MQ
335
1
0
22 May 2025
Zero-shot Quantization: A Comprehensive Survey
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Minjun Kim
Jaehyeon Choi
Jongkeun Lee
Wonjin Cho
U. Kang
MQ
327
6
0
14 May 2025
Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model
Navin Ranjan
Andreas E. Savakis
MQ
VLM
318
0
0
08 May 2025
Pack-PTQ: Advancing Post-training Quantization of Neural Networks by Pack-wise Reconstruction
Changjun Li
Runqing Jiang
Zhuo Song
Pengpeng Yu
Ye Zhang
Yulan Guo
MQ
264
0
0
01 May 2025
GPTAQ: Efficient Finetuning-Free Quantization for Asymmetric Calibration
Yuhang Li
Ruokai Yin
Donghyun Lee
Shiting Xiao
Priyadarshini Panda
MQ
341
11
0
03 Apr 2025
APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers
Computer Vision and Pattern Recognition (CVPR), 2025
Zhuguanyu Wu
Jiayi Zhang
Jiaxin Chen
Jinyang Guo
Di Huang
Yunhong Wang
MQ
304
6
0
03 Apr 2025
QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge
Computer Vision and Pattern Recognition (CVPR), 2025
Xuan Shen
Weize Ma
Jing Liu
Changdi Yang
Rui Ding
...
Wei Niu
Yanzhi Wang
Pu Zhao
Jun Lin
Jiuxiang Gu
MQ
323
6
0
20 Mar 2025
Stabilizing Quantization-Aware Training by Implicit-Regularization on Hessian Matrix
Junbiao Pang
Tianyang Cai
314
1
0
14 Mar 2025
Breaking the Limits of Quantization-Aware Defenses: QADT-R for Robustness Against Patch-Based Adversarial Attacks in QNNs
Amira Guesmi
B. Ouni
Muhammad Shafique
MQ
AAML
258
0
0
10 Mar 2025
Post-Training Quantization for Diffusion Transformer via Hierarchical Timestep Grouping
Ning Ding
Jing Han
Yuchuan Tian
Chao Xu
Kai Han
Yehui Tang
MQ
344
1
0
10 Mar 2025
Task Vector Quantization for Memory-Efficient Model Merging
Youngeun Kim
Seunghwan Lee
Aecheon Jung
Bogon Ryu
Sungeun Hong
MQ
MoMe
240
3
0
10 Mar 2025
SAQ-SAM: Semantically-Aligned Quantization for Segment Anything Model
Jing Zhang
Zhiyu Li
Chengzhi Hu
Xuewen Liu
Qingyi Gu
VLM
MQ
181
0
0
09 Mar 2025
AHCPTQ: Accurate and Hardware-Compatible Post-Training Quantization for Segment Anything Model
Wenlun Zhang
Yunshan Zhong
Shimpei Ando
Kentaro Yoshioka
VLM
MQ
445
1
0
05 Mar 2025
CacheQuant: Comprehensively Accelerated Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2025
Xuewen Liu
Zhikai Li
Qingyi Gu
DiffM
172
5
0
03 Mar 2025
PQD: Post-training Quantization for Efficient Diffusion Models
Jiaojiao Ye
Zhen Wang
Linnan Jiang
MQ
242
1
0
03 Jan 2025
PTQ4VM: Post-Training Quantization for Visual Mamba
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Jun-gyu Jin
Changhun Lee
Seonggon Kim
Eunhyeok Park
MQ
Mamba
270
7
0
29 Dec 2024
TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models
AAAI Conference on Artificial Intelligence (AAAI), 2024
Haocheng Huang
Jiaxin Chen
Jinyang Guo
Ruiyi Zhan
Yunhong Wang
DiffM
MQ
247
3
0
21 Dec 2024
Semantic Alignment and Reinforcement for Data-Free Quantization of Vision Transformers
Mingliang Xu
Yuyao Zhou
Yuxin Zhang
Shen Li
Shen Li
Jiayi Ji
Zhanpeng Zeng
Rongrong Ji
MQ
739
0
0
21 Dec 2024
MPQ-DM: Mixed Precision Quantization for Extremely Low Bit Diffusion Models
AAAI Conference on Artificial Intelligence (AAAI), 2024
Weilun Feng
Haotong Qin
Chuanguang Yang
Zhulin An
Libo Huang
Boyu Diao
Fei Wang
Renshuai Tao
Yongjun Xu
Michele Magno
DiffM
MQ
235
14
0
16 Dec 2024
PTSBench: A Comprehensive Post-Training Sparsity Benchmark Towards Algorithms and Models
ACM Multimedia (MM), 2024
Zining Wnag
Jinpei Guo
Yazhe Niu
Yang Yong
Aishan Liu
Yushi Huang
Jiaheng Liu
Xianglong Liu
257
7
0
10 Dec 2024
MPQ-Diff: Mixed Precision Quantization for Diffusion Models
Rocco Manz Maruzzelli
Basile Lewandowski
Lydia Y. Chen
DiffM
MQ
287
0
0
28 Nov 2024
Exploring the Robustness and Transferability of Patch-Based Adversarial Attacks in Quantized Neural Networks
Amira Guesmi
B. Ouni
Mohamed Bennai
AAML
303
0
0
22 Nov 2024
Quantization without Tears
Computer Vision and Pattern Recognition (CVPR), 2024
Minghao Fu
Hao Yu
Jie Shao
Junjie Zhou
Ke Zhu
Jianxin Wu
MQ
604
13
0
21 Nov 2024
IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models
Hang Guo
Yawei Li
Tao Dai
Shu-Tao Xia
Luca Benini
MQ
293
5
0
29 Oct 2024
Data Generation for Hardware-Friendly Post-Training Quantization
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Lior Dikstein
Ariel Lapid
Arnon Netzer
H. Habi
MQ
903
1
0
29 Oct 2024
QEFT: Quantization for Efficient Fine-Tuning of LLMs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Changhun Lee
Jun-gyu Jin
Jun-gyu Jin
Eunhyeok Park
MQ
152
4
0
11 Oct 2024
Constraint Guided Model Quantization of Neural Networks
Quinten Van Baelen
P. Karsmakers
MQ
262
0
0
30 Sep 2024
PTQ4RIS: Post-Training Quantization for Referring Image Segmentation
IEEE International Conference on Robotics and Automation (ICRA), 2024
Xiaoyan Jiang
Hang Yang
Kaiying Zhu
Xihe Qiu
Shibo Zhao
Sifan Zhou
MQ
139
2
0
25 Sep 2024
Thinking in Granularity: Dynamic Quantization for Image Super-Resolution by Intriguing Multi-Granularity Clues
AAAI Conference on Artificial Intelligence (AAAI), 2024
Mingshen Wang
Zhao Zhang
Feng Li
Ke Xu
Kang Miao
Meng Wang
MQ
SupR
197
4
0
22 Sep 2024
Bilateral Sharpness-Aware Minimization for Flatter Minima
Jiaxin Deng
Junbiao Pang
Baochang Zhang
Qingming Huang
AAML
887
0
0
20 Sep 2024
1
2
3
Next