Post-Training Piecewise Linear Quantization for Deep Neural Networks

European Conference on Computer Vision (ECCV), 2020
31 January 2020
Jun Fang
Ali Shafiee
Hamzah Abdel-Aziz
D. Thorsley
Georgios Georgiadis
Joseph Hassoun
    MQ

Papers citing "Post-Training Piecewise Linear Quantization for Deep Neural Networks"

50 / 73 papers shown
Title
MDM: Manhattan Distance Mapping of DNN Weights for Parasitic-Resistance-Resilient Memristive Crossbars
International Conference on Learning Representations (ICLR), 2025
Matheus Farias
Wanghley Martins
H. T. Kung
64
0
0
06 Nov 2025
Outlier-Aware Post-Training Quantization for Image Super-Resolution
Hailing Wang
Jianglin Lu
Yitian Zhang
Y. Fu
MQ
100
0
0
01 Nov 2025
AccuQuant: Simulating Multiple Denoising Steps for Quantizing Diffusion Models
Seunghoon Lee
Jeongwoo Choi
Byunggwan Son
Jaehyeon Moon
Jeimin Jeon
Bumsub Ham
DiffM MQ
176
0
0
23 Oct 2025
Collaborative Compression for Large-Scale MoE Deployment on Edge
Yixiao Chen
Yanyue Xie
Ruining Yang
Wei Jiang
Wei Wang
Yong He
Yue Chen
Pu Zhao
Y. Wang
MQ
56
0
0
30 Sep 2025
Bi-VLM: Pushing Ultra-Low Precision Post-Training Quantization Boundaries in Vision-Language Models
Xijun Wang
Junyun Huang
Rayyan Abdalla
Chengyuan Zhang
Ruiqi Xian
Wanrong Zhu
MQ VLM
123
0
0
23 Sep 2025
Enhancing Quantization-Aware Training on Edge Devices via Relative Entropy Coreset Selection and Cascaded Layer Correction
Yujia Tong
Jingling Yuan
Chuang Hu
MQ
154
1
0
17 Jul 2025
Robust Machine Unlearning for Quantized Neural Networks via Adaptive Gradient Reweighting with Similar Labels
Yujia Tong
Yuze Wang
Jingling Yuan
Chuang Hu
NoLa
234
2
0
18 Mar 2025
Task Vector Quantization for Memory-Efficient Model Merging
Youngeun Kim
Seunghwan Lee
Aecheon Jung
Bogon Ryu
Sungeun Hong
MQ MoMe
224
3
0
10 Mar 2025
On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance
Jaskirat Singh
Bram Adams
Ahmed E. Hassan
VLM
334
1
0
01 Nov 2024
Efficient Reprogramming of Memristive Crossbars for DNNs: Weight Sorting and Bit Stucking
International Symposium on Circuits and Systems (ISCAS), 2024
Matheus Farias
H. T. Kung
MQ
117
2
0
29 Oct 2024
Sorted Weight Sectioning for Energy-Efficient Unstructured Sparse DNNs on Compute-in-Memory Crossbars
International Symposium on Circuits and Systems (ISCAS), 2024
Matheus Farias
H. T. Kung
170
2
0
15 Oct 2024
Post-Training Quantization in Brain-Computer Interfaces based on Event-Related Potential Detection
IEEE International Conference on Systems, Man and Cybernetics (SMC), 2024
H. Cecotti
Dalvir Dhaliwal
Hardip Singh
Y. Meena
MQ
61
0
0
10 Oct 2024
Art and Science of Quantizing Large-Scale Models: A Comprehensive Overview
Yanshu Wang
Tong Yang
Xiyan Liang
Guoan Wang
Hanning Lu
Xu Zhe
Yaoming Li
Li Weitao
MQ
257
5
0
18 Sep 2024
Quantizing YOLOv7: A Comprehensive Study
Mohammadamin Baghbanbashi
Mohsen Raji
B. Ghavami
MQ
142
10
0
06 Jul 2024
Robust Knowledge Distillation Based on Feature Variance Against Backdoored Teacher Model
Jinyin Chen
Xiaoming Zhao
Haibin Zheng
Xiao Li
Sheng Xiang
Haifeng Guo
AAML
124
7
0
01 Jun 2024
EdgeSight: Enabling Modeless and Cost-Efficient Inference at the Edge
ChonLam Lao
Jiaqi Gao
Ganesh Ananthanarayanan
Aditya Akella
Minlan Yu
VLM
181
0
0
29 May 2024
Predicting High-precision Depth on Low-Precision Devices Using 2D Hilbert Curves
Mykhail M. Uss
Ruslan Yermolenko
Olena Kolodiazhna
Ivan Safonov
Volodymyr Savin
Yoonjae Yeo
Seowon Ji
Jaeyun Jeong
MQ
175
0
0
22 May 2024
Investigating the Impact of Quantization on Adversarial Robustness
Qun Li
Yuan Meng
Chen Tang
Jiacheng Jiang
Zhi Wang
144
11
0
08 Apr 2024
DNN Memory Footprint Reduction via Post-Training Intra-Layer Multi-Precision Quantization
IEEE International Symposium on Quality Electronic Design (ISQED), 2024
B. Ghavami
Amin Kamjoo
Lesley Shannon
S. Wilton
MQ
138
0
0
03 Apr 2024
Instance-Aware Group Quantization for Vision Transformers
Jaehyeon Moon
Jeimin Jeon
Junyong Cheon
Bumsub Ham
MQ ViT
226
15
0
01 Apr 2024
On the Impact of Black-box Deployment Strategies for Edge AI on Latency and Model Performance
Jaskirat Singh
Emad Fallahzadeh
Bram Adams
Ahmed E. Hassan
MQ
339
5
0
25 Mar 2024
Achieving Pareto Optimality using Efficient Parameter Reduction for DNNs in Resource-Constrained Edge Environment
Atah Nuh Mih
Alireza Rahimi
Asfia Kawnine
Francis Palma
Monica Wachowicz
R. Dubay
Hung Cao
218
0
0
14 Mar 2024
QuantTune: Optimizing Model Quantization with Adaptive Outlier-Driven Fine Tuning
Jiun-Man Chen
Yu-Hsuan Chao
Yu-Jie Wang
Ming-Der Shieh
Chih-Chung Hsu
Wei-Fen Lin
MQ
222
2
0
11 Mar 2024
Tiny Reinforcement Learning for Quadruped Locomotion using Decision Transformers
Orhan Eren Akgün
Néstor Cuevas
Matheus Farias
Daniel Garces
173
1
0
20 Feb 2024
BiLLM: Pushing the Limit of Post-Training Quantization for LLMs
International Conference on Machine Learning (ICML), 2024
Wei Huang
Yangdong Liu
Haotong Qin
Ying Li
Shiming Zhang
Xianglong Liu
Michele Magno
Xiaojuan Qi
MQ
259
125
0
06 Feb 2024
LLM-Powered Hierarchical Language Agent for Real-time Human-AI Coordination
Jijia Liu
Chao Yu
Jiaxuan Gao
Yuqing Xie
Qingmin Liao
Yi Wu
Yu Wang
LLMAG LM&Ro
336
55
0
23 Dec 2023
IDKM: Memory Efficient Neural Network Quantization via Implicit, Differentiable k-Means
Sean Jaffe
Ambuj K. Singh
Francesco Bullo
MQ
175
0
0
12 Dec 2023
GenQ: Quantization in Low Data Regimes with Generative Synthetic Data
European Conference on Computer Vision (ECCV), 2023
Yuhang Li
Youngeun Kim
Donghyun Lee
Souvik Kundu
Priyadarshini Panda
MQ
261
6
0
07 Dec 2023
Efficient Neural Networks for Tiny Machine Learning: A Comprehensive Review
M. Lê
Pierre Wolinski
Julyan Arbel
219
16
0
20 Nov 2023
Exploring Post-Training Quantization of Protein Language Models
IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2023
Shuang Peng
Fei Yang
Ning Sun
Sheng Chen
Yanfeng Jiang
Aimin Pan
MQ
122
0
0
30 Oct 2023
SINF: Semantic Neural Network Inference with Semantic Subgraphs
Sazzad Sayyed
Jonathan D. Ashdown
211
0
0
02 Oct 2023
A Survey on Model Compression for Large Language Models
Transactions of the Association for Computational Linguistics (TACL), 2023
Xunyu Zhu
Jian Li
Yong Liu
Can Ma
Weiping Wang
294
332
0
15 Aug 2023
Digital Modeling on Large Kernel Metamaterial Neural Network
Journal of Imaging Science and Technology (JIST), 2023
Quan Liu
Hanyu Zheng
Brandon T. Swartz
Ho Hin Lee
Zuhayr Asad
I. Kravchenko
Jason G Valentine
Yuankai Huo
132
6
0
21 Jul 2023
Q-YOLO: Efficient Inference for Real-time Object Detection
Asian Conference on Pattern Recognition (ACPR), 2023
Mingze Wang
H. Sun
Jun Shi
Xuhui Liu
Baochang Zhang
Xianbin Cao
ObjD
135
14
0
01 Jul 2023
Efficient Online Processing with Deep Neural Networks
Lukas Hedegaard
167
0
0
23 Jun 2023
Towards Accurate Post-training Quantization for Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2023
Changyuan Wang
Ziwei Wang
Xiuwei Xu
Yansong Tang
Jie Zhou
Jiwen Lu
MQ
246
34
0
30 May 2023
MBQuant: A Novel Multi-Branch Topology Method for Arbitrary Bit-width Network Quantization
Pattern Recognition (Pattern Recogn.), 2023
Mingliang Xu
Yuyao Zhou
Jiayi Ji
Rongrong Ji
MQ
197
7
0
14 May 2023
GSB: Group Superposition Binarization for Vision Transformer with Limited Training Samples
Neural Networks (Neural Netw.), 2023
T. Gao
Chengzhong Xu
Le Zhang
Hui Kong
323
9
0
13 May 2023
CrAFT: Compression-Aware Fine-Tuning for Efficient Visual Task Adaptation
J. Heo
S. Azizi
A. Fayyazi
Massoud Pedram
177
1
0
08 May 2023
Q-DETR: An Efficient Low-Bit Quantized Detection Transformer
Computer Vision and Pattern Recognition (CVPR), 2023
Sheng Xu
Yanjing Li
Mingbao Lin
Penglei Gao
Guodong Guo
Jinhu Lu
Baochang Zhang
MQ
154
33
0
01 Apr 2023
Towards Accurate Post-Training Quantization for Vision Transformer
ACM Multimedia (ACM MM), 2022
Yifu Ding
Haotong Qin
Qing-Yu Yan
Z. Chai
Junjie Liu
Xiaolin K. Wei
Xianglong Liu
MQ
187
89
0
25 Mar 2023
Ultra-low Precision Multiplication-free Training for Deep Neural Networks
Yu Xie
Rui Zhang
Xishan Zhang
Yifan Hao
Zidong Du
Xingui Hu
Ling Li
Qi Guo
MQ
246
2
0
28 Feb 2023
Data Quality-aware Mixed-precision Quantization via Hybrid Reinforcement Learning
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Yingchun Wang
Jingcai Guo
Song Guo
Weizhan Zhang
MQ
153
23
0
09 Feb 2023
PowerQuant: Automorphism Search for Non-Uniform Quantization
International Conference on Learning Representations (ICLR), 2023
Edouard Yvinec
Arnaud Dapogny
Matthieu Cord
Kévin Bailly
MQ
128
22
0
24 Jan 2023
PD-Quant: Post-Training Quantization based on Prediction Difference Metric
Computer Vision and Pattern Recognition (CVPR), 2022
Jiawei Liu
Lin Niu
Zhihang Yuan
Dawei Yang
Xinggang Wang
Wenyu Liu
MQ
433
92
0
14 Dec 2022
Vertical Layering of Quantized Neural Networks for Heterogeneous Inference
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Hai Wu
Ruifei He
Hao Hao Tan
Xiaojuan Qi
Kaibin Huang
MQ
189
6
0
10 Dec 2022
Q-ViT: Accurate and Fully Quantized Low-bit Vision Transformer
Neural Information Processing Systems (NeurIPS), 2022
Yanjing Li
Sheng Xu
Baochang Zhang
Xianbin Cao
Penglei Gao
Guodong Guo
MQ ViT
193
127
0
13 Oct 2022
Bitwidth-Adaptive Quantization-Aware Neural Network Training: A Meta-Learning Approach
European Conference on Computer Vision (ECCV), 2022
Jiseok Youn
Jaehun Song
Hyung-Sin Kim
S. Bahk
MQ
129
10
0
20 Jul 2022
A Comprehensive Survey on Model Quantization for Deep Neural Networks in Image Classification
ACM Transactions on Intelligent Systems and Technology (ACM TIST), 2022
Babak Rokh
A. Azarpeyvand
Alireza Khanteymoori
MQ
353
163
0
14 May 2022
SPIQ: Data-Free Per-Channel Static Input Quantization
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Edouard Yvinec
Arnaud Dapogny
Matthieu Cord
Kévin Bailly
MQ
97
22
0
28 Mar 2022