A Greedy Algorithm for Quantizing Neural Networks

v1v2 (latest)

A Greedy Algorithm for Quantizing Neural Networks

29 October 2020

ArXiv (abs)PDF HTML

Papers citing "A Greedy Algorithm for Quantizing Neural Networks"

13 / 13 papers shown

Title
Qronos: Correcting the Past by Shaping the Future... in Post-Training Quantization Shihao Zhang Haoyu Zhang Ian Colbert Rayan Saab MQ 84 0 0 16 May 2025
DiscQuant: A Quantization Method for Neural Networks Inspired by Discrepancy Theory Jerry Chee A. Backurs Rainie Heck Li Zhang Janardhan Kulkarni Thomas Rothvoss Sivakanth Gopi MQ 140 1 0 11 Jan 2025
Accumulator-Aware Post-Training Quantization Ian Colbert Fabian Grob Giuseppe Franco Jinjie Zhang Rayan Saab MQ 71 4 0 25 Sep 2024
Frame Quantization of Neural Networks Wojciech Czaja Sanghoon Na 59 1 0 11 Apr 2024
SPFQ: A Stochastic Algorithm and Its Error Analysis for Neural Network Quantization Jinjie Zhang Rayan Saab 35 0 0 20 Sep 2023
QuIP: 2-Bit Quantization of Large Language Models With Guarantees Jerry Chee Yaohui Cai Volodymyr Kuleshov Chris De Sa MQ 123 210 0 25 Jul 2023
Tuning-free one-bit covariance estimation using data-driven dithering S. Dirksen J. Maly 105 7 0 24 Jul 2023
A simple approach for quantizing neural networks J. Maly Rayan Saab MQ 61 12 0 07 Sep 2022
Seeking Interpretability and Explainability in Binary Activated Neural Networks Benjamin Leblanc Pascal Germain FAtt 90 1 0 07 Sep 2022
Algorithms for Discrepancy, Matchings, and Approximations: Fast, Simple, and Practical Mónika Csikós Nabil H. Mustafa 28 0 0 02 Sep 2022
Post-training Quantization for Neural Networks with Provable Guarantees Jinjie Zhang Yixuan Zhou Rayan Saab MQ 73 33 0 26 Jan 2022
Approximation of functions with one-bit neural networks C. S. Güntürk Weilin Li 44 9 0 16 Dec 2021
XnODR and XnIDR: Two Accurate and Fast Fully Connected Layers For Convolutional Neural Networks Jian Sun A. P. Fard Mohammad H. Mahoor 3DPC 58 8 0 21 Nov 2021