Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.15979
Cited By
v1
v2 (latest)
A Greedy Algorithm for Quantizing Neural Networks
29 October 2020
Eric Lybrand
Rayan Saab
MQ
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"A Greedy Algorithm for Quantizing Neural Networks"
13 / 13 papers shown
Title
Qronos: Correcting the Past by Shaping the Future... in Post-Training Quantization
Shihao Zhang
Haoyu Zhang
Ian Colbert
Rayan Saab
MQ
84
0
0
16 May 2025
DiscQuant: A Quantization Method for Neural Networks Inspired by Discrepancy Theory
Jerry Chee
A. Backurs
Rainie Heck
Li Zhang
Janardhan Kulkarni
Thomas Rothvoss
Sivakanth Gopi
MQ
140
1
0
11 Jan 2025
Accumulator-Aware Post-Training Quantization
Ian Colbert
Fabian Grob
Giuseppe Franco
Jinjie Zhang
Rayan Saab
MQ
71
4
0
25 Sep 2024
Frame Quantization of Neural Networks
Wojciech Czaja
Sanghoon Na
59
1
0
11 Apr 2024
SPFQ: A Stochastic Algorithm and Its Error Analysis for Neural Network Quantization
Jinjie Zhang
Rayan Saab
35
0
0
20 Sep 2023
QuIP: 2-Bit Quantization of Large Language Models With Guarantees
Jerry Chee
Yaohui Cai
Volodymyr Kuleshov
Chris De Sa
MQ
123
210
0
25 Jul 2023
Tuning-free one-bit covariance estimation using data-driven dithering
S. Dirksen
J. Maly
105
7
0
24 Jul 2023
A simple approach for quantizing neural networks
J. Maly
Rayan Saab
MQ
61
12
0
07 Sep 2022
Seeking Interpretability and Explainability in Binary Activated Neural Networks
Benjamin Leblanc
Pascal Germain
FAtt
90
1
0
07 Sep 2022
Algorithms for Discrepancy, Matchings, and Approximations: Fast, Simple, and Practical
Mónika Csikós
Nabil H. Mustafa
28
0
0
02 Sep 2022
Post-training Quantization for Neural Networks with Provable Guarantees
Jinjie Zhang
Yixuan Zhou
Rayan Saab
MQ
73
33
0
26 Jan 2022
Approximation of functions with one-bit neural networks
C. S. Güntürk
Weilin Li
44
9
0
16 Dec 2021
XnODR and XnIDR: Two Accurate and Fast Fully Connected Layers For Convolutional Neural Networks
Jian Sun
A. P. Fard
Mohammad H. Mahoor
3DPC
58
8
0
21 Nov 2021
1