ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.15979
  4. Cited By
A Greedy Algorithm for Quantizing Neural Networks
v1v2 (latest)

A Greedy Algorithm for Quantizing Neural Networks

29 October 2020
Eric Lybrand
Rayan Saab
    MQ
ArXiv (abs)PDFHTML

Papers citing "A Greedy Algorithm for Quantizing Neural Networks"

13 / 13 papers shown
Title
Qronos: Correcting the Past by Shaping the Future... in Post-Training Quantization
Qronos: Correcting the Past by Shaping the Future... in Post-Training Quantization
Shihao Zhang
Haoyu Zhang
Ian Colbert
Rayan Saab
MQ
84
0
0
16 May 2025
DiscQuant: A Quantization Method for Neural Networks Inspired by Discrepancy Theory
DiscQuant: A Quantization Method for Neural Networks Inspired by Discrepancy Theory
Jerry Chee
A. Backurs
Rainie Heck
Li Zhang
Janardhan Kulkarni
Thomas Rothvoss
Sivakanth Gopi
MQ
140
1
0
11 Jan 2025
Accumulator-Aware Post-Training Quantization
Accumulator-Aware Post-Training Quantization
Ian Colbert
Fabian Grob
Giuseppe Franco
Jinjie Zhang
Rayan Saab
MQ
71
4
0
25 Sep 2024
Frame Quantization of Neural Networks
Frame Quantization of Neural Networks
Wojciech Czaja
Sanghoon Na
59
1
0
11 Apr 2024
SPFQ: A Stochastic Algorithm and Its Error Analysis for Neural Network
  Quantization
SPFQ: A Stochastic Algorithm and Its Error Analysis for Neural Network Quantization
Jinjie Zhang
Rayan Saab
35
0
0
20 Sep 2023
QuIP: 2-Bit Quantization of Large Language Models With Guarantees
QuIP: 2-Bit Quantization of Large Language Models With Guarantees
Jerry Chee
Yaohui Cai
Volodymyr Kuleshov
Chris De Sa
MQ
123
210
0
25 Jul 2023
Tuning-free one-bit covariance estimation using data-driven dithering
Tuning-free one-bit covariance estimation using data-driven dithering
S. Dirksen
J. Maly
105
7
0
24 Jul 2023
A simple approach for quantizing neural networks
A simple approach for quantizing neural networks
J. Maly
Rayan Saab
MQ
61
12
0
07 Sep 2022
Seeking Interpretability and Explainability in Binary Activated Neural
  Networks
Seeking Interpretability and Explainability in Binary Activated Neural Networks
Benjamin Leblanc
Pascal Germain
FAtt
90
1
0
07 Sep 2022
Algorithms for Discrepancy, Matchings, and Approximations: Fast, Simple,
  and Practical
Algorithms for Discrepancy, Matchings, and Approximations: Fast, Simple, and Practical
Mónika Csikós
Nabil H. Mustafa
28
0
0
02 Sep 2022
Post-training Quantization for Neural Networks with Provable Guarantees
Post-training Quantization for Neural Networks with Provable Guarantees
Jinjie Zhang
Yixuan Zhou
Rayan Saab
MQ
73
33
0
26 Jan 2022
Approximation of functions with one-bit neural networks
Approximation of functions with one-bit neural networks
C. S. Güntürk
Weilin Li
44
9
0
16 Dec 2021
XnODR and XnIDR: Two Accurate and Fast Fully Connected Layers For
  Convolutional Neural Networks
XnODR and XnIDR: Two Accurate and Fast Fully Connected Layers For Convolutional Neural Networks
Jian Sun
A. P. Fard
Mohammad H. Mahoor
3DPC
58
8
0
21 Nov 2021
1