2401.14110
Towards Cheaper Inference in Deep Networks with Lower Bit-Width Accumulators
25 January 2024
Yaniv Blumenfeld
Itay Hubara
Daniel Soudry
Papers citing "Towards Cheaper Inference in Deep Networks with Lower Bit-Width Accumulators"
2 / 2 papers shown
Overcoming Oscillations in Quantization-Aware Training
Markus Nagel
Marios Fournarakis
Yelysei Bondarenko
Tijmen Blankevoort
MQ
106
70
0
21 Mar 2022
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
Tailin Liang
C. Glossner
Lei Wang
Shaobo Shi
Xiaotong Zhang
MQ
124
665
0
24 Jan 2021