ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2401.14110
  4. Cited By
Towards Cheaper Inference in Deep Networks with Lower Bit-Width
  Accumulators

Towards Cheaper Inference in Deep Networks with Lower Bit-Width Accumulators

25 January 2024
Yaniv Blumenfeld
Itay Hubara
Daniel Soudry
ArXivPDFHTML

Papers citing "Towards Cheaper Inference in Deep Networks with Lower Bit-Width Accumulators"

2 / 2 papers shown
Title
Overcoming Oscillations in Quantization-Aware Training
Overcoming Oscillations in Quantization-Aware Training
Markus Nagel
Marios Fournarakis
Yelysei Bondarenko
Tijmen Blankevoort
MQ
106
70
0
21 Mar 2022
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
Tailin Liang
C. Glossner
Lei Wang
Shaobo Shi
Xiaotong Zhang
MQ
124
665
0
24 Jan 2021
1