Accurate Neural Training with 4-bit Matrix Multiplications at Standard Formats


    MQ

Papers citing "Accurate Neural Training with 4-bit Matrix Multiplications at Standard Formats"

15 / 15 papers shown
EXAQ: Exponent Aware Quantization For LLMs Acceleration
Moran Shkolnik
Maxim Fishman
Brian Chmiel
Hilla Ben-Yaacov
Ron Banner
Kfir Y. Levy
73
0
0
04 Oct 2024
