Accurate Neural Training with 4-bit Matrix Multiplications at Standard
  Formats
v1v2v3v4 (latest)

Accurate Neural Training with 4-bit Matrix Multiplications at Standard Formats

    MQ

Papers citing "Accurate Neural Training with 4-bit Matrix Multiplications at Standard Formats"

15 / 15 papers shown
Title
EXAQ: Exponent Aware Quantization For LLMs Acceleration
EXAQ: Exponent Aware Quantization For LLMs Acceleration
Moran Shkolnik
Maxim Fishman
Brian Chmiel
Hilla Ben-Yaacov
Ron Banner
Kfir Y. Levy
73
0
0
04 Oct 2024