Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.21553
Cited By
Precision Where It Matters: A Novel Spike Aware Mixed-Precision Quantization Strategy for LLaMA-based Language Models
30 April 2025
Lucas Maisonnave
Cyril Moineau
Olivier Bichler
Fabrice Rastello
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Precision Where It Matters: A Novel Spike Aware Mixed-Precision Quantization Strategy for LLaMA-based Language Models"
1 / 1 papers shown
Title
Gradual Binary Search and Dimension Expansion : A general method for activation quantization in LLMs
Lucas Maisonnave
Cyril Moineau
Olivier Bichler
Fabrice Rastello
MQ
18
0
0
18 Apr 2025
1