Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2412.17560
Cited By
GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference
23 December 2024
Chao Zeng
Songwei Liu
Shu Yang
Fangmin Chen
Xing Mei
Lean Fu
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference"
Title
No papers