arXiv:2502.15304
SVDq: 1.25-bit and 410x Key Cache Compression for LLM Attention
24 February 2025
Hong Yankun
Li Xing
Zhen Hui-Ling
Yu Xianzhi
Liu Wulong
Yuan Mingxuan
Papers citing "SVDq: 1.25-bit and 410x Key Cache Compression for LLM Attention"
No papers