2504.11651
Cited By
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float
15 April 2025
Tianyi Zhang
Yang Sui
Shaochen Zhong
V. Chaudhary
Xia Hu
Anshumali Shrivastava
MQ
Papers citing "70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float"
No papers