Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2508.10395
Cited By
XQuant: Breaking the Memory Wall for LLM Inference with KV Cache Rematerialization
14 August 2025
Aditya Tomar
Coleman Hooper
M Lee
Haocheng Xi
Rishabh Tiwari
Wonjun Kang
Luca Manolache
Michael W. Mahoney
Kurt Keutzer
A. Gholami
MQ
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (31 upvotes)
Papers citing
"XQuant: Breaking the Memory Wall for LLM Inference with KV Cache Rematerialization"
0 / 0 papers shown
No papers found
Page 1 of 0