Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2411.19379
Cited By
Marconi: Prefix Caching for the Era of Hybrid LLMs
28 November 2024
Rui Pan
Zhuang Wang
Zhen Jia
Can Karakus
Luca Zancato
Tri Dao
Ravi Netravali
Yida Wang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Marconi: Prefix Caching for the Era of Hybrid LLMs"
2 / 2 papers shown
Title
Taming the Titans: A Survey of Efficient LLM Inference Serving
Ranran Zhen
J. Li
Yixin Ji
Z. Yang
Tong Liu
Qingrong Xia
Xinyu Duan
Z. Wang
Baoxing Huai
M. Zhang
LLMAG
77
0
0
28 Apr 2025
From Human Memory to AI Memory: A Survey on Memory Mechanisms in the Era of LLMs
Yaxiong Wu
Sheng Liang
Chen Zhang
Y. Wang
Y. Zhang
Huifeng Guo
Ruiming Tang
Y. Liu
KELM
32
0
0
22 Apr 2025
1