Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2505.11271
Cited By
Semantic Caching of Contextual Summaries for Efficient Question-Answering with Language Models
16 May 2025
Camille Couturier
Spyros Mastorakis
Haiying Shen
Saravan Rajmohan
Victor Rühle
KELM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Semantic Caching of Contextual Summaries for Efficient Question-Answering with Language Models"
2 / 2 papers shown
HA-RAG: Hotness-Aware RAG Acceleration via Mixed Precision and Data Placement
Danying Ge
Jianhua Gao
Yixue Yang
Weixing Ji
165
0
0
23 Oct 2025
Billion-scale similarity search with GPUs
IEEE Transactions on Big Data (TBD), 2017
Jeff Johnson
Matthijs Douze
Edouard Grave
970
4,531
0
28 Feb 2017
1