Semantic Caching of Contextual Summaries for Efficient Question-Answering with Language Models

16 May 2025
Camille Couturier
Spyros Mastorakis
Haiying Shen
Saravan Rajmohan
Victor Rühle
    KELM

Papers citing "Semantic Caching of Contextual Summaries for Efficient Question-Answering with Language Models"

2 / 2 papers shown
HA-RAG: Hotness-Aware RAG Acceleration via Mixed Precision and Data Placement
Danying Ge
Jianhua Gao
Yixue Yang
Weixing Ji
165
0
0
23 Oct 2025
Billion-scale similarity search with GPUs
IEEE Transactions on Big Data (TBD), 2017
Jeff Johnson
Matthijs Douze
Edouard Grave
970
4,531
0
28 Feb 2017