2506.14852
Cited By
Cost-Efficient Serving of LLM Agents via Test-Time Plan Caching
17 June 2025
Qizheng Zhang
Michael Wornow
Kunle Olukotun
Papers citing
"Cost-Efficient Serving of LLM Agents via Test-Time Plan Caching"
2 / 2 papers shown
Continuum: Efficient and Robust Multi-Turn LLM Agent Scheduling with KV Cache Time-to-Live
Hanchen Li
Qiuyang Mang
Runyuan He
Qizheng Zhang
Huanzhi Mao
Xiaokun Chen
Alvin Cheung
Joseph E. Gonzalez
Ion Stoica
196
2
0
04 Nov 2025
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models
Qizheng Zhang
Changran Hu
Shubhangi Upasani
Boyuan Ma
Fenglu Hong
...
Mengmeng Ji
Hanchen Li
Urmish Thakker
James Zou
Kunle Olukotun
LLMAG
KELM
223
29
0
06 Oct 2025