Communities
Connect sessions
AI calendar
Organizations
Contact Sales
Search
Open menu
Home
Papers
2502.13965
Cited By
Autellix: An Efficient Serving Engine for LLM Agents as General Programs
20 February 2025
Michael Luo
Xiaoxiang Shi
Colin Cai
Tianjun Zhang
Justin Wong
Longji Xu
Chi Wang
Yanping Huang
Zhifeng Chen
Alfons Kemper
Ion Stoica
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (19 upvotes)
Papers citing
"Autellix: An Efficient Serving Engine for LLM Agents as General Programs"
7 / 7 papers shown
Title
Murakkab: Resource-Efficient Agentic Workflow Orchestration in Cloud Platforms
G. Chaudhry
Esha Choukse
Haoran Qiu
Íñigo Goiri
Rodrigo Fonseca
Adam Belay
Ricardo Bianchini
24
1
0
22 Aug 2025
The Cost of Dynamic Reasoning: Demystifying AI Agents and Test-Time Scaling from an AI Infrastructure Perspective
Jiin Kim
Byeongjun Shin
Jinha Chung
Minsoo Rhu
LLMAG
LRM
113
7
0
04 Jun 2025
AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges
Ranjan Sapkota
Konstantinos I. Roumeliotis
Manoj Karkee
AI4TS
390
63
0
15 May 2025
Tempo: Application-aware LLM Serving with Mixed SLO Requirements
Wei Zhang
Zhiyu Wu
Yi Mu
Banruo Liu
Myungjin Lee
Fan Lai
217
4
0
24 Apr 2025
Throughput-Optimal Scheduling Algorithms for LLM Inference and AI Agents
Yueying Li
Jim Dai
Tianyi Peng
455
6
0
10 Apr 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
ReLM
VLM
OffRL
AI4TS
LRM
598
3,873
0
22 Jan 2025
Large Language Monkeys: Scaling Inference Compute with Repeated Sampling
Bradley Brown
Jordan Juravsky
Ryan Ehrlich
Ronald Clark
Quoc V. Le
Christopher Ré
Azalia Mirhoseini
ALM
LRM
456
469
0
03 Jan 2025
1