Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.05889
Cited By
KAIROS: Building Cost-Efficient Machine Learning Inference Systems with Heterogeneous Cloud Resources
12 October 2022
Baolin Li
S. Samsi
V. Gadepally
Devesh Tiwari
Re-assign community
ArXiv
PDF
HTML
Papers citing
"KAIROS: Building Cost-Efficient Machine Learning Inference Systems with Heterogeneous Cloud Resources"
7 / 7 papers shown
Title
Deploying Foundation Model Powered Agent Services: A Survey
Wenchao Xu
Jinyu Chen
Peirong Zheng
Xiaoquan Yi
Tianyi Tian
...
Quan Wan
Haozhao Wang
Yunfeng Fan
Qinliang Su
Xuemin Shen
AI4CE
119
1
0
18 Dec 2024
Loki: A System for Serving ML Inference Pipelines with Hardware and Accuracy Scaling
Sohaib Ahmad
Hui Guan
Ramesh K. Sitaraman
40
4
0
04 Jul 2024
Toward Sustainable GenAI using Generation Directives for Carbon-Friendly Large Language Model Inference
Baolin Li
Yankai Jiang
V. Gadepally
Devesh Tiwari
29
15
0
19 Mar 2024
H-EYE: Holistic Resource Modeling and Management for Diversely Scaled Edge-Cloud Systems
Ismet Dagli
Amid Morshedlou
Jamal Rostami
M. E. Belviranli
22
0
0
07 Feb 2024
HEET: A Heterogeneity Measure to Quantify the Difference across Distributed Computing Systems
Ali Mokhtari
Saeid Ghafouri
Pooyan Jamshidi
Mohsen Amini Salehi
18
1
0
06 Dec 2023
Clover: Toward Sustainable AI with Carbon-Aware Machine Learning Inference Service
Baolin Li
S. Samsi
V. Gadepally
Devesh Tiwari
22
27
0
19 Apr 2023
Llama: A Heterogeneous & Serverless Framework for Auto-Tuning Video Analytics Pipelines
Francisco Romero
Mark Zhao
N. Yadwadkar
Christos Kozyrakis
33
100
0
03 Feb 2021
1