Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2207.11434
Cited By
RIBBON: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances
23 July 2022
Baolin Li
Rohan Basu Roy
Tirthak Patel
V. Gadepally
K. Gettings
Devesh Tiwari
Re-assign community
ArXiv
PDF
HTML
Papers citing
"RIBBON: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances"
7 / 7 papers shown
Title
ESG: Pipeline-Conscious Efficient Scheduling of DNN Workflows on Serverless Platforms with Shareable GPUs
Xinning Hui
Yuanchao Xu
Zhishan Guo
Xipeng Shen
30
4
0
25 Apr 2024
HEET: A Heterogeneity Measure to Quantify the Difference across Distributed Computing Systems
Ali Mokhtari
Saeid Ghafouri
Pooyan Jamshidi
Mohsen Amini Salehi
16
1
0
06 Dec 2023
Clover: Toward Sustainable AI with Carbon-Aware Machine Learning Inference Service
Baolin Li
S. Samsi
V. Gadepally
Devesh Tiwari
20
27
0
19 Apr 2023
KAIROS: Building Cost-Efficient Machine Learning Inference Systems with Heterogeneous Cloud Resources
Baolin Li
S. Samsi
V. Gadepally
Devesh Tiwari
19
11
0
12 Oct 2022
PROFET: Profiling-based CNN Training Latency Prophet for GPU Cloud Instances
Sungjae Lee
Y. Hur
Subin Park
Kyungyong Lee
11
1
0
10 Aug 2022
SMLT: A Serverless Framework for Scalable and Adaptive Machine Learning Design and Training
Ahsan Ali
Syed Zawad
Paarijaat Aditya
Istemi Ekin Akkus
Ruichuan Chen
Feng Yan
24
9
0
04 May 2022
Deep Learning Training in Facebook Data Centers: Design of Scale-up and Scale-out Systems
Maxim Naumov
John Kim
Dheevatsa Mudigere
Srinivas Sridharan
Xiaodong Wang
...
Krishnakumar Nair
Isabel Gao
Bor-Yiing Su
Jiyan Yang
M. Smelyanskiy
GNN
41
83
0
20 Mar 2020
1