RIBBON: Cost-Effective and QoS-Aware Deep Learning Model Inference using
a Diverse Pool of Cloud Computing Instances

RIBBON: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances

23 July 2022

Baolin Li

Devesh Tiwari

Papers citing "RIBBON: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances"

7 / 7 papers shown

Title
ESG: Pipeline-Conscious Efficient Scheduling of DNN Workflows on Serverless Platforms with Shareable GPUs Xinning Hui Yuanchao Xu Zhishan Guo Xipeng Shen 30 4 0 25 Apr 2024
HEET: A Heterogeneity Measure to Quantify the Difference across Distributed Computing Systems Ali Mokhtari Saeid Ghafouri Pooyan Jamshidi Mohsen Amini Salehi 16 1 0 06 Dec 2023
Clover: Toward Sustainable AI with Carbon-Aware Machine Learning Inference Service Baolin Li S. Samsi V. Gadepally Devesh Tiwari 20 27 0 19 Apr 2023
KAIROS: Building Cost-Efficient Machine Learning Inference Systems with Heterogeneous Cloud Resources Baolin Li S. Samsi V. Gadepally Devesh Tiwari 19 11 0 12 Oct 2022
PROFET: Profiling-based CNN Training Latency Prophet for GPU Cloud Instances Sungjae Lee Y. Hur Subin Park Kyungyong Lee 11 1 0 10 Aug 2022
SMLT: A Serverless Framework for Scalable and Adaptive Machine Learning Design and Training Ahsan Ali Syed Zawad Paarijaat Aditya Istemi Ekin Akkus Ruichuan Chen Feng Yan 24 9 0 04 May 2022
Deep Learning Training in Facebook Data Centers: Design of Scale-up and Scale-out Systems Maxim Naumov John Kim Dheevatsa Mudigere Srinivas Sridharan Xiaodong Wang ... Krishnakumar Nair Isabel Gao Bor-Yiing Su Jiyan Yang M. Smelyanskiy GNN 41 83 0 20 Mar 2020