ResearchTrend.AI

RIBBON: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances

23 July 2022
Baolin Li
Rohan Basu Roy
Tirthak Patel
V. Gadepally
K. Gettings
Devesh Tiwari
arXiv: 2207.11434

Papers citing "RIBBON: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances"

7 / 7 papers shown
ESG: Pipeline-Conscious Efficient Scheduling of DNN Workflows on Serverless Platforms with Shareable GPUs
Xinning Hui
Yuanchao Xu
Zhishan Guo
Xipeng Shen
30
4
0
25 Apr 2024
HEET: A Heterogeneity Measure to Quantify the Difference across Distributed Computing Systems
Ali Mokhtari
Saeid Ghafouri
Pooyan Jamshidi
Mohsen Amini Salehi
16
1
0
06 Dec 2023
Clover: Toward Sustainable AI with Carbon-Aware Machine Learning Inference Service
Baolin Li
S. Samsi
V. Gadepally
Devesh Tiwari
20
27
0
19 Apr 2023
KAIROS: Building Cost-Efficient Machine Learning Inference Systems with Heterogeneous Cloud Resources
Baolin Li
S. Samsi
V. Gadepally
Devesh Tiwari
19
11
0
12 Oct 2022
PROFET: Profiling-based CNN Training Latency Prophet for GPU Cloud Instances
Sungjae Lee
Y. Hur
Subin Park
Kyungyong Lee
11
1
0
10 Aug 2022
SMLT: A Serverless Framework for Scalable and Adaptive Machine Learning Design and Training
Ahsan Ali
Syed Zawad
Paarijaat Aditya
Istemi Ekin Akkus
Ruichuan Chen
Feng Yan
24
9
0
04 May 2022
Deep Learning Training in Facebook Data Centers: Design of Scale-up and Scale-out Systems
Maxim Naumov
John Kim
Dheevatsa Mudigere
Srinivas Sridharan
Xiaodong Wang
...
Krishnakumar Nair
Isabel Gao
Bor-Yiing Su
Jiyan Yang
M. Smelyanskiy
GNN
41
83
0
20 Mar 2020