iGniter: Interference-Aware GPU Resource Provisioning for Predictable DNN Inference in the Cloud

IEEE Transactions on Parallel and Distributed Systems (TPDS), 2022

3 November 2022

ArXiv (abs)PDF HTML Github (39★)

Papers citing "iGniter: Interference-Aware GPU Resource Provisioning for Predictable DNN Inference in the Cloud"

13 / 13 papers shown

HRS: Hybrid Representation Framework with Scheduling Awareness for Time Series Forecasting in Crowdsourced Cloud-Edge Platforms

200

18 Aug 2025

LithOS: An Operating System for Efficient Machine Learning on GPUsSymposium on Operating Systems Principles (SOSP), 2025

215

21 Apr 2025

PartialLoading: User Scheduling and Bandwidth Allocation for Parameter-sharing Edge Inference

339

29 Mar 2025

Privacy-Aware Joint DNN Model Deployment and Partitioning Optimization for Collaborative Edge Inference ServicesIEEE Transactions on Services Computing (TSC), 2025

409

22 Feb 2025

ParvaGPU: Efficient Spatial GPU Sharing for Large-Scale DNN Inference in Cloud EnvironmentsInternational Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2024

Jihyuk Lee

Dimitrios Nikolopoulos

Cheol-Ho Hong

GNN

197

22 Sep 2024

Resource Allocation and Workload Scheduling for Large-Scale Distributed Deep Learning: A Survey

Xiping Hu

393

12 Jun 2024

HarmonyBatch: Batching multi-SLO DNN Inference with Heterogeneous Serverless FunctionsInternational Workshop on Quality of Service (IWQoS), 2024

215

09 May 2024

Communication-Efficient Large-Scale Distributed Deep Learning: A Comprehensive Survey

Xiping Hu

386

09 Apr 2024

Graft: Efficient Inference Serving for Hybrid Deep Learning with SLO Guarantees via DNN Re-alignment

348

17 Dec 2023

Ultima: Robust and Tail-Optimal AllReduce for Distributed Deep Learning in the CloudSymposium on Networked Systems Design and Implementation (NSDI), 2023

219

10 Oct 2023

Pareto-Secure Machine Learning (PSML): Fingerprinting and Securing Inference Serving Systems

443

03 Jul 2023

Opportunities of Renewable Energy Powered DNN Inference

Seyed Morteza Nabavinejad

Tian Guo

AI4CE

232

21 Jun 2023

Clover: Toward Sustainable AI with Carbon-Aware Machine Learning Inference ServiceInternational Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2023

Baolin Li

S. Samsi

V. Gadepally

Devesh Tiwari

327

19 Apr 2023