ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.01713
  4. Cited By
iGniter: Interference-Aware GPU Resource Provisioning for Predictable
  DNN Inference in the Cloud

iGniter: Interference-Aware GPU Resource Provisioning for Predictable DNN Inference in the Cloud

IEEE Transactions on Parallel and Distributed Systems (TPDS), 2022
3 November 2022
Fei Xu
Jianian Xu
Jiabin Chen
Li Chen
Ruitao Shang
Zhi Zhou
Fengyuan Liu
    GNN
ArXiv (abs)PDFHTMLGithub (39★)

Papers citing "iGniter: Interference-Aware GPU Resource Provisioning for Predictable DNN Inference in the Cloud"

13 / 13 papers shown
HRS: Hybrid Representation Framework with Scheduling Awareness for Time Series Forecasting in Crowdsourced Cloud-Edge Platforms
HRS: Hybrid Representation Framework with Scheduling Awareness for Time Series Forecasting in Crowdsourced Cloud-Edge Platforms
Tiancheng Zhang
Cheng Zhang
S. Liu
Xiaofei Wang
Shaoyuan Huang
Wenyu Wang
AI4TS
200
0
0
18 Aug 2025
LithOS: An Operating System for Efficient Machine Learning on GPUs
LithOS: An Operating System for Efficient Machine Learning on GPUsSymposium on Operating Systems Principles (SOSP), 2025
Patrick H. Coppock
Brian Zhang
Eliot H. Solomon
Vasilis Kypriotis
Leon Yang
Bikash Sharma
Dan Schatzberg
Todd C. Mowry
Dimitrios Skarlatos
215
15
0
21 Apr 2025
PartialLoading: User Scheduling and Bandwidth Allocation for Parameter-sharing Edge Inference
PartialLoading: User Scheduling and Bandwidth Allocation for Parameter-sharing Edge Inference
Guanqiao Qu
Qian Chen
Xianhao Chen
Kaibin Huang
Yuguang Fang
339
6
0
29 Mar 2025
Privacy-Aware Joint DNN Model Deployment and Partitioning Optimization for Collaborative Edge Inference Services
Privacy-Aware Joint DNN Model Deployment and Partitioning Optimization for Collaborative Edge Inference ServicesIEEE Transactions on Services Computing (TSC), 2025
Zhipeng Cheng
Xiaoyu Xia
Hong Wang
Minghui Liwang
Ning Chen
Xuwei Fan
Xianbin Wang
409
0
0
22 Feb 2025
ParvaGPU: Efficient Spatial GPU Sharing for Large-Scale DNN Inference in
  Cloud Environments
ParvaGPU: Efficient Spatial GPU Sharing for Large-Scale DNN Inference in Cloud EnvironmentsInternational Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2024
Munkyu Lee
Sihoon Seong
Minki Kang
Jihyuk Lee
Gap-Joo Na
In-Geol Chun
Dimitrios Nikolopoulos
Cheol-Ho Hong
GNN
197
17
0
22 Sep 2024
Resource Allocation and Workload Scheduling for Large-Scale Distributed
  Deep Learning: A Survey
Resource Allocation and Workload Scheduling for Large-Scale Distributed Deep Learning: A Survey
Feng Liang
Zhen Zhang
Haifeng Lu
Chengming Li
Victor C. M. Leung
Yanyi Guo
Xiping Hu
393
16
0
12 Jun 2024
HarmonyBatch: Batching multi-SLO DNN Inference with Heterogeneous
  Serverless Functions
HarmonyBatch: Batching multi-SLO DNN Inference with Heterogeneous Serverless FunctionsInternational Workshop on Quality of Service (IWQoS), 2024
Jiabin Chen
Fei Xu
Yikun Gu
Li Chen
Fangming Liu
Zhi Zhou
215
9
0
09 May 2024
Communication-Efficient Large-Scale Distributed Deep Learning: A
  Comprehensive Survey
Communication-Efficient Large-Scale Distributed Deep Learning: A Comprehensive Survey
Feng Liang
Zhen Zhang
Haifeng Lu
Victor C. M. Leung
Yanyi Guo
Xiping Hu
GNN
386
27
0
09 Apr 2024
Graft: Efficient Inference Serving for Hybrid Deep Learning with SLO
  Guarantees via DNN Re-alignment
Graft: Efficient Inference Serving for Hybrid Deep Learning with SLO Guarantees via DNN Re-alignment
Jing Wu
Lin Wang
Qirui Jin
Fangming Liu
348
24
0
17 Dec 2023
Ultima: Robust and Tail-Optimal AllReduce for Distributed Deep Learning
  in the Cloud
Ultima: Robust and Tail-Optimal AllReduce for Distributed Deep Learning in the CloudSymposium on Networked Systems Design and Implementation (NSDI), 2023
Ertza Warraich
Omer Shabtai
Khalid Manaa
S. Vargaftik
Y. Piasetzky
Matty Kadosh
Lalith Suresh
Muhammad Shahbaz
219
1
0
10 Oct 2023
Pareto-Secure Machine Learning (PSML): Fingerprinting and Securing
  Inference Serving Systems
Pareto-Secure Machine Learning (PSML): Fingerprinting and Securing Inference Serving Systems
Debopam Sanyal
Jui-Tse Hung
Manavi Agrawal
Prahlad Jasti
Shahab Nikkhoo
S. Jha
Tianhao Wang
Sibin Mohan
Alexey Tumanov
443
1
0
03 Jul 2023
Opportunities of Renewable Energy Powered DNN Inference
Opportunities of Renewable Energy Powered DNN Inference
Seyed Morteza Nabavinejad
Tian Guo
AI4CE
232
2
0
21 Jun 2023
Clover: Toward Sustainable AI with Carbon-Aware Machine Learning
  Inference Service
Clover: Toward Sustainable AI with Carbon-Aware Machine Learning Inference ServiceInternational Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2023
Baolin Li
S. Samsi
V. Gadepally
Devesh Tiwari
327
56
0
19 Apr 2023
1
Page 1 of 1