Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.01713
Cited By
iGniter: Interference-Aware GPU Resource Provisioning for Predictable DNN Inference in the Cloud
3 November 2022
Fei Xu
Jianian Xu
Jiabin Chen
Li Chen
Ruitao Shang
Zhi Zhou
Fengyuan Liu
GNN
Re-assign community
ArXiv
PDF
HTML
Papers citing
"iGniter: Interference-Aware GPU Resource Provisioning for Predictable DNN Inference in the Cloud"
6 / 6 papers shown
Title
LithOS: An Operating System for Efficient Machine Learning on GPUs
Patrick H. Coppock
Brian Zhang
Eliot H. Solomon
Vasilis Kypriotis
Leon Yang
Bikash Sharma
Dan Schatzberg
Todd C. Mowry
Dimitrios Skarlatos
27
0
0
21 Apr 2025
PartialLoading: User Scheduling and Bandwidth Allocation for Parameter-sharing Edge Inference
Guanqiao Qu
Qian Chen
Xianhao Chen
Kaibin Huang
Yuguang Fang
46
1
0
29 Mar 2025
Privacy-Aware Joint DNN Model Deployment and Partition Optimization for Delay-Efficient Collaborative Edge Inference
Zhipeng Cheng
Xiaoyu Xia
Hong Wang
Minghui Liwang
Ning Chen
Xuwei Fan
Xianbin Wang
54
0
0
22 Feb 2025
Graft: Efficient Inference Serving for Hybrid Deep Learning with SLO Guarantees via DNN Re-alignment
Jing Wu
Lin Wang
Qirui Jin
Fangming Liu
31
11
0
17 Dec 2023
Pareto-Secure Machine Learning (PSML): Fingerprinting and Securing Inference Serving Systems
Debopam Sanyal
Jui-Tse Hung
Manavi Agrawal
Prahlad Jasti
Shahab Nikkhoo
S. Jha
Tianhao Wang
Sibin Mohan
Alexey Tumanov
42
0
0
03 Jul 2023
Clover: Toward Sustainable AI with Carbon-Aware Machine Learning Inference Service
Baolin Li
S. Samsi
V. Gadepally
Devesh Tiwari
25
27
0
19 Apr 2023
1