Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2207.11428
Cited By
MISO: Exploiting Multi-Instance GPU Capability on Multi-Tenant Systems for Machine Learning
23 July 2022
Baolin Li
Tirthak Patel
S. Samsi
V. Gadepally
Devesh Tiwari
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MISO: Exploiting Multi-Instance GPU Capability on Multi-Tenant Systems for Machine Learning"
11 / 11 papers shown
Title
LithOS: An Operating System for Efficient Machine Learning on GPUs
Patrick H. Coppock
Brian Zhang
Eliot H. Solomon
Vasilis Kypriotis
Leon Yang
Bikash Sharma
Dan Schatzberg
Todd C. Mowry
Dimitrios Skarlatos
40
0
0
21 Apr 2025
Hierarchical Resource Partitioning on Modern GPUs: A Reinforcement Learning Approach
Urvij Saroliya
Eishi Arima
Dai Liu
Martin Schulz
46
1
0
14 May 2024
SCAR: Scheduling Multi-Model AI Workloads on Heterogeneous Multi-Chiplet Module Accelerators
Mohanad Odema
Luke Chen
Hyoukjun Kwon
Mohammad Abdullah Al Faruque
41
4
0
01 May 2024
Fair Resource Allocation in Virtualized O-RAN Platforms
Fatih Aslan
Georgios Iosifidis
J. Ayala-Romero
A. Garcia-Saavedra
Xavier Costa Pérez
30
9
0
17 Feb 2024
H-EYE: Holistic Resource Modeling and Management for Diversely Scaled Edge-Cloud Systems
Ismet Dagli
Amid Morshedlou
Jamal Rostami
M. E. Belviranli
24
0
0
07 Feb 2024
Clover: Toward Sustainable AI with Carbon-Aware Machine Learning Inference Service
Baolin Li
S. Samsi
V. Gadepally
Devesh Tiwari
28
27
0
19 Apr 2023
KAIROS: Building Cost-Efficient Machine Learning Inference Systems with Heterogeneous Cloud Resources
Baolin Li
S. Samsi
V. Gadepally
Devesh Tiwari
30
11
0
12 Oct 2022
An Analysis of Collocation on GPUs for Deep Learning Training
Ties Robroek
Ehsan Yousefzadeh-Asl-Miandoab
Pınar Tözün
22
9
0
13 Sep 2022
MAPA: Multi-Accelerator Pattern Allocation Policy for Multi-Tenant GPU Servers
K. Ranganath
Joshua D. Suetterlein
Joseph Manzano
Shuaiwen Leon Song
Daniel Wong
38
15
0
07 Oct 2021
Clairvoyant Prefetching for Distributed Machine Learning I/O
Nikoli Dryden
Roman Böhringer
Tal Ben-Nun
Torsten Hoefler
33
56
0
21 Jan 2021
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Andrew G. Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
M. Andreetto
Hartwig Adam
3DH
950
20,599
0
17 Apr 2017
1