2407.03583
Loki: A System for Serving ML Inference Pipelines with Hardware and Accuracy Scaling
4 July 2024
Sohaib Ahmad, Hui Guan, Ramesh K. Sitaraman
Papers citing "Loki: A System for Serving ML Inference Pipelines with Hardware and Accuracy Scaling"
3 / 3 papers shown
Title
Llama: A Heterogeneous & Serverless Framework for Auto-Tuning Video Analytics Pipelines
Francisco Romero, Mark Zhao, N. Yadwadkar, Christos Kozyrakis
33
100
0
03 Feb 2021
Serverless in the Wild: Characterizing and Optimizing the Serverless Workload at a Large Cloud Provider
Mohammad Shahrad, Rodrigo Fonseca, Íñigo Goiri, G. Chaudhry, Paul Batum, Jason Cooke, Eduardo Laureano, Colby Tresness, M. Russinovich, Ricardo Bianchini
81
601
0
06 Mar 2020
What is the State of Neural Network Pruning?
Davis W. Blalock, Jose Javier Gonzalez Ortiz, Jonathan Frankle, John Guttag
185
1,027
0
06 Mar 2020