arXiv:2312.10636
Graft: Efficient Inference Serving for Hybrid Deep Learning with SLO Guarantees via DNN Re-alignment
17 December 2023
Jing Wu, Lin Wang, Qirui Jin, Fangming Liu
Papers citing "Graft: Efficient Inference Serving for Hybrid Deep Learning with SLO Guarantees via DNN Re-alignment"
4 / 4 papers shown
PartialLoading: User Scheduling and Bandwidth Allocation for Parameter-sharing Edge Inference
Guanqiao Qu, Qian Chen, Xianhao Chen, Kaibin Huang, Yuguang Fang
44
0
0
29 Mar 2025
Boosting Mobile CNN Inference through Semantic Memory
Yun Li, Chen Zhang, S. Han, Li Lyna Zhang, B. Yin, Yunxin Liu, Mengwei Xu
ObjD
26
16
0
05 Dec 2021
Smart at what cost? Characterising Mobile Deep Neural Networks in the wild
Mario Almeida, Stefanos Laskaridis, Abhinav Mehrotra, L. Dudziak, Ilias Leontiadis, Nicholas D. Lane
HAI
95
44
0
28 Sep 2021
Llama: A Heterogeneous & Serverless Framework for Auto-Tuning Video Analytics Pipelines
Francisco Romero, Mark Zhao, N. Yadwadkar, Christos Kozyrakis
31
100
0
03 Feb 2021