ResearchTrend.AI
arXiv: 2312.10636
Graft: Efficient Inference Serving for Hybrid Deep Learning with SLO Guarantees via DNN Re-alignment

17 December 2023
Jing Wu
Lin Wang
Qirui Jin
Fangming Liu

Papers citing "Graft: Efficient Inference Serving for Hybrid Deep Learning with SLO Guarantees via DNN Re-alignment"

4 / 4 papers shown
PartialLoading: User Scheduling and Bandwidth Allocation for Parameter-sharing Edge Inference
Guanqiao Qu
Qian Chen
Xianhao Chen
Kaibin Huang
Yuguang Fang
44
0
0
29 Mar 2025
Boosting Mobile CNN Inference through Semantic Memory
Yun Li
Chen Zhang
S. Han
Li Lyna Zhang
B. Yin
Yunxin Liu
Mengwei Xu
ObjD
26
16
0
05 Dec 2021
Smart at what cost? Characterising Mobile Deep Neural Networks in the wild
Mario Almeida
Stefanos Laskaridis
Abhinav Mehrotra
L. Dudziak
Ilias Leontiadis
Nicholas D. Lane
HAI
95
44
0
28 Sep 2021
Llama: A Heterogeneous & Serverless Framework for Auto-Tuning Video Analytics Pipelines
Francisco Romero
Mark Zhao
N. Yadwadkar
Christos Kozyrakis
31
100
0
03 Feb 2021