ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.04713
  4. Cited By
Serving and Optimizing Machine Learning Workflows on Heterogeneous
  Infrastructures

Serving and Optimizing Machine Learning Workflows on Heterogeneous Infrastructures

10 May 2022
Yongji Wu
Matthew Lentz
Danyang Zhuo
Yao Lu
ArXivPDFHTML

Papers citing "Serving and Optimizing Machine Learning Workflows on Heterogeneous Infrastructures"

11 / 11 papers shown
Title
Edge-Cloud Collaborative Computing on Distributed Intelligence and Model Optimization: A Survey
Edge-Cloud Collaborative Computing on Distributed Intelligence and Model Optimization: A Survey
J. H. Liu
Yao Du
Kun Yang
Yan Wang
Xiping Hu
Z. Wang
Y. Liu
Peng Sun
Azzedine Boukerche
Victor C.M. Leung
38
0
0
03 May 2025
Circinus: Efficient Query Planner for Compound ML Serving
Circinus: Efficient Query Planner for Compound ML Serving
Banruo Liu
Wei-Yu Lin
Minghao Fang
Yihan Jiang
Fan Lai
LRM
34
0
0
23 Apr 2025
Algorithmic Data Minimization for Machine Learning over Internet-of-Things Data Streams
Ted Shaowang
Shinan Liu
Jonatas Marques
Nick Feamster
S. Krishnan
31
0
0
07 Mar 2025
Deploying Foundation Model Powered Agent Services: A Survey
Deploying Foundation Model Powered Agent Services: A Survey
Wenchao Xu
Jinyu Chen
Peirong Zheng
Xiaoquan Yi
Tianyi Tian
...
Quan Wan
Haozhao Wang
Yunfeng Fan
Qinliang Su
Xuemin Shen
AI4CE
115
1
0
18 Dec 2024
Teola: Towards End-to-End Optimization of LLM-based Applications
Teola: Towards End-to-End Optimization of LLM-based Applications
Xin Tan
Yimin Jiang
Yitao Yang
Hong-Yu Xu
57
5
0
29 Jun 2024
Biathlon: Harnessing Model Resilience for Accelerating ML Inference
  Pipelines
Biathlon: Harnessing Model Resilience for Accelerating ML Inference Pipelines
Chaokun Chang
Eric Lo
Chunxiao Ye
21
2
0
18 May 2024
Hydro: Adaptive Query Processing of ML Queries
Hydro: Adaptive Query Processing of ML Queries
Gaurav Tarlok Kakkar
Jiashen Cao
Aubhro Sengupta
Joy Arulraj
Hyesoon Kim
28
1
0
22 Mar 2024
Computing in the Era of Large Generative Models: From Cloud-Native to
  AI-Native
Computing in the Era of Large Generative Models: From Cloud-Native to AI-Native
Yao Lu
Song Bian
Lequn Chen
Yongjun He
Yulong Hui
...
Huanchen Zhang
Minjia Zhang
Qizhen Zhang
Tianyi Zhou
Danyang Zhuo
24
7
0
17 Jan 2024
OTAS: An Elastic Transformer Serving System via Token Adaptation
OTAS: An Elastic Transformer Serving System via Token Adaptation
Jinyu Chen
Wenchao Xu
Zicong Hong
Song Guo
Haozhao Wang
Jie Zhang
Deze Zeng
25
4
0
10 Jan 2024
VQPy: An Object-Oriented Approach to Modern Video Analytics
VQPy: An Object-Oriented Approach to Modern Video Analytics
Shan Yu
Zhenting Zhu
Yu Chen
Hanchen Xu
Pengzhan Zhao
Yang Wang
Arthi Padmanabhan
Hugo Latapie
Harry Xu
39
3
0
03 Nov 2023
Llama: A Heterogeneous & Serverless Framework for Auto-Tuning Video
  Analytics Pipelines
Llama: A Heterogeneous & Serverless Framework for Auto-Tuning Video Analytics Pipelines
Francisco Romero
Mark Zhao
N. Yadwadkar
Christos Kozyrakis
31
100
0
03 Feb 2021
1