ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.10892
  4. Cited By
Reconciling High Accuracy, Cost-Efficiency, and Low Latency of Inference
  Serving Systems
v1v2 (latest)

Reconciling High Accuracy, Cost-Efficiency, and Low Latency of Inference Serving Systems

21 April 2023
Mehran Salmani
Saeid Ghafouri
Alireza Sanaee
Kamran Razavi
M. Muhlhauser
Joseph Doyle
Pooyan Jamshidi
U. O. N. Carolina
ArXiv (abs)PDFHTML

Papers citing "Reconciling High Accuracy, Cost-Efficiency, and Low Latency of Inference Serving Systems"

5 / 5 papers shown
Title
Polymorph: Energy-Efficient Multi-Label Classification for Video Streams on Embedded Devices
Polymorph: Energy-Efficient Multi-Label Classification for Video Streams on Embedded Devices
Saeid Ghafouri
Mohsen Fayyaz
Xiangchen Li
Deepu John
Bo Ji
Dimitrios Nikolopoulos
Hans Vandierendonck
116
1
0
20 Jul 2025
EdgeRL: Reinforcement Learning-driven Deep Learning Model Inference
  Optimization at Edge
EdgeRL: Reinforcement Learning-driven Deep Learning Model Inference Optimization at EdgeConference on Network and Service Management (CNSM), 2024
Motahare Mounesan
Xiaojie Zhang
S. Debroy
145
5
0
16 Oct 2024
Sponge: Inference Serving with Dynamic SLOs Using In-Place Vertical
  Scaling
Sponge: Inference Serving with Dynamic SLOs Using In-Place Vertical Scaling
Kamran Razavi
Saeid Ghafouri
Max Mühlhäuser
Pooyan Jamshidi
Lin Wang
184
6
0
31 Mar 2024
Resource Allocation of Industry 4.0 Micro-Service Applications across
  Serverless Fog Federation
Resource Allocation of Industry 4.0 Micro-Service Applications across Serverless Fog FederationFuture generations computer systems (FGCS), 2024
R. Hussain
Mohsen Amini Salehi
137
15
0
14 Jan 2024
IPA: Inference Pipeline Adaptation to Achieve High Accuracy and
  Cost-Efficiency
IPA: Inference Pipeline Adaptation to Achieve High Accuracy and Cost-EfficiencyJournal of Systems Research (JSR), 2023
Saeid Ghafouri
Kamran Razavi
Mehran Salmani
Alireza Sanaee
T. Lorido-Botran
Lin Wang
Joseph Doyle
Pooyan Jamshidi
197
3
0
24 Aug 2023
1