2404.02445
MOPAR: A Model Partitioning Framework for Deep Learning Inference Services on Serverless Platforms
3 April 2024
Jiaang Duan, Shiyou Qian, Dingyu Yang, Hanwen Hu, Jian Cao, Guangtao Xue
MoE
Papers citing "MOPAR: A Model Partitioning Framework for Deep Learning Inference Services on Serverless Platforms"
2 / 2 papers shown
Title
iServe: An Intent-based Serving System for LLMs
Dimitrios Liakopoulos, Tianrui Hu, Prasoon Sinha, N. Yadwadkar
VLM
176
0
0
08 Jan 2025
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima, S. Gu, Machel Reid, Yutaka Matsuo, Yusuke Iwasawa
ReLM
LRM
316
4,077
0
24 May 2022