Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2502.15763
Cited By
Hybrid Offline-online Scheduling Method for Large Language Model Inference Optimization
14 February 2025
Bowen Pang
Kai Li
Ruifeng She
Feifan Wang
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Hybrid Offline-online Scheduling Method for Large Language Model Inference Optimization"
1 / 1 papers shown
Title
Automatic Operator-level Parallelism Planning for Distributed Deep Learning -- A Mixed-Integer Programming Approach
Ruifeng She
Bowen Pang
Kai Li
Zehua Liu
Tao Zhong
54
0
0
12 Mar 2025
1