Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2011.01302
Cited By
IOS: Inter-Operator Scheduler for CNN Acceleration
2 November 2020
Yaoyao Ding
Ligeng Zhu
Zhihao Jia
Gennady Pekhimenko
Song Han
Re-assign community
ArXiv
PDF
HTML
Papers citing
"IOS: Inter-Operator Scheduler for CNN Acceleration"
5 / 5 papers shown
Title
Tilus: A Virtual Machine for Arbitrary Low-Precision GPGPU Computation in LLM Serving
Yaoyao Ding
Bohan Hou
X. Zhang
Allan Lin
Tianqi Chen
Cody Yu Hao
Yida Wang
Gennady Pekhimenko
43
0
0
17 Apr 2025
AGO: Boosting Mobile AI Inference Performance by Removing Constraints on Graph Optimization
Zhiying Xu
H. Peng
Wei Wang
GNN
24
3
0
02 Dec 2022
Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models
Muyang Li
Ji Lin
Chenlin Meng
Stefano Ermon
Song Han
Jun-Yan Zhu
DiffM
34
45
0
03 Nov 2022
ALT: Boosting Deep Learning Performance by Breaking the Wall between Graph and Operator Level Optimizations
Zhiying Xu
Jiafan Xu
H. Peng
Wei Wang
Xiaoliang Wang
...
Haipeng Dai
Yixu Xu
Hao Cheng
Kun Wang
Guihai Chen
18
0
0
22 Oct 2022
Third ArchEdge Workshop: Exploring the Design Space of Efficient Deep Neural Networks
Fuxun Yu
Dimitrios Stamoulis
Di Wang
Dimitrios Lymberopoulos
Xiang Chen
3DV
6
1
0
22 Nov 2020
1