Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2006.03031
Cited By
Nimble: Efficiently Compiling Dynamic Neural Networks for Model Inference
4 June 2020
Haichen Shen
Jared Roesch
Zhi Chen
Wei-Neng Chen
Yong Wu
Mu Li
Vin Sharma
Zachary Tatlock
Yida Wang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Nimble: Efficiently Compiling Dynamic Neural Networks for Model Inference"
5 / 5 papers shown
Title
Tilus: A Virtual Machine for Arbitrary Low-Precision GPGPU Computation in LLM Serving
Yaoyao Ding
Bohan Hou
X. Zhang
Allan Lin
Tianqi Chen
Cody Yu Hao
Yida Wang
Gennady Pekhimenko
50
0
0
17 Apr 2025
Relax: Composable Abstractions for End-to-End Dynamic Machine Learning
Ruihang Lai
Junru Shao
Siyuan Feng
Steven Lyubomirsky
Bohan Hou
...
Sunghyun Park
Prakalp Srivastava
Jared Roesch
T. Mowry
Tianqi Chen
45
9
0
01 Nov 2023
ALT: Boosting Deep Learning Performance by Breaking the Wall between Graph and Operator Level Optimizations
Zhiying Xu
Jiafan Xu
H. Peng
Wei Wang
Xiaoliang Wang
...
Haipeng Dai
Yixu Xu
Hao Cheng
Kun Wang
Guihai Chen
20
0
0
22 Oct 2022
Memory Planning for Deep Neural Networks
Maksim Levental
23
4
0
23 Feb 2022
DISC: A Dynamic Shape Compiler for Machine Learning Workloads
Kai Zhu
Wenyi Zhao
Zhen Zheng
Tianyou Guo
Pengzhan Zhao
...
Junjie Bai
Jun Yang
Xiaoyong Liu
Lansong Diao
Wei Lin
25
27
0
09 Mar 2021
1