Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2012.02732
Cited By
Nimble: Lightweight and Parallel GPU Task Scheduling for Deep Learning
4 December 2020
Woosuk Kwon
Gyeong-In Yu
Eunji Jeong
Byung-Gon Chun
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Nimble: Lightweight and Parallel GPU Task Scheduling for Deep Learning"
17 / 17 papers shown
Title
Poplar: Efficient Scaling of Distributed DNN Training on Heterogeneous GPU Clusters
WenZheng Zhang
Yang Hu
Jing Shi
Xiaoying Bai
29
1
0
22 Aug 2024
Orchestrating Quantum Cloud Environments with Qonductor
Emmanouil Giortamis
Francisco Romao
Nathaniel Tornow
Dmitry Lugovoy
Pramod Bhatotia
22
3
0
08 Aug 2024
Compass: A Decentralized Scheduler for Latency-Sensitive ML Workflows
Yuting Yang
Andrea Merlina
Weijia Song
Tiancheng Yuan
Ken Birman
Roman Vitenberg
41
0
0
27 Feb 2024
A Differentiable Framework for End-to-End Learning of Hybrid Structured Compression
Moonjung Eo
Suhyun Kang
Wonjong Rhee
17
1
0
21 Sep 2023
ED-Batch: Efficient Automatic Batching of Dynamic Neural Networks via Learned Finite State Machines
Siyuan Chen
Pratik Fegade
Tianqi Chen
Phillip B. Gibbons
T. Mowry
14
0
0
08 Feb 2023
Baechi: Fast Device Placement of Machine Learning Graphs
Beomyeol Jeon
L. Cai
Chirag Shetty
P. Srivastava
Jintao Jiang
Xiaolan Ke
Yitao Meng
Cong Xie
Indranil Gupta
GNN
11
18
0
20 Jan 2023
PiPAD: Pipelined and Parallel Dynamic GNN Training on GPUs
Chunyang Wang
Desen Sun
Yunru Bai
GNN
AI4CE
39
15
0
01 Jan 2023
A Fast Post-Training Pruning Framework for Transformers
Woosuk Kwon
Sehoon Kim
Michael W. Mahoney
Joseph Hassoun
Kurt Keutzer
A. Gholami
13
143
0
29 Mar 2022
Pathways: Asynchronous Distributed Dataflow for ML
P. Barham
Aakanksha Chowdhery
J. Dean
Sanjay Ghemawat
Steven Hand
...
Parker Schuh
Ryan Sepassi
Laurent El Shafey
C. A. Thekkath
Yonghui Wu
GNN
MoE
45
126
0
23 Mar 2022
Optimal channel selection with discrete QCQP
Yeonwoo Jeong
Deokjae Lee
Gaon An
Changyong Son
Hyun Oh Song
11
1
0
24 Feb 2022
Terra: Imperative-Symbolic Co-Execution of Imperative Deep Learning Programs
Taebum Kim
Eunji Jeong
Geonyong Kim
Yunmo Koo
Sehoon Kim
Gyeong-In Yu
Byung-Gon Chun
AI4CE
17
5
0
23 Jan 2022
Safe and Practical GPU Acceleration in TrustZone
Heejin Park
F. Lin
29
4
0
04 Nov 2021
Scheduling Optimization Techniques for Neural Network Training
Hyungjun Oh
Junyeol Lee
HyeongJu Kim
Jiwon Seo
13
0
0
03 Oct 2021
Characterizing Concurrency Mechanisms for NVIDIA GPUs under Deep Learning Workloads
Guin Gilman
R. Walls
GNN
BDL
32
17
0
01 Oct 2021
Multi-model Machine Learning Inference Serving with GPU Spatial Partitioning
S. Choi
Sunho Lee
Yeonjae Kim
Jongse Park
Youngjin Kwon
Jaehyuk Huh
14
21
0
01 Sep 2021
GPUReplay: A 50-KB GPU Stack for Client ML
Heejin Park
F. Lin
25
9
0
04 May 2021
IOS: Inter-Operator Scheduler for CNN Acceleration
Yaoyao Ding
Ligeng Zhu
Zhihao Jia
Gennady Pekhimenko
Song Han
21
72
0
02 Nov 2020
1