ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2012.02732
  4. Cited By
Nimble: Lightweight and Parallel GPU Task Scheduling for Deep Learning

Nimble: Lightweight and Parallel GPU Task Scheduling for Deep Learning

4 December 2020
Woosuk Kwon
Gyeong-In Yu
Eunji Jeong
Byung-Gon Chun
ArXivPDFHTML

Papers citing "Nimble: Lightweight and Parallel GPU Task Scheduling for Deep Learning"

17 / 17 papers shown
Title
Poplar: Efficient Scaling of Distributed DNN Training on Heterogeneous
  GPU Clusters
Poplar: Efficient Scaling of Distributed DNN Training on Heterogeneous GPU Clusters
WenZheng Zhang
Yang Hu
Jing Shi
Xiaoying Bai
29
1
0
22 Aug 2024
Orchestrating Quantum Cloud Environments with Qonductor
Orchestrating Quantum Cloud Environments with Qonductor
Emmanouil Giortamis
Francisco Romao
Nathaniel Tornow
Dmitry Lugovoy
Pramod Bhatotia
22
3
0
08 Aug 2024
Compass: A Decentralized Scheduler for Latency-Sensitive ML Workflows
Compass: A Decentralized Scheduler for Latency-Sensitive ML Workflows
Yuting Yang
Andrea Merlina
Weijia Song
Tiancheng Yuan
Ken Birman
Roman Vitenberg
41
0
0
27 Feb 2024
A Differentiable Framework for End-to-End Learning of Hybrid Structured
  Compression
A Differentiable Framework for End-to-End Learning of Hybrid Structured Compression
Moonjung Eo
Suhyun Kang
Wonjong Rhee
19
1
0
21 Sep 2023
ED-Batch: Efficient Automatic Batching of Dynamic Neural Networks via
  Learned Finite State Machines
ED-Batch: Efficient Automatic Batching of Dynamic Neural Networks via Learned Finite State Machines
Siyuan Chen
Pratik Fegade
Tianqi Chen
Phillip B. Gibbons
T. Mowry
14
0
0
08 Feb 2023
Baechi: Fast Device Placement of Machine Learning Graphs
Baechi: Fast Device Placement of Machine Learning Graphs
Beomyeol Jeon
L. Cai
Chirag Shetty
P. Srivastava
Jintao Jiang
Xiaolan Ke
Yitao Meng
Cong Xie
Indranil Gupta
GNN
11
18
0
20 Jan 2023
PiPAD: Pipelined and Parallel Dynamic GNN Training on GPUs
PiPAD: Pipelined and Parallel Dynamic GNN Training on GPUs
Chunyang Wang
Desen Sun
Yunru Bai
GNN
AI4CE
39
15
0
01 Jan 2023
A Fast Post-Training Pruning Framework for Transformers
A Fast Post-Training Pruning Framework for Transformers
Woosuk Kwon
Sehoon Kim
Michael W. Mahoney
Joseph Hassoun
Kurt Keutzer
A. Gholami
15
143
0
29 Mar 2022
Pathways: Asynchronous Distributed Dataflow for ML
Pathways: Asynchronous Distributed Dataflow for ML
P. Barham
Aakanksha Chowdhery
J. Dean
Sanjay Ghemawat
Steven Hand
...
Parker Schuh
Ryan Sepassi
Laurent El Shafey
C. A. Thekkath
Yonghui Wu
GNN
MoE
45
126
0
23 Mar 2022
Optimal channel selection with discrete QCQP
Optimal channel selection with discrete QCQP
Yeonwoo Jeong
Deokjae Lee
Gaon An
Changyong Son
Hyun Oh Song
11
1
0
24 Feb 2022
Terra: Imperative-Symbolic Co-Execution of Imperative Deep Learning
  Programs
Terra: Imperative-Symbolic Co-Execution of Imperative Deep Learning Programs
Taebum Kim
Eunji Jeong
Geonyong Kim
Yunmo Koo
Sehoon Kim
Gyeong-In Yu
Byung-Gon Chun
AI4CE
17
5
0
23 Jan 2022
Safe and Practical GPU Acceleration in TrustZone
Safe and Practical GPU Acceleration in TrustZone
Heejin Park
F. Lin
29
4
0
04 Nov 2021
Scheduling Optimization Techniques for Neural Network Training
Scheduling Optimization Techniques for Neural Network Training
Hyungjun Oh
Junyeol Lee
HyeongJu Kim
Jiwon Seo
13
0
0
03 Oct 2021
Characterizing Concurrency Mechanisms for NVIDIA GPUs under Deep
  Learning Workloads
Characterizing Concurrency Mechanisms for NVIDIA GPUs under Deep Learning Workloads
Guin Gilman
R. Walls
GNN
BDL
32
17
0
01 Oct 2021
Multi-model Machine Learning Inference Serving with GPU Spatial
  Partitioning
Multi-model Machine Learning Inference Serving with GPU Spatial Partitioning
S. Choi
Sunho Lee
Yeonjae Kim
Jongse Park
Youngjin Kwon
Jaehyuk Huh
16
21
0
01 Sep 2021
GPUReplay: A 50-KB GPU Stack for Client ML
GPUReplay: A 50-KB GPU Stack for Client ML
Heejin Park
F. Lin
25
9
0
04 May 2021
IOS: Inter-Operator Scheduler for CNN Acceleration
IOS: Inter-Operator Scheduler for CNN Acceleration
Yaoyao Ding
Ligeng Zhu
Zhihao Jia
Gennady Pekhimenko
Song Han
23
72
0
02 Nov 2020
1