- Poplar: Efficient Scaling of Distributed DNN Training on Heterogeneous GPU Clusters. AAAI Conference on Artificial Intelligence (AAAI), 2024.
- ED-Batch: Efficient Automatic Batching of Dynamic Neural Networks via Learned Finite State Machines. International Conference on Machine Learning (ICML), 2023.
- Baechi: Fast Device Placement of Machine Learning Graphs. ACM Symposium on Cloud Computing (SoCC), 2020.
- PiPAD: Pipelined and Parallel Dynamic GNN Training on GPUs. ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming (PPoPP), 2023.
- A Fast Post-Training Pruning Framework for Transformers. Neural Information Processing Systems (NeurIPS), 2022.
- Pathways: Asynchronous Distributed Dataflow for ML. Conference on Machine Learning and Systems (MLSys), 2022.
- Optimal Channel Selection with Discrete QCQP. International Conference on Artificial Intelligence and Statistics (AISTATS), 2022.
- Terra: Imperative-Symbolic Co-Execution of Imperative Deep Learning Programs. Neural Information Processing Systems (NeurIPS), 2022.
- GPUReplay: A 50-KB GPU Stack for Client ML. International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2021.
- IOS: Inter-Operator Scheduler for CNN Acceleration. Conference on Machine Learning and Systems (MLSys), 2020.