Scaling Distributed Deep Learning Workloads beyond the Memory Capacity with KARMA

International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2020

26 August 2020

Papers citing "Scaling Distributed Deep Learning Workloads beyond the Memory Capacity with KARMA"

Democratizing AI: Open-source Scalable LLM Training on GPU-based Supercomputers

International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2024

...

875

12 Feb 2025

FedDCT: Federated Learning of Large Convolutional Neural Networks on Resource Constrained Devices using Divide and Collaborative Training

IEEE Transactions on Network and Service Management (IEEE TNSM), 2022

298

20 Nov 2022

PERKS: a Locality-Optimized Execution Model for Iterative Memory-bound GPU Applications

International Conference on Supercomputing (ICS), 2022

270

05 Apr 2022

A Survey and Empirical Evaluation of Parallel Deep Learning Frameworks

240

09 Nov 2021

An Oracle for Guiding Large-Scale Model/Hybrid Parallel Training of Convolutional Neural Networks

IEEE International Symposium on High-Performance Parallel and Distributed Computing (HPDC), 2020

199

19 Apr 2021