Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2110.08633
Cited By
Hydra: A System for Large Multi-Model Deep Learning
16 October 2021
Kabir Nagrecha
Arun Kumar
MoE
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Hydra: A System for Large Multi-Model Deep Learning"
7 / 7 papers shown
Title
Saturn: Efficient Multi-Large-Model Deep Learning
Kabir Nagrecha
Arun Kumar
3DGS
14
0
0
06 Nov 2023
InTune: Reinforcement Learning-based Data Pipeline Optimization for Deep Recommendation Models
Kabir Nagrecha
Lingyi Liu
P. Delgado
Prasanna Padmanabhan
OffRL
AI4CE
25
5
0
13 Aug 2023
Systems for Parallel and Distributed Large-Model Deep Learning Training
Kabir Nagrecha
GNN
VLM
MoE
18
7
0
06 Jan 2023
Chimera: Efficiently Training Large-Scale Neural Networks with Bidirectional Pipelines
Shigang Li
Torsten Hoefler
GNN
AI4CE
LRM
77
130
0
14 Jul 2021
Model-Parallel Model Selection for Deep Learning Systems
Kabir Nagrecha
29
16
0
14 Jul 2021
ZeRO-Offload: Democratizing Billion-Scale Model Training
Jie Ren
Samyam Rajbhandari
Reza Yazdani Aminabadi
Olatunji Ruwase
Shuangyang Yang
Minjia Zhang
Dong Li
Yuxiong He
MoE
160
413
0
18 Jan 2021
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
243
1,815
0
17 Sep 2019
1