Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2108.09373
Cited By
Understanding Data Storage and Ingestion for Large-Scale Deep Recommendation Model Training
20 August 2021
Mark Zhao
Niket Agarwal
Aarti Basant
B. Gedik
Satadru Pan
Muhammet Mustafa Ozdal
Rakesh Komuravelli
Jerry Y. Pan
Tianshu Bao
Haowei Lu
Sundaram Narayanan
Jack Langman
Kevin Wilfong
Harsha Rastogi
Carole-Jean Wu
Christos Kozyrakis
Parikshit Pol
GNN
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Understanding Data Storage and Ingestion for Large-Scale Deep Recommendation Model Training"
19 / 19 papers shown
Title
OVERLORD: Ultimate Scaling of DataLoader for Multi-Source Large Foundation Model Training
Juntao Zhao
Qi Lu
Wei Jia
Borui Wan
Lei Zuo
...
Y. Hu
Yanghua Peng
H. Lin
Xin Liu
Chuan Wu
AI4CE
32
0
0
14 Apr 2025
PREBA: A Hardware/Software Co-Design for Multi-Instance GPU based AI Inference Servers
Gwangoo Yeo
Jiin Kim
Yujeong Choi
Minsoo Rhu
69
0
0
28 Nov 2024
TensorSocket: Shared Data Loading for Deep Learning Training
Ties Robroek
Neil Kim Nielsen
Pınar Tözün
21
2
0
27 Sep 2024
PreSto: An In-Storage Data Preprocessing System for Training Recommendation Models
Yunjae Lee
Hyeseong Kim
Minsoo Rhu
24
3
0
11 Jun 2024
Beyond Efficiency: Scaling AI Sustainably
Carole-Jean Wu
Bilge Acun
Ramya Raghavendra
Kim Hazelwood
GNN
33
13
0
08 Jun 2024
Bullion: A Column Store for Machine Learning
Gang Liao
Ye Liu
Jianjun Chen
Daniel J. Abadi
24
5
0
13 Apr 2024
Data Acquisition: A New Frontier in Data-centric AI
Lingjiao Chen
Bilge Acun
Newsha Ardalani
Yifan Sun
Feiyang Kang
...
Yongchan Kwon
Ruoxi Jia
Carole-Jean Wu
Matei A. Zaharia
James Y. Zou
27
8
0
22 Nov 2023
InTune: Reinforcement Learning-based Data Pipeline Optimization for Deep Recommendation Models
Kabir Nagrecha
Lingyi Liu
P. Delgado
Prasanna Padmanabhan
OffRL
AI4CE
14
5
0
13 Aug 2023
TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings
N. Jouppi
George Kurian
Sheng R. Li
Peter C. Ma
R. Nagarajan
...
Brian Towles
C. Young
Xiaoping Zhou
Zongwei Zhou
David A. Patterson
BDL
VLM
22
334
0
04 Apr 2023
FlexShard: Flexible Sharding for Industry-Scale Sequence Recommendation Models
Geet Sethi
Pallab Bhattacharya
Dhruv Choudhary
Carole-Jean Wu
Christos Kozyrakis
9
4
0
08 Jan 2023
Mystique: Enabling Accurate and Scalable Generation of Production AI Benchmarks
Mingyu Liang
Wenyin Fu
Louis Feng
Zhongyi Lin
P. Panakanti
Shengbao Zheng
Srinivas Sridharan
Christina Delimitrou
11
12
0
16 Dec 2022
RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure
Mark Zhao
Dhruv Choudhary
Devashish Tyagi
A. Somani
Max Kaplan
...
Jongsoo Park
Aarti Basant
Niket Agarwal
Carole-Jean Wu
Christos Kozyrakis
VLM
8
6
0
09 Nov 2022
tf.data service: A Case for Disaggregating ML Input Data Processing
Andrew Audibert
Yangrui Chen
D. Graur
Ana Klimovic
Jiří Šimša
C. A. Thekkath
34
16
0
26 Oct 2022
Accelerating Transfer Learning with Near-Data Computation on Cloud Object Stores
Arsany Guirguis
Diana Petrescu
Florin Dinu
D. Quoc
Javier Picorel
R. Guerraoui
19
0
0
16 Oct 2022
Understanding Scaling Laws for Recommendation Models
Newsha Ardalani
Carole-Jean Wu
Zeliang Chen
Bhargav Bhushanam
Adnan Aziz
21
28
0
17 Aug 2022
CoVA: Exploiting Compressed-Domain Analysis to Accelerate Video Analytics
Jinwoo Hwang
Minsu Kim
Daeun Kim
Seungho Nam
Yoonsung Kim
Dohee Kim
Hardik Sharma
Jongse Park
30
14
0
02 Jul 2022
Heterogeneous Acceleration Pipeline for Recommendation System Training
Muhammad Adnan
Yassaman Ebrahimzadeh Maboud
Divyat Mahajan
Prashant J. Nair
11
16
0
11 Apr 2022
RecShard: Statistical Feature-Based Memory Optimization for Industry-Scale Neural Recommendation
Geet Sethi
Bilge Acun
Niket Agarwal
Christos Kozyrakis
Caroline Trippel
Carole-Jean Wu
31
65
0
25 Jan 2022
tf.data: A Machine Learning Data Processing Framework
D. Murray
Jiří Šimša
Ana Klimovic
Ihor Indyk
PINN
AI4CE
LMTD
39
86
0
28 Jan 2021
1