Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1902.06468
Cited By
Beyond the Memory Wall: A Case for Memory-centric HPC System for Deep Learning
18 February 2019
Youngeun Kwon
Minsoo Rhu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Beyond the Memory Wall: A Case for Memory-centric HPC System for Deep Learning"
11 / 11 papers shown
Title
NEON: Enabling Efficient Support for Nonlinear Operations in Resistive RAM-based Neural Network Accelerators
Aditya Manglik
Minesh Patel
Haiyu Mao
Behzad Salami
Jisung Park
Lois Orosa
O. Mutlu
15
1
0
10 Nov 2022
SmartSAGE: Training Large-scale Graph Neural Networks using In-Storage Processing Architectures
Yunjae Lee
Jin-Won Chung
Minsoo Rhu
GNN
27
48
0
10 May 2022
Training Personalized Recommendation Systems from (GPU) Scratch: Look Forward not Backwards
Youngeun Kwon
Minsoo Rhu
16
27
0
10 May 2022
GROW: A Row-Stationary Sparse-Dense GEMM Accelerator for Memory-Efficient Graph Convolutional Neural Networks
Ranggi Hwang
M. Kang
Jiwon Lee
D. Kam
Youngjoo Lee
Minsoo Rhu
GNN
11
20
0
01 Mar 2022
Energy-Efficient Deflection-based On-chip Networks: Topology, Routing, Flow Control
Rachata Ausavarungnirun
O. Mutlu
21
0
0
05 Dec 2021
LazyBatching: An SLA-aware Batching System for Cloud Machine Learning Inference
Yujeong Choi
Yunseong Kim
Minsoo Rhu
11
66
0
25 Oct 2020
Tensor Casting: Co-Designing Algorithm-Architecture for Personalized Recommendation Training
Youngeun Kwon
Yunjae Lee
Minsoo Rhu
11
39
0
25 Oct 2020
Enabling Compute-Communication Overlap in Distributed Deep Learning Training Platforms
Saeed Rashidi
Matthew Denton
Srinivas Sridharan
S. Srinivasan
Amoghavarsha Suresh
Jade Nie
T. Krishna
13
45
0
30 Jun 2020
DeepRecSys: A System for Optimizing End-To-End At-scale Neural Recommendation Inference
Udit Gupta
Samuel Hsia
V. Saraph
Xiaodong Wang
Brandon Reagen
Gu-Yeon Wei
Hsien-Hsin S. Lee
David Brooks
Carole-Jean Wu
GNN
25
188
0
08 Jan 2020
Enabling Highly Efficient Capsule Networks Processing Through A PIM-Based Architecture Design
Xingyao Zhang
S. Song
Chenhao Xie
Jing Wang
Wei-gong Zhang
Xin Fu
15
20
0
07 Nov 2019
Echo: Compiler-based GPU Memory Footprint Reduction for LSTM RNN Training
Bojian Zheng
Abhishek Tiwari
Nandita Vijaykumar
Gennady Pekhimenko
13
44
0
22 May 2018
1