ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.06468
  4. Cited By
Beyond the Memory Wall: A Case for Memory-centric HPC System for Deep
  Learning

Beyond the Memory Wall: A Case for Memory-centric HPC System for Deep Learning

18 February 2019
Youngeun Kwon
Minsoo Rhu
ArXivPDFHTML

Papers citing "Beyond the Memory Wall: A Case for Memory-centric HPC System for Deep Learning"

11 / 11 papers shown
Title
NEON: Enabling Efficient Support for Nonlinear Operations in Resistive
  RAM-based Neural Network Accelerators
NEON: Enabling Efficient Support for Nonlinear Operations in Resistive RAM-based Neural Network Accelerators
Aditya Manglik
Minesh Patel
Haiyu Mao
Behzad Salami
Jisung Park
Lois Orosa
O. Mutlu
15
1
0
10 Nov 2022
SmartSAGE: Training Large-scale Graph Neural Networks using In-Storage
  Processing Architectures
SmartSAGE: Training Large-scale Graph Neural Networks using In-Storage Processing Architectures
Yunjae Lee
Jin-Won Chung
Minsoo Rhu
GNN
27
48
0
10 May 2022
Training Personalized Recommendation Systems from (GPU) Scratch: Look
  Forward not Backwards
Training Personalized Recommendation Systems from (GPU) Scratch: Look Forward not Backwards
Youngeun Kwon
Minsoo Rhu
16
27
0
10 May 2022
GROW: A Row-Stationary Sparse-Dense GEMM Accelerator for
  Memory-Efficient Graph Convolutional Neural Networks
GROW: A Row-Stationary Sparse-Dense GEMM Accelerator for Memory-Efficient Graph Convolutional Neural Networks
Ranggi Hwang
M. Kang
Jiwon Lee
D. Kam
Youngjoo Lee
Minsoo Rhu
GNN
11
20
0
01 Mar 2022
Energy-Efficient Deflection-based On-chip Networks: Topology, Routing,
  Flow Control
Energy-Efficient Deflection-based On-chip Networks: Topology, Routing, Flow Control
Rachata Ausavarungnirun
O. Mutlu
21
0
0
05 Dec 2021
LazyBatching: An SLA-aware Batching System for Cloud Machine Learning
  Inference
LazyBatching: An SLA-aware Batching System for Cloud Machine Learning Inference
Yujeong Choi
Yunseong Kim
Minsoo Rhu
11
66
0
25 Oct 2020
Tensor Casting: Co-Designing Algorithm-Architecture for Personalized
  Recommendation Training
Tensor Casting: Co-Designing Algorithm-Architecture for Personalized Recommendation Training
Youngeun Kwon
Yunjae Lee
Minsoo Rhu
11
39
0
25 Oct 2020
Enabling Compute-Communication Overlap in Distributed Deep Learning
  Training Platforms
Enabling Compute-Communication Overlap in Distributed Deep Learning Training Platforms
Saeed Rashidi
Matthew Denton
Srinivas Sridharan
S. Srinivasan
Amoghavarsha Suresh
Jade Nie
T. Krishna
13
45
0
30 Jun 2020
DeepRecSys: A System for Optimizing End-To-End At-scale Neural
  Recommendation Inference
DeepRecSys: A System for Optimizing End-To-End At-scale Neural Recommendation Inference
Udit Gupta
Samuel Hsia
V. Saraph
Xiaodong Wang
Brandon Reagen
Gu-Yeon Wei
Hsien-Hsin S. Lee
David Brooks
Carole-Jean Wu
GNN
25
188
0
08 Jan 2020
Enabling Highly Efficient Capsule Networks Processing Through A
  PIM-Based Architecture Design
Enabling Highly Efficient Capsule Networks Processing Through A PIM-Based Architecture Design
Xingyao Zhang
S. Song
Chenhao Xie
Jing Wang
Wei-gong Zhang
Xin Fu
15
20
0
07 Nov 2019
Echo: Compiler-based GPU Memory Footprint Reduction for LSTM RNN
  Training
Echo: Compiler-based GPU Memory Footprint Reduction for LSTM RNN Training
Bojian Zheng
Abhishek Tiwari
Nandita Vijaykumar
Gennady Pekhimenko
13
44
0
22 May 2018
1