Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.05968
Cited By
Centaur: A Chiplet-based, Hybrid Sparse-Dense Accelerator for Personalized Recommendations
12 May 2020
Ranggi Hwang
Taehun Kim
Youngeun Kwon
Minsoo Rhu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Centaur: A Chiplet-based, Hybrid Sparse-Dense Accelerator for Personalized Recommendations"
15 / 15 papers shown
Title
ACCL+: an FPGA-Based Collective Engine for Distributed Applications
Zhenhao He
Dario Korolija
Yu Zhu
Benjamin Ramhorst
Tristan Laan
L. Petrica
Michaela Blott
Gustavo Alonso
GNN
21
6
0
18 Dec 2023
Splitwise: Efficient generative LLM inference using phase splitting
Pratyush Patel
Esha Choukse
Chaojie Zhang
Aashaka Shah
Íñigo Goiri
Saeed Maleki
Ricardo Bianchini
47
197
0
30 Nov 2023
REED: Chiplet-Based Accelerator for Fully Homomorphic Encryption
Aikata Aikata
A. Mert
Sunmin Kwon
M. Deryabin
S. Roy
100
2
0
05 Aug 2023
SmartSAGE: Training Large-scale Graph Neural Networks using In-Storage Processing Architectures
Yunjae Lee
Jin-Won Chung
Minsoo Rhu
GNN
29
48
0
10 May 2022
Training Personalized Recommendation Systems from (GPU) Scratch: Look Forward not Backwards
Youngeun Kwon
Minsoo Rhu
21
27
0
10 May 2022
GROW: A Row-Stationary Sparse-Dense GEMM Accelerator for Memory-Efficient Graph Convolutional Neural Networks
Ranggi Hwang
M. Kang
Jiwon Lee
D. Kam
Youngjoo Lee
Minsoo Rhu
GNN
11
20
0
01 Mar 2022
Energy-Efficient Deflection-based On-chip Networks: Topology, Routing, Flow Control
Rachata Ausavarungnirun
O. Mutlu
21
0
0
05 Dec 2021
Hardware-assisted Trusted Memory Disaggregation for Secure Far Memory
Taekyung Heo
Seung-Hyun Kang
Sanghyeon Lee
Soojin Hwang
Jaehyuk Huh
16
1
0
25 Aug 2021
SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
Hanrui Wang
Zhekai Zhang
Song Han
26
374
0
17 Dec 2020
CPR: Understanding and Improving Failure Tolerant Training for Deep Learning Recommendation with Partial Recovery
Kiwan Maeng
Shivam Bharuka
Isabel Gao
M. C. Jeffrey
V. Saraph
...
Caroline Trippel
Jiyan Yang
Michael G. Rabbat
Brandon Lucia
Carole-Jean Wu
OffRL
11
31
0
05 Nov 2020
Understanding Capacity-Driven Scale-Out Neural Recommendation Inference
Michael Lui
Yavuz Yetim
Özgür Özkan
Zhuoran Zhao
Shin-Yeh Tsai
Carole-Jean Wu
Mark Hempstead
GNN
BDL
LRM
22
51
0
04 Nov 2020
LazyBatching: An SLA-aware Batching System for Cloud Machine Learning Inference
Yujeong Choi
Yunseong Kim
Minsoo Rhu
19
66
0
25 Oct 2020
Tensor Casting: Co-Designing Algorithm-Architecture for Personalized Recommendation Training
Youngeun Kwon
Yunjae Lee
Minsoo Rhu
19
39
0
25 Oct 2020
Optimizing Deep Learning Recommender Systems' Training On CPU Cluster Architectures
Dhiraj D. Kalamkar
E. Georganas
S. Srinivasan
Jianping Chen
Mikhail Shiryaev
A. Heinecke
48
47
0
10 May 2020
RecNMP: Accelerating Personalized Recommendation with Near-Memory Processing
Liu Ke
Udit Gupta
Carole-Jean Wu
B. Cho
Mark Hempstead
...
Dheevatsa Mudigere
Maxim Naumov
Martin D. Schatz
M. Smelyanskiy
Xiaodong Wang
41
213
0
30 Dec 2019
1