arXiv:2105.12676
Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale
26 May 2021
Zhaoxia Deng, Jongsoo Park, P. T. P. Tang, Haixin Liu, J. Yang, Hector Yuen, Jianyu Huang, D. Khudia, Xiaohan Wei, Ellie Wen, Dhruv Choudhary, Raghuraman Krishnamoorthi, Carole-Jean Wu, S. Nadathur, Changkyu Kim, Maxim Naumov, S. Naghshineh, M. Smelyanskiy
Papers citing "Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale"
4 / 4 papers shown

Characterizing and Efficiently Accelerating Multimodal Generation Model Inference
Yejin Lee, Anna Y. Sun, Basil Hosmer, Bilge Acun, Can Balioglu, ..., Ram Pasunuru, Scott Yih, Sravya Popuri, Xing Liu, Carole-Jean Wu
52  2  0
30 Sep 2024

With Shared Microexponents, A Little Shifting Goes a Long Way
Bita Darvish Rouhani, Ritchie Zhao, V. Elango, Rasoul Shafipour, Mathew Hall, ..., Eric S. Chung, Zhaoxia Deng, S. Naghshineh, Jongsoo Park, Maxim Naumov
MQ
41  36  0
16 Feb 2023

FBGEMM: Enabling High-Performance Low-Precision Deep Learning Inference
D. Khudia, Jianyu Huang, Protonu Basu, Summer Deng, Haixin Liu, Jongsoo Park, M. Smelyanskiy
FedML, MQ
49  46  0
13 Jan 2021

Distributed Hierarchical GPU Parameter Server for Massive Scale Deep Learning Ads Systems
Weijie Zhao, Deping Xie, Ronglai Jia, Yulei Qian, Rui Ding, Mingming Sun, P. Li
MoE
59  150  0
12 Mar 2020