ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.09886
4
186

Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications

24 November 2018
Jongsoo Park
Maxim Naumov
Protonu Basu
Summer Deng
Aravind Kalaiah
D. Khudia
James Law
Parth Malani
Andrey Malevich
N. Satish
J. Pino
Martin D. Schatz
Alexander Sidorov
V. Sivakumar
Andrew Tulloch
Xiaodong Wang
Yiming Wu
Hector Yuen
Utku Diril
Dmytro Dzhulgakov
K. Hazelwood
Bill Jia
Yangqing Jia
Lin Qiao
Vijay Rao
Nadav Rotem
S. Yoo
M. Smelyanskiy
    FedML
    GNN
    BDL
ArXivPDFHTML
Abstract

The application of deep learning techniques resulted in remarkable improvement of machine learning models. In this paper provides detailed characterizations of deep learning models used in many Facebook social network services. We present computational characteristics of our models, describe high performance optimizations targeting existing systems, point out their limitations and make suggestions for the future general-purpose/accelerated inference hardware. Also, we highlight the need for better co-design of algorithms, numerics and computing platforms to address the challenges of workloads often run in data centers.

View on arXiv
Comments on this paper