Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2312.11750
Cited By
A Heterogeneous Chiplet Architecture for Accelerating End-to-End Transformer Models
18 December 2023
Harsh Sharma
Pratyush Dhingra
J. Doppa
Ümit Y. Ogras
P. Pande
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Heterogeneous Chiplet Architecture for Accelerating End-to-End Transformer Models"
3 / 3 papers shown
Title
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Jinhao Li
Jiaming Xu
Shan Huang
Yonghua Chen
Wen Li
...
Jiayi Pan
Li Ding
Hao Zhou
Yu Wang
Guohao Dai
57
15
0
06 Oct 2024
Memory Is All You Need: An Overview of Compute-in-Memory Architectures for Accelerating Large Language Model Inference
Christopher Wolters
Xiaoxuan Yang
Ulf Schlichtmann
Toyotaro Suzumura
34
11
0
12 Jun 2024
Dataflow-Aware PIM-Enabled Manycore Architecture for Deep Learning Workloads
Harsh Sharma
Gaurav Narang
J. Doppa
Ümit Y. Ogras
P. Pande
20
1
0
28 Mar 2024
1