2507.10178
Cited By
v1
v2
v3 (latest)
Pimba: A Processing-in-Memory Acceleration for Post-Transformer Large Language Model Serving
14 July 2025
Wonung Kim
Yubin Lee
Yoonsung Kim
Jinwoo Hwang
Seongryong Oh
Jiyong Jung
Aziz Huseynov
Woong Gyu Park
Chang Hyun Park
Divya Mahajan
Jongse Park
Github (11★)
Papers citing "Pimba: A Processing-in-Memory Acceleration for Post-Transformer Large Language Model Serving"
1 / 1 papers shown
Title
P3-LLM: An Integrated NPU-PIM Accelerator for LLM Inference Using Hybrid Numerical Formats
Yuzong Chen
Chao Fang
Xilai Dai
Yuheng Wu
Thierry Tambe
Marian Verhelst
Mohamed S. Abdelfattah
51
0
0
10 Nov 2025