Pimba: A Processing-in-Memory Acceleration for Post-Transformer Large Language Model Serving

14 July 2025
Wonung Kim
Yubin Lee
Yoonsung Kim
Jinwoo Hwang
Seongryong Oh
Jiyong Jung
Aziz Huseynov
Woong Gyu Park
Chang Hyun Park
Divya Mahajan
Jongse Park
arXiv: 2507.10178 | GitHub (11★)

Papers citing "Pimba: A Processing-in-Memory Acceleration for Post-Transformer Large Language Model Serving"

P3-LLM: An Integrated NPU-PIM Accelerator for LLM Inference Using Hybrid Numerical Formats
Yuzong Chen
Chao Fang
Xilai Dai
Yuheng Wu
Thierry Tambe
Marian Verhelst
Mohamed S. Abdelfattah
10 Nov 2025