Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2412.20166
Cited By
LoL-PIM: Long-Context LLM Decoding with Scalable DRAM-PIM System
28 December 2024
Hyucksung Kwon
Kyungmo Koo
Janghyeon Kim
W. Lee
Minjae Lee
Hyungdeok Lee
Yousub Jung
Jaehan Park
Yosub Song
Byeongsu Yang
Haerang Choi
Guhyun Kim
Jongsoon Won
Woojae Shin
Changhyun Kim
Gyeongcheol Shin
Yongkee Kwon
Ilkon Kim
Euicheol Lim
John Kim
Jungwook Choi
Re-assign community
ArXiv
PDF
HTML
Papers citing
"LoL-PIM: Long-Context LLM Decoding with Scalable DRAM-PIM System"
3 / 3 papers shown
Title
Sparse Attention Remapping with Clustering for Efficient LLM Decoding on PIM
Zehao Fan
Garrett Gagnon
Zhenyu Liu
Liu Liu
19
0
0
09 May 2025
L3: DIMM-PIM Integrated Architecture and Coordination for Scalable Long-Context LLM Inference
Qingyuan Liu
Liyan Chen
Yanning Yang
H. Wang
Dong Du
Zhigang Mao
Naifeng Jing
Yubin Xia
Haibo Chen
29
0
0
24 Apr 2025
PAPI: Exploiting Dynamic Parallelism in Large Language Model Decoding with a Processing-In-Memory-Enabled Computing System
Yintao He
Haiyu Mao
Christina Giannoula
Mohammad Sadrosadati
Juan Gómez Luna
Huawei Li
Xiaowei Li
Ying Wang
O. Mutlu
38
5
0
21 Feb 2025
1