Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.05676
Cited By
PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design
8 March 2024
Wenqi Jiang
Shuai Zhang
Boran Han
Jie Wang
Bernie Wang
Tim Kraska
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design"
4 / 4 papers shown
Title
Taming the Titans: A Survey of Efficient LLM Inference Serving
Ranran Zhen
J. Li
Yixin Ji
Z. Yang
Tong Liu
Qingrong Xia
Xinyu Duan
Z. Wang
Baoxing Huai
M. Zhang
LLMAG
75
0
0
28 Apr 2025
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Wenqi Jiang
Marco Zeller
R. Waleffe
Torsten Hoefler
Gustavo Alonso
33
14
0
15 Oct 2023
Co-design Hardware and Algorithm for Vector Search
Wenqi Jiang
Shigang Li
Yu Zhu
Johannes de Fine Licht
Zhenhao He
...
Cédric Renggli
Shuai Zhang
Theodoros Rekatsinas
Torsten Hoefler
Gustavo Alonso
57
8
0
19 Jun 2023
Internet-Augmented Dialogue Generation
M. Komeili
Kurt Shuster
Jason Weston
RALM
226
278
0
15 Jul 2021
1