2403.05821
Optimizing LLM Queries in Relational Data Analytics Workloads
9 March 2024
Shu Liu
Asim Biswal
Audrey Cheng
Xiangxi Mo
Shiyi Cao
Joseph E. Gonzalez
Ion Stoica
Matei A. Zaharia
Papers citing
"Optimizing LLM Queries in Relational Data Analytics Workloads"
2 / 2 papers shown
Hydragen: High-Throughput LLM Inference with Shared Prefixes
Jordan Juravsky
Bradley Brown
Ryan Ehrlich
Daniel Y. Fu
Christopher Ré
Azalia Mirhoseini
49
35
0
07 Feb 2024
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU
Ying Sheng
Lianmin Zheng
Binhang Yuan
Zhuohan Li
Max Ryabinin
...
Joseph E. Gonzalez
Percy Liang
Christopher Ré
Ion Stoica
Ce Zhang
144
365
0
13 Mar 2023