arXiv:2404.08698
Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding
10 April 2024
Jie Ou, Yueming Chen, Wenhong Tian
Papers citing
"Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding"
4 / 4 papers shown
Merge, Ensemble, and Cooperate! A Survey on Collaborative Strategies in the Era of Large Language Models
Jinliang Lu, Ziliang Pang, Min Xiao, Yaochen Zhu, Rui Xia, Jiajun Zhang
MoMe
27
17
0
08 Jul 2024
Teola: Towards End-to-End Optimization of LLM-based Applications
Xin Tan, Yimin Jiang, Yitao Yang, Hong-Yu Xu
45
4
0
29 Jun 2024
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU
Ying Sheng, Lianmin Zheng, Binhang Yuan, Zhuohan Li, Max Ryabinin, ..., Joseph E. Gonzalez, Percy Liang, Christopher Ré, Ion Stoica, Ce Zhang
138
365
0
13 Mar 2023
Teaching Machines to Read and Comprehend
Karl Moritz Hermann, Tomás Kociský, Edward Grefenstette, L. Espeholt, W. Kay, Mustafa Suleyman, Phil Blunsom
170
3,504
0
10 Jun 2015