Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.13019
Cited By
A Comprehensive Survey of Accelerated Generation Techniques in Large Language Models
15 May 2024
Mahsa Khoshnoodi
Vinija Jain
Mingye Gao
Malavika Srikanth
Aman Chadha
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Comprehensive Survey of Accelerated Generation Techniques in Large Language Models"
3 / 3 papers shown
Title
Lookahead: An Inference Acceleration Framework for Large Language Model with Lossless Generation Accuracy
Yao-Min Zhao
Zhitian Xie
Chen Liang
Chenyi Zhuang
Jinjie Gu
45
11
0
20 Dec 2023
Big Bird: Transformers for Longer Sequences
Manzil Zaheer
Guru Guruganesh
Kumar Avinava Dubey
Joshua Ainslie
Chris Alberti
...
Philip Pham
Anirudh Ravula
Qifan Wang
Li Yang
Amr Ahmed
VLM
249
2,009
0
28 Jul 2020
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
243
1,815
0
17 Sep 2019
1