2408.08696
Cited By
Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling
16 August 2024
Xianzhen Luo, Yixuan Wang, Qingfu Zhu, Zhiming Zhang, Xuanyu Zhang, Qing Yang, Dongliang Xu, Wanxiang Che
Papers citing
"Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling"
2 / 2 papers shown
Title
DReSD: Dense Retrieval for Speculative Decoding
Milan Gritta, Huiyin Xue, Gerasimos Lampouras
RALM
90
0
0
24 Feb 2025
Make Some Noise: Unlocking Language Model Parallel Inference Capability through Noisy Training
Yixuan Wang, Xianzhen Luo, Fuxuan Wei, Yijun Liu, Qingfu Zhu, Xuanyu Zhang, Qing Yang, Dongliang Xu, Wanxiang Che
35
3
0
25 Jun 2024