Communities
Connect sessions
AI calendar
Organizations
Contact Sales
Search
Open menu
Home
Papers
All Papers
Title
Home
Papers
2401.06706
Cited By
Multi-Candidate Speculative Decoding
12 January 2024
Sen Yang
Shujian Huang
Xinyu Dai
Jiajun Chen
BDL
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Papers citing
"Multi-Candidate Speculative Decoding"
9 / 9 papers shown
Title
SpecASR: Accelerating LLM-based Automatic Speech Recognition via Speculative Decoding
Design Automation Conference (DAC), 2024
Linye Wei
Shuzhang Zhong
Songqiang Xu
Runsheng Wang
Ru Huang
Meng Li
44
0
0
24 Jul 2025
SpecExtend: A Drop-in Enhancement for Speculative Decoding of Long Sequences
Jungyoub Cha
Hyunjong Kim
Sungzoon Cho
VLM
186
0
0
27 May 2025
Think Before You Accept: Semantic Reflective Verification for Faster Speculative Decoding
Yixuan Wang
Yijun Liu
Shiyu Ji
Yuzhuang Xu
Yang Xu
Qingfu Zhu
Wanxiang Che
OffRL
LRM
148
0
0
24 May 2025
L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models
Xiaohao Liu
Xiaobo Xia
Weixiang Zhao
Manyi Zhang
Xianzhi Yu
Xiu Su
Shuo Yang
See-Kiong Ng
Tat-Seng Chua
KELM
LRM
152
0
0
23 May 2025
Accelerating Large Language Model Reasoning via Speculative Search
Zhihai Wang
Jie Wang
Jilai Pan
Xilin Xia
Huiling Zhen
Mingxuan Yuan
Jianye Hao
Feng Wu
ReLM
LRM
218
4
0
03 May 2025
Collaborative Speculative Inference for Efficient LLM Inference Serving
Luyao Gao
Jianchun Liu
Hongli Xu
Xichong Zhang
Yunming Liao
Liusheng Huang
144
1
0
13 Mar 2025
SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths
Kaixuan Huang
Xudong Guo
M. Y. Wang
280
30
0
30 May 2024
Hardware-Aware Parallel Prompt Decoding for Memory-Efficient Acceleration of LLM Inference
Hao Mark Chen
Wayne Luk
Ka-Fai Cedric Yiu
Rui Li
Konstantin Mishchenko
Stylianos I. Venieris
Hongxiang Fan
110
12
0
28 May 2024
Fast Transformer Decoding: One Write-Head is All You Need
Noam M. Shazeer
348
558
0
06 Nov 2019
1