Communities
Connect sessions
AI calendar
Organizations
Contact Sales
Search
Open menu
Home
Papers
2505.20776
Cited By
v1
v2
v3 (latest)
SpecExtend: A Drop-in Enhancement for Speculative Decoding of Long Sequences
27 May 2025
Jungyoub Cha
Hyunjong Kim
Sungzoon Cho
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Github
Papers citing
"SpecExtend: A Drop-in Enhancement for Speculative Decoding of Long Sequences"
4 / 4 papers shown
Title
EAGLE-3: Scaling up Inference Acceleration of Large Language Models via Training-Time Test
Yuhui Li
Fangyun Wei
Chao Zhang
Hongyang R. Zhang
386
45
0
03 Mar 2025
LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification
Penghui Yang
Cunxiao Du
Fengzhuo Zhang
Haonan Wang
Tianyu Pang
Chao Du
Bo An
RALM
167
1
0
24 Feb 2025
OPT-Tree: Speculative Decoding with Adaptive Draft Tree Structure
Jikai Wang
Yi Su
Juntao Li
Qingrong Xia
Zi Ye
Xinyu Duan
Zhefeng Wang
Min Zhang
238
25
0
25 Jun 2024
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty
Yuhui Li
Fangyun Wei
Chao Zhang
Hongyang R. Zhang
340
231
0
26 Jan 2024
1