2409.00142
Cited By
Dynamic Depth Decoding: Faster Speculative Decoding for LLMs
30 August 2024
Oscar Brown, Zhengjie Wang, Andrea Do, Nikhil Mathew, Cheng Yu
Papers citing
"Dynamic Depth Decoding: Faster Speculative Decoding for LLMs"
3 / 3 papers shown
Title
Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding
J. Li, Yixing Xu, Haiduo Huang, Xuanwu Yin, D. Li, Edith C.-H. Ngai, E. Barsoum
45, 0, 0
13 Mar 2025

Speculative Decoding and Beyond: An In-Depth Survey of Techniques
Y. Hu, Zining Liu, Zhenyuan Dong, Tianfan Peng, Bradley McDanel, S. Zhang
85, 0, 0
27 Feb 2025

C2T: A Classifier-Based Tree Construction Method in Speculative Decoding
Feiye Huo, Jianchao Tan, K. Zhang, Xunliang Cai, Shengli Sun
36, 0, 0
20 Feb 2025