Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.04104
Cited By
PipeDec: Low-Latency Pipeline-based Inference with Dynamic Speculative Decoding towards Large-scale Models
5 April 2025
Haofei Yin
Mengbai Xiao
Rouzhou Lu
Xiao Zhang
Dongxiao Yu
Guanghui Zhang
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"PipeDec: Low-Latency Pipeline-based Inference with Dynamic Speculative Decoding towards Large-scale Models"
Title
No papers