arXiv:2405.04304
Dynamic Speculation Lookahead Accelerates Speculative Decoding of Large Language Models
7 May 2024
Jonathan Mamou
Oren Pereg
Daniel Korat
Moshe Berchansky
Nadav Timor
Moshe Wasserblat
Roy Schwartz
Papers citing "Dynamic Speculation Lookahead Accelerates Speculative Decoding of Large Language Models"
1 / 1 papers shown
Title
Speculative Streaming: Fast LLM Inference without Auxiliary Models
Nikhil Bhendawade
Irina Belousova
Qichen Fu
Henry Mason
Mohammad Rastegari
Mahyar Najibi
LRM
16 Feb 2024