Dynamic Speculation Lookahead Accelerates Speculative Decoding of Large Language Models

7 May 2024

Papers citing "Dynamic Speculation Lookahead Accelerates Speculative Decoding of Large Language Models"

1 / 1 papers shown

Title
Speculative Streaming: Fast LLM Inference without Auxiliary Models Nikhil Bhendawade Irina Belousova Qichen Fu Henry Mason Mohammad Rastegari Mahyar Najibi LRM 24 27 0 16 Feb 2024