ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2405.04304
  4. Cited By
Dynamic Speculation Lookahead Accelerates Speculative Decoding of Large
  Language Models

Dynamic Speculation Lookahead Accelerates Speculative Decoding of Large Language Models

7 May 2024
Jonathan Mamou
Oren Pereg
Daniel Korat
Moshe Berchansky
Nadav Timor
Moshe Wasserblat
Roy Schwartz
ArXivPDFHTML

Papers citing "Dynamic Speculation Lookahead Accelerates Speculative Decoding of Large Language Models"

1 / 1 papers shown
Title
Speculative Streaming: Fast LLM Inference without Auxiliary Models
Speculative Streaming: Fast LLM Inference without Auxiliary Models
Nikhil Bhendawade
Irina Belousova
Qichen Fu
Henry Mason
Mohammad Rastegari
Mahyar Najibi
LRM
24
27
0
16 Feb 2024
1