Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2503.00491
Cited By

Tutorial Proposal: Speculative Decoding for Efficient LLM Inference

1 March 2025

ArXiv (abs)PDF HTML

Papers citing "Tutorial Proposal: Speculative Decoding for Efficient LLM Inference"

3 / 3 papers shown

Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting

Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting

Long Le

Huaixiu Steven Zheng

...

Anush Mattapalli

329

74

0

11 Jul 2024

EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty

EAGLE: Speculative Sampling Requires Rethinking Feature UncertaintyInternational Conference on Machine Learning (ICML), 2024

Hongyang R. Zhang

590

319

0

26 Jan 2024

Fast Transformer Decoding: One Write-Head is All You Need

Fast Transformer Decoding: One Write-Head is All You Need

Noam M. Shazeer

599

641

0

06 Nov 2019