QuantSpec: Self-Speculative Decoding with Hierarchical Quantized KV Cache

5 February 2025
Rishabh Tiwari, Haocheng Xi, Aditya Tomar, Coleman Hooper, Sehoon Kim, Maxwell Horton, Mahyar Najibi, Michael W. Mahoney, Kemal Kurniawan, Amir Gholami
Topics: MQ

Papers citing "QuantSpec: Self-Speculative Decoding with Hierarchical Quantized KV Cache"

3 of 53 citing papers shown (page 2 of 2).

Compressive Transformers for Long-Range Sequence Modelling
International Conference on Learning Representations (ICLR), 2019
Jack W. Rae, Anna Potapenko, Siddhant M. Jayakumar, Timothy Lillicrap
Topics: RALM, VLM, KELM
13 Nov 2019

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Journal of Machine Learning Research (JMLR), 2019
Colin Raffel, Noam M. Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, Peter J. Liu
Topics: AIMat
23 Oct 2019

Pointer Sentinel Mixture Models
Stephen Merity, Caiming Xiong, James Bradbury, R. Socher
Topics: RALM
26 Sep 2016