ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.05076
  4. Cited By
TidalDecode: Fast and Accurate LLM Decoding with Position Persistent
  Sparse Attention

TidalDecode: Fast and Accurate LLM Decoding with Position Persistent Sparse Attention

7 October 2024
Lijie Yang
Zhihao Zhang
Zhuofu Chen
Zikun Li
Zhihao Jia
ArXivPDFHTML

Papers citing "TidalDecode: Fast and Accurate LLM Decoding with Position Persistent Sparse Attention"

1 / 1 papers shown
Title
Discovering the Gems in Early Layers: Accelerating Long-Context LLMs
  with 1000x Input Token Reduction
Discovering the Gems in Early Layers: Accelerating Long-Context LLMs with 1000x Input Token Reduction
Zhenmei Shi
Yifei Ming
Xuan-Phi Nguyen
Yingyu Liang
Shafiq Joty
76
27
0
25 Sep 2024
1