Effective Length Extrapolation via Dimension-Wise Positional Embeddings Manipulation
arXiv: 2504.18857

26 April 2025
Yi Lu
Wanxu Zhao
Xin Zhou
Chenxin An
Cong Wang
Shuo Li
Yue Yang
Jun Zhao
Changzhi Sun
Tao Gui
Qi Zhang

Papers citing "Effective Length Extrapolation via Dimension-Wise Positional Embeddings Manipulation"

Thus Spake Long-Context Large Language Model
Xiaoran Liu
Ruixiao Li
Mianqiu Huang
Zhigeng Liu
Yuerong Song
...
Qiang Liu
Yaqian Zhou
Qi Zhang
Xuanjing Huang
Xipeng Qiu
236
5
0
24 Feb 2025
Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Changzhi Sun
B. Guo
Y. Wu
Qipeng Guo
Lixing Shen
Zhan Chen
Xipeng Qiu
Tao Gui
208
10
0
20 Feb 2025
Round and Round We Go! What makes Rotary Positional Encodings useful?
International Conference on Learning Representations (ICLR), 2024
Federico Barbero
Alex Vitvitskyi
Christos Perivolaropoulos
Razvan Pascanu
Petar Veličković
371
64
0
08 Oct 2024