ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.06519
37
2

SEGMENT+: Long Text Processing with Short-Context Language Models

9 October 2024
Wei Shi
Shuang Li
Kerun Yu
Jinglei Chen
Zujie Liang
Xinhui Wu
Yuxi Qian
Feng Wei
Bo Zheng
Jiaqing Liang
Jiangjie Chen
Yanghua Xiao
    RALM
    VLM
ArXivPDFHTML
Abstract

There is a growing interest in expanding the input capacity of language models (LMs) across various domains. However, simply increasing the context window does not guarantee robust performance across diverse long-input processing tasks, such as understanding extensive documents and extracting detailed information from lengthy and noisy data. In response, we introduce SEGMENT+, a general framework that enables LMs to handle extended inputs within limited context windows efficiently. SEGMENT+ utilizes structured notes and a filtering module to manage information flow, resulting in a system that is both controllable and interpretable. Our extensive experiments across various model sizes, focusing on long-document question-answering and Needle-in-a-Haystack tasks, demonstrate the effectiveness of SEGMENT+ in improving performance.

View on arXiv
Comments on this paper