27
0

CrossFormer: Cross-Segment Semantic Fusion for Document Segmentation

Abstract

Text semantic segmentation involves partitioning a document into multiple paragraphs with continuous semantics based on the subject matter, contextual information, and document structure. Traditional approaches have typically relied on preprocessing documents into segments to address input length constraints, resulting in the loss of critical semantic information across segments. To address this, we present CrossFormer, a transformer-based model featuring a novel cross-segment fusion module that dynamically models latent semantic dependencies across document segments, substantially elevating segmentation accuracy. Additionally, CrossFormer can replace rule-based chunk methods within the Retrieval-Augmented Generation (RAG) system, producing more semantically coherent chunks that enhance its efficacy. Comprehensive evaluations confirm CrossFormer's state-of-the-art performance on public text semantic segmentation datasets, alongside considerable gains on RAG benchmarks.

View on arXiv
@article{ni2025_2503.23671,
  title={ CrossFormer: Cross-Segment Semantic Fusion for Document Segmentation },
  author={ Tongke Ni and Yang Fan and Junru Zhou and Xiangping Wu and Qingcai Chen },
  journal={arXiv preprint arXiv:2503.23671},
  year={ 2025 }
}
Comments on this paper