ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.20988
49
0

Cross-Modal State-Space Graph Reasoning for Structured Summarization

26 March 2025
Hannah Kim
Sofia Martinez
Jason Lee
ArXivPDFHTML
Abstract

The ability to extract compact, meaningful summaries from large-scale and multimodal data is critical for numerous applications, ranging from video analytics to medical reports. Prior methods in cross-modal summarization have often suffered from high computational overheads and limited interpretability. In this paper, we propose a \textit{Cross-Modal State-Space Graph Reasoning} (\textbf{CSS-GR}) framework that incorporates a state-space model with graph-based message passing, inspired by prior work on efficient state-space models. Unlike existing approaches relying on purely sequential models, our method constructs a graph that captures inter- and intra-modal relationships, allowing more holistic reasoning over both textual and visual streams. We demonstrate that our approach significantly improves summarization quality and interpretability while maintaining computational efficiency, as validated on standard multimodal summarization benchmarks. We also provide a thorough ablation study to highlight the contributions of each component.

View on arXiv
@article{kim2025_2503.20988,
  title={ Cross-Modal State-Space Graph Reasoning for Structured Summarization },
  author={ Hannah Kim and Sofia Martinez and Jason Lee },
  journal={arXiv preprint arXiv:2503.20988},
  year={ 2025 }
}
Comments on this paper