ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2411.19378
86
0

Libra: Leveraging Temporal Images for Biomedical Radiology Analysis

28 November 2024
Xi Zhang
Zaiqiao Meng
Jake Lever
Edmond S. L. Ho
    MedIm
ArXivPDFHTML
Abstract

Radiology report generation (RRG) requires advanced medical image analysis, effective temporal reasoning, and accurate text generation. While multimodal large language models (MLLMs) align with pre-trained vision encoders to enhance visual-language understanding, most existing methods rely on single-image analysis or rule-based heuristics to process multiple images, failing to fully leverage temporal information in multi-modal medical datasets. In this paper, we introduce Libra, a temporal-aware MLLM tailored for chest X-ray report generation. Libra combines a radiology-specific image encoder with a novel Temporal Alignment Connector (TAC), designed to accurately capture and integrate temporal differences between paired current and prior images. Extensive experiments on the MIMIC-CXR dataset demonstrate that Libra establishes a new state-of-the-art benchmark among similarly scaled MLLMs, setting new standards in both clinical relevance and lexical accuracy.

View on arXiv
@article{zhang2025_2411.19378,
  title={ Libra: Leveraging Temporal Images for Biomedical Radiology Analysis },
  author={ Xi Zhang and Zaiqiao Meng and Jake Lever and Edmond S. L. Ho },
  journal={arXiv preprint arXiv:2411.19378},
  year={ 2025 }
}
Comments on this paper