ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.02365
62
0

EchoQA: A Large Collection of Instruction Tuning Data for Echocardiogram Reports

4 March 2025
L. Moukheiber
Mira Moukheiber
Dana Moukheiiber
Jae-Woo Ju
Hyung-Chul Lee
    LM&MA
ArXivPDFHTML
Abstract

We introduce a novel question-answering (QA) dataset using echocardiogram reports sourced from the Medical Information Mart for Intensive Care database. This dataset is specifically designed to enhance QA systems in cardiology, consisting of 771,244 QA pairs addressing a wide array of cardiac abnormalities and their severity. We compare large language models (LLMs), including open-source and biomedical-specific models for zero-shot evaluation, and closed-source models for zero-shot and three-shot evaluation. Our results show that fine-tuning LLMs improves performance across various QA metrics, validating the value of our dataset. Clinicians also qualitatively evaluate the best-performing model to assess the LLM responses for correctness. Further, we conduct fine-grained fairness audits to assess the bias-performance trade-off of LLMs across various social determinants of health. Our objective is to propel the field forward by establishing a benchmark for LLM AI agents aimed at supporting clinicians with cardiac differential diagnoses, thereby reducing the documentation burden that contributes to clinician burnout and enabling healthcare professionals to focus more on patient care.

View on arXiv
@article{moukheiber2025_2503.02365,
  title={ EchoQA: A Large Collection of Instruction Tuning Data for Echocardiogram Reports },
  author={ Lama Moukheiber and Mira Moukheiber and Dana Moukheiiber and Jae-Woo Ju and Hyung-Chul Lee },
  journal={arXiv preprint arXiv:2503.02365},
  year={ 2025 }
}
Comments on this paper