ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2511.14638
150
0

A Specialized Large Language Model for Clinical Reasoning and Diagnosis in Rare Diseases

18 November 2025
Tao Yang
Dandan Huang
Yunting Lin
P. Wu
Zhikun Wu
Gangyuan Ma
Y. Lu
Xinran Dong
Dingpeng Li
J. Ge
Zhiyan Zhang
Xuanzhao Huang
Wenyan Nong
Yao Zhou
Hui Tang
Hongxi Yang
Shijie Zhang
Juan-Zi Li
Xiaojun Cao
Lin Yang
Xia Gao
Kaishou Xu
X. Gu
Wen Zhang
Huimin Xia
Li Liu
Wenhao Zhou
Mulin Jun Li
    LM&MALRM
ArXiv (abs)PDFHTML
Main:50 Pages
5 Figures
Abstract

Rare diseases affect hundreds of millions worldwide, yet diagnosis often spans years. Convectional pipelines decouple noisy evidence extraction from downstream inferential diagnosis, and general/medical large language models (LLMs) face scarce real world electronic health records (EHRs), stale domain knowledge, and hallucinations. We assemble a large, domain specialized clinical corpus and a clinician validated reasoning set, and develop RareSeek R1 via staged instruction tuning, chain of thought learning, and graph grounded retrieval. Across multicenter EHR narratives and public benchmarks, RareSeek R1 attains state of the art accuracy, robust generalization, and stability under noisy or overlapping phenotypes. Augmented retrieval yields the largest gains when narratives pair with prioritized variants by resolving ambiguity and aligning candidates to mechanisms. Human studies show performance on par with experienced physicians and consistent gains in assistive use. Notably, transparent reasoning highlights decisive non phenotypic evidence (median 23.1%, such as imaging, interventions, functional tests) underpinning many correct diagnoses. This work advances a narrative first, knowledge integrated reasoning paradigm that shortens the diagnostic odyssey and enables auditable, clinically translatable decision support.

View on arXiv
Comments on this paper