14
0

ReCellTy: Domain-specific knowledge graph retrieval-augmented LLMs workflow for single-cell annotation

Abstract

To enable precise and fully automated cell type annotation with large language models (LLMs), we developed a graph structured feature marker database to retrieve entities linked to differential genes for cell reconstruction. We further designed a multi task workflow to optimize the annotation process. Compared to general purpose LLMs, our method improves human evaluation scores by up to 0.21 and semantic similarity by 6.1% across 11 tissue types, while more closely aligning with the cognitive logic of manual annotation.

View on arXiv
@article{han2025_2505.00017,
  title={ ReCellTy: Domain-specific knowledge graph retrieval-augmented LLMs workflow for single-cell annotation },
  author={ Dezheng Han and Yibin Jia and Ruxiao Chen and Wenjie Han and Shuaishuai Guo and Jianbo Wang },
  journal={arXiv preprint arXiv:2505.00017},
  year={ 2025 }
}
Comments on this paper