Kongzi: A Historical Large Language Model with Fact Enhancement

13 April 2025

Abstract

The capabilities of the latest large language models (LLMs) have been extended from pure natural language understanding to complex reasoning tasks. However, current reasoning models often exhibit factual inaccuracies in longer reasoning chains, which poses challenges for historical reasoning and limits the potential of LLMs in complex, knowledge-intensive tasks. Historical studies require not only the accurate presentation of factual information but also the ability to establish cross-temporal correlations and derive coherent conclusions from fragmentary and often ambiguous sources. To address these challenges, we propose Kongzi, a large language model specifically designed for historical analysis. Through the integration of curated, high-quality historical data and a novel fact-reinforcement learning strategy, Kongzi demonstrates strong factual alignment and sophisticated reasoning depth. Extensive experiments on tasks such as historical question answering and narrative generation demonstrate that Kongzi outperforms existing models in both factual accuracy and reasoning depth. By effectively addressing the unique challenges inherent in historical texts, Kongzi sets a new standard for the development of accurate and reliable LLMs in professional domains.

View on arXiv

@article{yang2025_2504.09488,
  title={ Kongzi: A Historical Large Language Model with Fact Enhancement },
  author={ Jiashu Yang and Ningning Wang and Yian Zhao and Chaoran Feng and Junjia Du and Hao Pang and Zhirui Fang and Xuxin Cheng },
  journal={arXiv preprint arXiv:2504.09488},
  year={ 2025 }
}

Comments on this paper