A Continued Pretrained LLM Approach for Automatic Medical Note Generation

North American Chapter of the Association for Computational Linguistics (NAACL), 2024

14 March 2024

Dong Yuan

Eti Rastogi

Gautam Naik

Sree Prasanna Rajagopal

Abstract

LLMs are revolutionizing NLP tasks. However, the most powerful LLM, like GPT-4, is too costly for most domain-specific scenarios. We present the first continuously trained 13B Llama2-based LLM that is purpose-built for medical conversations and measured on automated scribing. Our results show that our model outperforms GPT-4 in PubMedQA with 76.6\% accuracy and matches its performance in summarizing medical conversations into SOAP notes. Notably, our model exceeds GPT-4 in capturing a higher number of correct medical concepts and outperforms human scribes with higher correctness and completeness.

View on arXiv

Comments on this paper