v1v2v3 (latest)

Pareto-Optimized Open-Source LLMs for Healthcare via Context Retrieval

Artificial Intelligence Applications and Innovations (AIAI), 2024

23 September 2024

Jordi Bayarri-Planas

Ashwin Kumar Gururajan

Dario Garcia-Gasulla

ArXiv (abs)PDF HTML HuggingFace (21 upvotes)

Abstract

This study leverages optimized context retrieval to enhance open-source Large Language Models (LLMs) for cost-effective, high performance healthcare AI. We demonstrate that this approach achieves state-of-the-art accuracy on medical question answering at a fraction of the cost of proprietary models, significantly improving the cost-accuracy Pareto frontier on the MedQA benchmark. Key contributions include: (1) OpenMedQA, a novel benchmark revealing a performance gap in open-ended medical QA compared to multiple-choice formats; (2) a practical, reproducible pipeline for context retrieval optimization; and (3) open-source resources (Prompt Engine, CoT/ToT/Thinking databases) to empower healthcare AI development. By advancing retrieval techniques and QA evaluation, we enable more affordable and reliable LLM solutions for healthcare.

View on arXiv

Comments on this paper