From Natural Language to SQL: Review of LLM-based Text-to-SQL Systems

Abstract
Large language models (LLMs), when used with retrieval-augmented generation (RAG), are greatly improving the state of the art in translating natural language queries into structured and correct SQL. Unlike previous reviews, this survey provides a comprehensive study of the evolution of LLM-based text-to-SQL systems, from early rule-based models to advanced LLM approaches that use RAG. We discuss benchmarks, evaluation methods, and evaluation metrics. We also uniquely study the use of Graph RAG for better contextual accuracy and schema linking in these systems. Finally, we highlight key challenges, such as computational efficiency, model robustness, and data privacy, that must be addressed to improve LLM-based text-to-SQL systems.
@article{mohammadjafari2025_2410.01066,
  title={From Natural Language to SQL: Review of LLM-based Text-to-SQL Systems},
  author={Ali Mohammadjafari and Anthony S. Maida and Raju Gottumukkala},
  journal={arXiv preprint arXiv:2410.01066},
  year={2025}
}