266

Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis

Computational Linguistics (CL), 2024
Main:20 Pages
3 Figures
Bibliography:1 Pages
15 Tables
Appendix:7 Pages
Abstract

After the introduction of Large Language Models (LLMs), there have been substantial improvements in the performance of Natural Language Generation (NLG) tasks, including Text Summarization and Machine Translation. However, LLMs still produce outputs containing hallucinations, that is, content not grounded in factual information. Therefore, developing methods to assess the factuality of LLMs has become urgent.

View on arXiv
Comments on this paper