Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis
Computational Linguistics (CL), 2024
- HILM
Main:20 Pages
3 Figures
Bibliography:1 Pages
15 Tables
Appendix:7 Pages
Abstract
After the introduction of Large Language Models (LLMs), there have been substantial improvements in the performance of Natural Language Generation (NLG) tasks, including Text Summarization and Machine Translation. However, LLMs still produce outputs containing hallucinations, that is, content not grounded in factual information. Therefore, developing methods to assess the factuality of LLMs has become urgent.
View on arXivComments on this paper
