v1v2v3v4 (latest)

LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations

International Conference on Learning Representations (ICLR), 2024

3 October 2024

ArXiv (abs)PDF HTML HuggingFace (49 upvotes)

Papers citing "LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations"

31 / 131 papers shown

SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Potsawee Manakul

Adian Liusie

Mark Gales

HILM LRM

419

673

15 Mar 2023

LLaMA: Open and Efficient Foundation Language Models

...

6.3K

17,759

27 Feb 2023

Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language GenerationInternational Conference on Learning Representations (ICLR), 2023

607

474

19 Feb 2023

A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and InteractivityInternational Joint Conference on Natural Language Processing (IJCNLP), 2023

...

709

1,621

08 Feb 2023

Discovering Latent Knowledge in Language Models Without SupervisionInternational Conference on Learning Representations (ICLR), 2022

417

531

07 Dec 2022

RARR: Researching and Revising What Language Models Say, Using Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Arun Tejasvi Chaganty

...

704

282

17 Oct 2022

Looking for a Needle in a Haystack: A Comprehensive Study of Hallucinations in Neural Machine TranslationConference of the European Chapter of the Association for Computational Linguistics (EACL), 2022

Nuno M. Guerreiro

Elena Voita

André F. T. Martins

HILM

290

10 Aug 2022

Language Models (Mostly) Know What They Know

...

638

1,139

11 Jul 2022

RED-ACE: Robust Error Detection for ASR using Confidence EmbeddingsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

138

14 Mar 2022

Locating and Editing Factual Associations in GPTNeural Information Processing Systems (NeurIPS), 2022

947

1,956

10 Feb 2022

Survey of Hallucination in Natural Language GenerationACM Computing Surveys (ACM CSUR), 2022

...

Andrea Madotto

905

3,504

08 Feb 2022

SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization

342

460

18 Nov 2021

Learning Compact Metrics for MTConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

185

112

12 Oct 2021

TruthfulQA: Measuring How Models Mimic Human FalsehoodsAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

1.6K

2,670

08 Sep 2021

A Token-level Reference-free Hallucination Detection Benchmark for Free-form Text GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

Zhifang Sui

939

182

18 Apr 2021

$$Q^{2}$: Evaluating Factual Consistency in Knowledge-Grounded Dialogues via Question Generation and Question Answering$

Q^{2}

: Evaluating Factual Consistency in Knowledge-Grounded Dialogues via Question Generation and Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

406

153

16 Apr 2021

QuestEval: Summarization Asks for Fact-based EvaluationConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

274

321

23 Mar 2021

Probing Classifiers: Promises, Shortcomings, and AdvancesInternational Conference on Computational Logic (ICCL), 2021

Yonatan Belinkov

750

590

24 Feb 2021

KoBE: Knowledge-Based Machine Translation EvaluationFindings (Findings), 2020

174

23 Sep 2020

COMET: A Neural Framework for MT EvaluationConference on Empirical Methods in Natural Language Processing (EMNLP), 2020

464

1,363

18 Sep 2020

On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation

Chaojun Wang

Rico Sennrich

229

182

07 May 2020

BLEURT: Learning Robust Metrics for Text GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2020

Thibault Sellam

Dipanjan Das

Ankur P. Parikh

636

1,750

09 Apr 2020

Evaluating the Factual Consistency of Abstractive Text SummarizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2019

333

847

28 Oct 2019

On Identifiability in TransformersInternational Conference on Learning Representations (ICLR), 2019

Gino Brunner

Yang Liu

Damian Pascual

Oliver Richter

Massimiliano Ciaramita

Roger Wattenhofer

ViT

315

202

12 Aug 2019

Context is Key: Grammatical Error Detection with Contextual Word Representations

Samuel J. Bell

H. Yannakoudakis

Marek Rei

166

15 Jun 2019

ERNIE: Enhanced Language Representation with Informative EntitiesAnnual Meeting of the Association for Computational Linguistics (ACL), 2019

Zhengyan Zhang

Xu Han

Zhiyuan Liu

Xin Jiang

Maosong Sun

Qun Liu

391

1,520

17 May 2019

Wronging a Right: Generating Better Errors to Improve Grammatical Error DetectionConference on Empirical Methods in Natural Language Processing (EMNLP), 2018

Sudhanshu Kasewa

Pontus Stenetorp

Sebastian Riedel

200

26 Sep 2018

HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2018

Christopher D. Manning

RALM

898

3,623

25 Sep 2018

Gender Bias in Coreference Resolution: Evaluation and Debiasing MethodsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2018

333

1,080

18 Apr 2018

TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension

Luke Zettlemoyer

1.8K

3,345

09 May 2017

A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference

Adina Williams

Nikita Nangia

Samuel R. Bowman

1.3K

4,838

18 Apr 2017