LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations

International Conference on Learning Representations (ICLR), 2024
3 October 2024
Hadas Orgad
Michael Toker
Zorik Gekhman
Roi Reichart
Idan Szpektor
Hadas Kotek
Yonatan Belinkov
HILM, AIFin

Papers citing "LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations"

31 / 131 papers shown
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Potsawee Manakul
Adian Liusie
Mark Gales
HILM, LRM
419
673
0
15 Mar 2023
LLaMA: Open and Efficient Foundation Language Models
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
...
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM, PILM
6.3K
17,759
0
27 Feb 2023
Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language Generation
International Conference on Learning Representations (ICLR), 2023
Lorenz Kuhn
Y. Gal
Sebastian Farquhar
UQLM
607
474
0
19 Feb 2023
A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity
International Joint Conference on Natural Language Processing (IJCNLP), 2023
Yejin Bang
Samuel Cahyawijaya
Nayeon Lee
Wenliang Dai
Jane Polak Scowcroft
...
Tiezheng Yu
Willy Chung
Quyet V. Do
Yan Xu
Pascale Fung
ReLM, LRM
709
1,621
0
08 Feb 2023
Discovering Latent Knowledge in Language Models Without Supervision
International Conference on Learning Representations (ICLR), 2022
Collin Burns
Haotian Ye
Dan Klein
Jacob Steinhardt
417
531
0
07 Dec 2022
RARR: Researching and Revising What Language Models Say, Using Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Luyu Gao
Zhuyun Dai
Panupong Pasupat
Anthony Chen
Arun Tejasvi Chaganty
...
Vincent Zhao
Ni Lao
Hongrae Lee
Da-Cheng Juan
Kelvin Guu
HILM, KELM
704
282
0
17 Oct 2022
Looking for a Needle in a Haystack: A Comprehensive Study of Hallucinations in Neural Machine Translation
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2022
Nuno M. Guerreiro
Elena Voita
André F. T. Martins
HILM
290
68
0
10 Aug 2022
Language Models (Mostly) Know What They Know
Saurav Kadavath
Tom Conerly
Amanda Askell
T. Henighan
Dawn Drain
...
Nicholas Joseph
Benjamin Mann
Sam McCandlish
C. Olah
Jared Kaplan
ELM
638
1,139
0
11 Jul 2022
RED-ACE: Robust Error Detection for ASR using Confidence Embeddings
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Zorik Gekhman
Dina Zverinski
Jonathan Mallinson
Genady Beryozkin
138
10
0
14 Mar 2022
Locating and Editing Factual Associations in GPT
Neural Information Processing Systems (NeurIPS), 2022
Kevin Meng
David Bau
A. Andonian
Yonatan Belinkov
KELM
947
1,956
0
10 Feb 2022
Survey of Hallucination in Natural Language Generation
ACM Computing Surveys (ACM CSUR), 2022
Ziwei Ji
Nayeon Lee
Rita Frieske
Tiezheng Yu
D. Su
...
Delong Chen
Wenliang Dai
Ho Shu Chan
Andrea Madotto
Pascale Fung
HILM, LRM
905
3,504
0
08 Feb 2022
SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization
Philippe Laban
Tobias Schnabel
Paul N. Bennett
Marti A. Hearst
HILM
342
460
0
18 Nov 2021
Learning Compact Metrics for MT
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Amy Pu
Hyung Won Chung
Ankur P. Parikh
Sebastian Gehrmann
Thibault Sellam
185
112
0
12 Oct 2021
TruthfulQA: Measuring How Models Mimic Human Falsehoods
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Stephanie C. Lin
Jacob Hilton
Owain Evans
HILM
1.6K
2,670
0
08 Sep 2021
A Token-level Reference-free Hallucination Detection Benchmark for Free-form Text Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Tianyu Liu
Yizhe Zhang
Chris Brockett
Yi Mao
Zhifang Sui
Weizhu Chen
W. Dolan
HILM
939
182
0
18 Apr 2021
$Q^{2}$: Evaluating Factual Consistency in Knowledge-Grounded Dialogues via Question Generation and Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Or Honovich
Leshem Choshen
Roee Aharoni
Ella Neeman
Idan Szpektor
Omri Abend
HILM
406
153
0
16 Apr 2021
QuestEval: Summarization Asks for Fact-based Evaluation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Thomas Scialom
Paul-Alexis Dray
Patrick Gallinari
Sylvain Lamprier
Benjamin Piwowarski
Jacopo Staiano
Alex Jinpeng Wang
HILM
274
321
0
23 Mar 2021
Probing Classifiers: Promises, Shortcomings, and Advances
Computational Linguistics (CL), 2021
Yonatan Belinkov
750
590
0
24 Feb 2021
KoBE: Knowledge-Based Machine Translation Evaluation
Findings of the Association for Computational Linguistics: EMNLP (Findings), 2020
Zorik Gekhman
Roee Aharoni
Genady Beryozkin
Markus Freitag
Wolfgang Macherey
174
15
0
23 Sep 2020
COMET: A Neural Framework for MT Evaluation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Ricardo Rei
Craig Alan Stewart
Ana C. Farinha
A. Lavie
464
1,363
0
18 Sep 2020
On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation
Chaojun Wang
Rico Sennrich
229
182
0
07 May 2020
BLEURT: Learning Robust Metrics for Text Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Thibault Sellam
Dipanjan Das
Ankur P. Parikh
636
1,750
0
09 Apr 2020
Evaluating the Factual Consistency of Abstractive Text Summarization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Wojciech Kryściński
Bryan McCann
Caiming Xiong
R. Socher
HILM
333
847
0
28 Oct 2019
On Identifiability in Transformers
International Conference on Learning Representations (ICLR), 2019
Gino Brunner
Yang Liu
Damian Pascual
Oliver Richter
Massimiliano Ciaramita
Roger Wattenhofer
ViT
315
202
0
12 Aug 2019
Context is Key: Grammatical Error Detection with Contextual Word Representations
Samuel J. Bell
H. Yannakoudakis
Marek Rei
166
46
0
15 Jun 2019
ERNIE: Enhanced Language Representation with Informative Entities
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Zhengyan Zhang
Xu Han
Zhiyuan Liu
Xin Jiang
Maosong Sun
Qun Liu
391
1,520
0
17 May 2019
Wronging a Right: Generating Better Errors to Improve Grammatical Error Detection
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018
Sudhanshu Kasewa
Pontus Stenetorp
Sebastian Riedel
200
60
0
26 Sep 2018
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018
Zhilin Yang
Peng Qi
Saizheng Zhang
Yoshua Bengio
William W. Cohen
Ruslan Salakhutdinov
Christopher D. Manning
RALM
898
3,623
0
25 Sep 2018
Gender Bias in Coreference Resolution: Evaluation and Debiasing Methods
North American Chapter of the Association for Computational Linguistics (NAACL), 2018
Jieyu Zhao
Tianlu Wang
Mark Yatskar
Vicente Ordonez
Kai-Wei Chang
333
1,080
0
18 Apr 2018
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
Mandar Joshi
Eunsol Choi
Daniel S. Weld
Luke Zettlemoyer
RALM
1.8K
3,345
0
09 May 2017
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
Adina Williams
Nikita Nangia
Samuel R. Bowman
1.3K
4,838
0
18 Apr 2017