
How Language Model Hallucinations Can Snowball
International Conference on Machine Learning (ICML), 2023
22 May 2023
arXiv: 2305.13534
Muru Zhang
Ofir Press
William Merrill
Alisa Liu
Noah A. Smith
HILM, LRM

Papers citing "How Language Model Hallucinations Can Snowball"

50 / 125 papers shown
MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
David Wan
Justin Chih-Yao Chen
Elias Stengel-Eskin
Joey Tianyi Zhou
LLMAG, LRM
19 Mar 2025
Where do Large Vision-Language Models Look at when Answering Questions?
X. Xing
Chia-Wen Kuo
Li Fuxin
Yulei Niu
Fan Chen
Ming Li
Ying Wu
Longyin Wen
Sijie Zhu
LRM
18 Mar 2025
DatawiseAgent: A Notebook-Centric LLM Agent Framework for Adaptive and Robust Data Science Automation
Ziming You
Yumiao Zhang
Dexuan Xu
Yiwei Lou
Yandong Yan
Wei Wang
H. Zhang
Yu Huang
LLMAG
10 Mar 2025
Can LLMs Explain Themselves Counterfactually?
Zahra Dehghanighobadi
Asja Fischer
Muhammad Bilal Zafar
LRM
25 Feb 2025
GraphCheck: Breaking Long-Term Text Barriers with Extracted Knowledge Graph-Powered Fact-Checking
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yingjian Chen
Haoran Liu
Yinhong Liu
Rui Yang
Han Yuan
...
Pengyuan Zhou
Qingyu Chen
James Caverlee
Irene Li
HILM
23 Feb 2025
Preventing Rogue Agents Improves Multi-Agent Collaboration
Ohav Barbi
Ori Yoran
Mor Geva
09 Feb 2025
ComparisonQA: Evaluating Factuality Robustness of LLMs Through Knowledge Frequency Control and Uncertainty
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Qing Zong
Zhaoxiang Wang
Tianshi Zheng
Xiyu Ren
Yangqiu Song
28 Dec 2024
The Potential of LLMs in Medical Education: Generating Questions and Answers for Qualification Exams
Yunqi Zhu
Wen Tang
Ying Sun
Xuebing Yang
Liyang Dou
Yifan Gu
Yuanyuan Wu
Wensheng Zhang
LM&MA, ELM
31 Oct 2024
Retrieval-Augmented Generation with Estimation of Source Reliability
Jeongyeon Hwang
Junyoung Park
Hyejin Park
Dongwoo Kim
Sangdon Park
Jungseul Ok
RALM
30 Oct 2024
LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs
Yujun Zhou
Jingdong Yang
Yue Huang
Kehan Guo
Zoe Emory
...
Tian Gao
Werner Geyer
Nuno Moniz
Nitesh Chawla
Xiangliang Zhang
18 Oct 2024
QSpec: Speculative Decoding with Complementary Quantization Schemes
Juntao Zhao
Wenhao Lu
Sheng Wang
Lingpeng Kong
Chuan Wu
MQ
15 Oct 2024
A Survey on the Honesty of Large Language Models
Siheng Li
Cheng Yang
Taiqiang Wu
Chufan Shi
Yuji Zhang
...
Jie Zhou
Yujiu Yang
Ngai Wong
Xixin Wu
Wai Lam
HILM
27 Sep 2024
Multilevel Interpretability Of Artificial Neural Networks: Leveraging Framework And Methods From Neuroscience
Zhonghao He
Jascha Achterberg
Katie Collins
Kevin K. Nejad
Danyal Akarca
...
Chole Li
Kai J. Sandbrink
Stephen Casper
Anna Ivanova
Grace W. Lindsay
AI4CE
22 Aug 2024
Visual Agents as Fast and Slow Thinkers
International Conference on Learning Representations (ICLR), 2024
Guangyan Sun
Haoyang Ling
Zhenting Wang
Cheng-Long Wang
Siqi Ma
Qifan Wang
Ying Nian Wu
Dongfang Liu
LLMAG, LRM
16 Aug 2024
ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning
Yanjie Wang
Alan Yuille
Zhuowan Li
Zilong Zheng
LRM
05 Aug 2024
Self-Introspective Decoding: Alleviating Hallucinations for Large Vision-Language Models
International Conference on Learning Representations (ICLR), 2024
Fushuo Huo
Wenchao Xu
Zhong Zhang
Yining Qi
Zhicheng Chen
Peilin Zhao
VLM, MLLM
04 Aug 2024
Social and Ethical Risks Posed by General-Purpose LLMs for Settling Newcomers in Canada
I. Nejadgholi
Maryam Molamohammadi
Samir Bakhtawar
15 Jul 2024
Merge, Ensemble, and Cooperate! A Survey on Collaborative Strategies in the Era of Large Language Models
Jinliang Lu
Ziliang Pang
Min Xiao
Yaochen Zhu
Rui Xia
Jiajun Zhang
MoMe
08 Jul 2024
From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty
Maor Ivgi
Ori Yoran
Jonathan Berant
Mor Geva
HILM
08 Jul 2024
Predicting vs. Acting: A Trade-off Between World Modeling & Agent Modeling
Margaret Li
Weijia Shi
Artidoro Pagnoni
Peter West
Ari Holtzman
02 Jul 2024
Learning to Refine with Fine-Grained Natural Language Feedback
Manya Wadhwa
Xinyu Zhao
Junyi Jessy Li
Greg Durrett
02 Jul 2024
First Heuristic Then Rational: Dynamic Use of Heuristics in Language Model Reasoning
Yoichi Aoki
Keito Kudo
Tatsuki Kuribayashi
Shusaku Sone
Masaya Taniguchi
Keisuke Sakaguchi
Kentaro Inui
LRM
23 Jun 2024
Chain-of-Probe: Examining the Necessity and Accuracy of CoT Step-by-Step
Zezhong Wang
Xingshan Zeng
Weiwen Liu
Yufei Wang
Liangyou Li
Yasheng Wang
Lifeng Shang
Xin Jiang
Qun Liu
Kam-Fai Wong
LRM
23 Jun 2024
A Probabilistic Framework for LLM Hallucination Detection via Belief Tree Propagation
Bairu Hou
Yang Zhang
Jacob Andreas
Shiyu Chang
11 Jun 2024
ANAH: Analytical Annotation of Hallucinations in Large Language Models
Ziwei Ji
Yuzhe Gu
Wenwei Zhang
Chengqi Lyu
Dahua Lin
Kai-xiang Chen
HILM
30 May 2024
Improve Student's Reasoning Generalizability through Cascading Decomposed CoTs Distillation
Chengwei Dai
Kun Li
Wei Zhou
Song Hu
LRM
30 May 2024
OpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMs
Yuxia Wang
Minghan Wang
Hasan Iqbal
Georgi Georgiev
Fauzan Farooqui
Preslav Nakov
HILM
09 May 2024
The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey
Tula Masterman
Sandi Besen
Mason Sawtell
Alex Chao
LM&Ro, LLMAG
17 Apr 2024
Self-playing Adversarial Language Game Enhances LLM Reasoning
Pengyu Cheng
Tianhao Hu
Han Xu
Zhisong Zhang
Yong Dai
Lei Han
Nan Du
Xiaolong Li
SyDa, LRM, ReLM
16 Apr 2024
Automating Research Synthesis with Domain-Specific Large Language Model Fine-Tuning
Teo Susnjak
Peter Hwang
N. Reyes
A. Barczak
Timothy R. McIntosh
Surangika Ranathunga
08 Apr 2024
Hallucination Detection in Foundation Models for Decision-Making: A Flexible Definition and Review of the State of the Art
Neeloy Chakraborty
Melkior Ornik
Katherine Driggs-Campbell
LRM
25 Mar 2024
FIT-RAG: Black-Box RAG with Factual Information and Token Reduction
Yuren Mao
Xuemei Dong
Wenyi Xu
Yunjun Gao
Bin Wei
Ying Zhang
21 Mar 2024
ERBench: An Entity-Relationship based Automatically Verifiable Hallucination Benchmark for Large Language Models
Neural Information Processing Systems (NeurIPS), 2024
Jio Oh
Soyeon Kim
Junseok Seo
Yongfeng Zhang
Ruochen Xu
Xing Xie
Steven Euijong Whang
08 Mar 2024
SPUQ: Perturbation-Based Uncertainty Quantification for Large Language Models
Xiang Gao
Jiaxin Zhang
Lalla Mouatadid
Kamalika Das
04 Mar 2024
Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning
Debjit Paul
Robert West
Antoine Bosselut
Boi Faltings
ReLM, LRM
21 Feb 2024
Rowen: Adaptive Retrieval-Augmented Generation for Hallucination Mitigation in LLMs
Hanxing Ding
Liang Pang
Zihao Wei
Huawei Shen
Xueqi Cheng
HILM, RALM
16 Feb 2024
A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains
Alon Jacovi
Yonatan Bitton
Bernd Bohnet
Jonathan Herzig
Or Honovich
Michael Tseng
Michael Collins
Roee Aharoni
Mor Geva
LRM
01 Feb 2024
Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning
Tinghui Zhu
Kai Zhang
Jian Xie
Yu-Chuan Su
LRM
31 Jan 2024
Generative AI in EU Law: Liability, Privacy, Intellectual Property, and Cybersecurity
Social Science Research Network (SSRN), 2024
Claudio Novelli
F. Casolari
Philipp Hacker
Giorgio Spedicato
Luciano Floridi
AILaw, SILM
14 Jan 2024
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation
Computer Vision and Pattern Recognition (CVPR), 2024
Tong Wu
Guandao Yang
Zhibing Li
Kai Zhang
Ziwei Liu
Leonidas Guibas
Dahua Lin
Gordon Wetzstein
EGVM, VGen
08 Jan 2024
DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models
Wendi Cui
Jiaxin Zhang
Zhuohang Li
Lopez Damien
Kamalika Das
Sricharan Kumar
04 Jan 2024
The Persuasive Power of Large Language Models
Simon Martin Breum
Daniel Vaedele Egdal
Victor Gram Mortensen
Anders Giovanni Møller
L. Aiello
AI4CE
24 Dec 2023
LLM-SQL-Solver: Can LLMs Determine SQL Equivalence?
Fuheng Zhao
Lawrence Lim
Ishtiyaque Ahmad
D. Agrawal
Amr El Abbadi
16 Dec 2023
Making Large Language Models Better Knowledge Miners for Online Marketing with Progressive Prompting Augmentation
Chunjing Gan
Dan Yang
Binbin Hu
Ziqi Liu
Yue Shen
Qing Cui
Jinjie Gu
Jun Zhou
Guannan Zhang
08 Dec 2023
OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation
Computer Vision and Pattern Recognition (CVPR), 2023
Qidong Huang
Xiao-wen Dong
Pan Zhang
Sijin Yu
Conghui He
Yuan Liu
Dahua Lin
Weiming Zhang
Neng H. Yu
MLLM
29 Nov 2023
Calibrated Language Models Must Hallucinate
Symposium on the Theory of Computing (STOC), 2023
Adam Tauman Kalai
Santosh Vempala
HILM
24 Nov 2023
Ever: Mitigating Hallucination in Large Language Models through Real-Time Verification and Rectification
Haoqiang Kang
Juntong Ni
Huaxiu Yao
HILM, LRM
15 Nov 2023
Can Knowledge Graphs Reduce Hallucinations in LLMs?: A Survey
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Garima Agrawal
Tharindu Kumarage
Zeyad Alghami
Huanmin Liu
14 Nov 2023
A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions
Lei Huang
Weijiang Yu
Weitao Ma
Weihong Zhong
Zhangyin Feng
...
Qianglong Chen
Weihua Peng
Xiaocheng Feng
Bing Qin
Ting Liu
LRM, HILM
09 Nov 2023
In-Context Learning Dynamics with Random Binary Sequences
International Conference on Learning Representations (ICLR), 2023
Eric J. Bigelow
Ekdeep Singh Lubana
Robert P. Dick
Hidenori Tanaka
T. Ullman
26 Oct 2023