v1v2 (latest)

Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration

1 February 2024

Shangbin Feng

Weijia Shi

Yike Wang

Wenxuan Ding

Vidhisha Balachandran

Yulia Tsvetkov

ArXiv (abs)PDF HTML Github (29★)

Papers citing "Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration"

50 / 64 papers shown

WISE: Weighted Iterative Society-of-Experts for Robust Multimodal Multi-Agent Debate

142

02 Dec 2025

LORE: A Large Generative Model for Search Relevance

...

502

02 Dec 2025

Hallucinate Less by Thinking More: Aspect-Based Causal Abstention for Large Language Models

183

21 Nov 2025

ZoFia: Zero-Shot Fake News Detection with Entity-Guided Retrieval and Multi-LLM Interaction

175

03 Nov 2025

Why Do Multilingual Reasoning Gaps Emerge in Reasoning Language Models?

234

31 Oct 2025

HACK: Hallucinations Along Certainty and Knowledge Axes

249

28 Oct 2025

FinTrust: A Comprehensive Benchmark of Trustworthiness Evaluation in Finance Domain

179

17 Oct 2025

CaRT: Teaching LLM Agents to Know When They Know Enough

172

09 Oct 2025

LLM Chemistry Estimation for Multi-LLM Recommendation

H. Sánchez

Briland Hitaj

172

04 Oct 2025

Sample, Align, Synthesize: Graph-Based Response Synthesis with ConGrs

Sayan Ghosh

Shahzaib Saqib Warraich

Dhruv Tarsadiya

Gregory Yauney

Swabha Swayamdipta

220

03 Oct 2025

Detecting (Un)answerability in Large Language Models with Linear Directions

Maor Juliet Lavi

Tova Milo

Mor Geva

172

26 Sep 2025

Predicting Language Models' Success at Zero-Shot Probabilistic Prediction

Kevin Ren

Santiago Cortes-Gomez

164

18 Sep 2025

A Systematic Survey on Large Language Models for Evolutionary Optimization: From Modeling to Solving

462

10 Sep 2025

X-SQL: Expert Schema Linking and Understanding of Text-to-SQL with Multi-LLMs

Dazhi Peng

130

07 Sep 2025

Do Retrieval Augmented Language Models Know When They Don't Know?

225

01 Sep 2025

Identifying and Answering Questions with False Assumptions: An Interpretable Approach

Zijie Wang

Eduardo Blanco

HILM

257

21 Aug 2025

Expertise-aware Multi-LLM Recruitment and Collaboration for Medical Decision-Making

297

19 Aug 2025

The Role of Model Confidence on Bias Effects in Measured Uncertainties for Vision-Language Models

Xinyi Liu

Weiguang Wang

Hangfeng He

298

20 Jun 2025

AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions

268

10 Jun 2025

SPARTA ALIGNMENT: Collectively Aligning Multiple Language Models through Combat

450

05 Jun 2025

High Accuracy, Less Talk (HALT): Reliable LLMs through Capability-Aligned Finetuning

296

04 Jun 2025

Delta-KNN: Improving Demonstration Selection in In-Context Learning for Alzheimer's Disease DetectionAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

331

04 Jun 2025

Position: Agent Should Invoke External Tools ONLY When Epistemically Necessary

443

01 Jun 2025

Measuring Faithfulness and Abstention: An Automated Pipeline for Evaluating LLM-Generated 3-ply Case-Based Legal Arguments

258

31 May 2025

CausalAbstain: Enhancing Multilingual LLMs with Causal Reasoning for Trustworthy AbstentionAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

357

31 May 2025

Multiple LLM Agents Debate for Equitable Cultural AlignmentAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

436

30 May 2025

Do We Know What LLMs Don't Know? A Study of Consistency in Knowledge Probing

260

27 May 2025

Automatic Transmission for LLM Tiers: Optimizing Cost and Accuracy in Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Injae Na

Keonwoong Noh

Woohwan Jung

293

27 May 2025

Deriving Strategic Market Insights with Large Language Models: A Benchmark for Forward Counterfactual Generation

412

26 May 2025

InFact: Informativeness Alignment for Improved LLM Factuality

Roi Cohen

Russa Biswas

Gerard de Melo

273

26 May 2025

GUARDIAN: Safeguarding LLM Multi-Agent Collaborations with Temporal Graph Modeling

466

25 May 2025

Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal DecodingComputer Vision and Pattern Recognition (CVPR), 2025

...

417

22 May 2025

A Weighted Byzantine Fault Tolerance Consensus Driven Trusted Multiple Large Language Models NetworkIEEE Transactions on Cognitive Communications and Networking (TCCN), 2025

284

08 May 2025

Comparing Uncertainty Measurement and Mitigation Methods for Large Language Models: A Systematic Review

688

25 Apr 2025

HalluLens: LLM Hallucination BenchmarkAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

585

24 Apr 2025

Bottom-Up Synthesis of Knowledge-Grounded Task-Oriented Dialogues with Iteratively Self-Refined PromptsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

246

19 Apr 2025

MKA: Leveraging Cross-Lingual Consensus for Model Abstention

Sharad Duwal

385

31 Mar 2025

FACTS&EVIDENCE: An Interactive Tool for Transparent Fine-Grained Factual Verification of Machine-Generated Text

Varich Boonsanong

Vidhisha Balachandran

411

19 Mar 2025

MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent CollaborationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

406

19 Mar 2025

Don't lie to your friends: Learning what you know from collaborative self-play

504

18 Mar 2025

Calibrating Verbal Uncertainty as a Linear Feature to Reduce Hallucinations

Ziwei Ji

L. Yu

Yeskendir Koishekenov

592

18 Mar 2025

Unlocking a New Rust Programming Experience: Fast and Slow Thinking with LLMs to Conquer Undefined BehaviorsDesign Automation Conference (DAC), 2025

224

04 Mar 2025

Answer, Refuse, or Guess? Investigating Risk-Aware Decision Making in Language Models

353

03 Mar 2025

Conformal Linguistic Calibration: Trading-off between Factuality and Specificity

Zhengping Jiang

Anqi Liu

Benjamin Van Durme

651

26 Feb 2025

R2-KG: General-Purpose Dual-Agent Framework for Reliable Reasoning on Knowledge Graphs

586

18 Feb 2025

Implicit Communication of Contextual Information in Human-Robot CollaborationIEEE/ACM International Conference on Human-Robot Interaction (HRI), 2025

Yan Zhang

225

09 Feb 2025

A Survey on Uncertainty Quantification of Large Language Models: Taxonomy, Open Research Challenges, and Future DirectionsACM Computing Surveys (ACM CSUR), 2024

511

07 Dec 2024

Fact Recall, Heuristics or Pure Guesswork? Precise Interpretations of Language Models for Fact CompletionAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

679

18 Oct 2024

ETF: An Entity Tracing Framework for Hallucination Detection in Code SummariesAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Kishan Maharaj

Vitobha Munigala

Srikanth G. Tamilselvam

Praveen Venkateswaran

Sayandeep Sen

Palani Kodeswaran

Abhijit Mishra

Pushpak Bhattacharyya

HILM

485

17 Oct 2024

Latent Space Chain-of-Embedding Enables Output-free LLM Self-EvaluationInternational Conference on Learning Representations (ICLR), 2024

Yiming Wang

430

17 Oct 2024