2402.00367
Cited By
v1
v2 (latest)
Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration
1 February 2024
Shangbin Feng
Weijia Shi
Yike Wang
Wenxuan Ding
Vidhisha Balachandran
Yulia Tsvetkov
ArXiv (abs)
PDF
HTML
Github (29★)
Papers citing
"Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration"
50 / 64 papers shown
WISE: Weighted Iterative Society-of-Experts for Robust Multimodal Multi-Agent Debate
A. Cherian
River Doyle
Eyal Ben-Dov
Suhas Lohit
Kuan-Chuan Peng
LLMAG
MoE
142
0
0
02 Dec 2025
LORE: A Large Generative Model for Search Relevance
Chenji Lu
Zhuo Chen
Hui Zhao
Zhiyuan Zeng
Gang Zhao
...
Haoran Li
Songyan Liu
P. Wang
Jian Xu
Bo Zheng
OffRL
AI4TS
LRM
502
1
0
02 Dec 2025
Hallucinate Less by Thinking More: Aspect-Based Causal Abstention for Large Language Models
Vy Nguyen
Ziqi Xu
J. Chan
Estrid He
Feng Xia
Xiuzhen Zhang
183
1
0
21 Nov 2025
ZoFia: Zero-Shot Fake News Detection with Entity-Guided Retrieval and Multi-LLM Interaction
Lvhua Wu
Xuefeng Jiang
Sheng Sun
Tian Wen
Yuwei Wang
Min Liu
175
2
0
03 Nov 2025
Why Do Multilingual Reasoning Gaps Emerge in Reasoning Language Models?
Deokhyung Kang
Seonjeong Hwang
Daehui Kim
Hyounghun Kim
Gary Geunbae Lee
LRM
234
3
0
31 Oct 2025
HACK: Hallucinations Along Certainty and Knowledge Axes
Adi Simhi
Jonathan Herzig
Itay Itzhak
Dana Arad
Zorik Gekhman
Roi Reichart
Fazl Barez
Gabriel Stanovsky
Idan Szpektor
Yonatan Belinkov
249
3
0
28 Oct 2025
FinTrust: A Comprehensive Benchmark of Trustworthiness Evaluation in Finance Domain
Tiansheng Hu
Tongyan Hu
Liuyang Bai
Yilun Zhao
Arman Cohan
Chen Zhao
179
5
0
17 Oct 2025
CaRT: Teaching LLM Agents to Know When They Know Enough
Grace Liu
Yuxiao Qu
J. Schneider
Aarti Singh
Aviral Kumar
LRM
172
0
0
09 Oct 2025
LLM Chemistry Estimation for Multi-LLM Recommendation
H. Sánchez
Briland Hitaj
172
1
0
04 Oct 2025
Sample, Align, Synthesize: Graph-Based Response Synthesis with ConGrs
Sayan Ghosh
Shahzaib Saqib Warraich
Dhruv Tarsadiya
Gregory Yauney
Swabha Swayamdipta
220
0
0
03 Oct 2025
Detecting (Un)answerability in Large Language Models with Linear Directions
Maor Juliet Lavi
Tova Milo
Mor Geva
172
3
0
26 Sep 2025
Predicting Language Models' Success at Zero-Shot Probabilistic Prediction
Kevin Ren
Santiago Cortes-Gomez
Carlos Patiño
Ananya Joshi
Ruiqi Lyu
Jingjing Tang
Alistair Turcan
Khurram Yamin
Steven Wu
Bryan Wilder
164
2
0
18 Sep 2025
A Systematic Survey on Large Language Models for Evolutionary Optimization: From Modeling to Solving
Yisong Zhang
Ran Cheng
Guoxing Yi
Kay Chen Tan
OffRL
462
9
0
10 Sep 2025
X-SQL: Expert Schema Linking and Understanding of Text-to-SQL with Multi-LLMs
Dazhi Peng
130
0
0
07 Sep 2025
Do Retrieval Augmented Language Models Know When They Don't Know?
Youchao Zhou
Heyan Huang
Yicheng Liu
Rui Dai
Xinglin Wang
Xingchen Zhang
Shumin Shi
Yang Deng
225
0
0
01 Sep 2025
Identifying and Answering Questions with False Assumptions: An Interpretable Approach
Zijie Wang
Eduardo Blanco
HILM
257
0
0
21 Aug 2025
Expertise-aware Multi-LLM Recruitment and Collaboration for Medical Decision-Making
Liuxin Bao
Zhihao Peng
Xiaofei Zhou
Runmin Cong
Jiyong Zhang
Yixuan Yuan
297
1
0
19 Aug 2025
The Role of Model Confidence on Bias Effects in Measured Uncertainties for Vision-Language Models
Xinyi Liu
Weiguang Wang
Hangfeng He
298
0
0
20 Jun 2025
AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions
Polina Kirichenko
Mark Ibrahim
Kamalika Chaudhuri
Samuel J. Bell
LRM
268
46
0
10 Jun 2025
SPARTA ALIGNMENT: Collectively Aligning Multiple Language Models through Combat
Yuru Jiang
Wenxuan Ding
Shangbin Feng
Greg Durrett
Yulia Tsvetkov
450
4
0
05 Jun 2025
High Accuracy, Less Talk (HALT): Reliable LLMs through Capability-Aligned Finetuning
Tim Franzmeyer
Archie Sravankumar
Lijuan Liu
Yuning Mao
Rui Hou
Sinong Wang
Jakob Foerster
Luke Zettlemoyer
Madian Khabsa
KELM
ALM
296
0
0
04 Jun 2025
Delta-KNN: Improving Demonstration Selection in In-Context Learning for Alzheimer's Disease Detection
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Chuyuan Li
Raymond Li
Thalia S. Field
Giuseppe Carenini
331
1
0
04 Jun 2025
Position: Agent Should Invoke External Tools ONLY When Epistemically Necessary
Hongru Wang
Cheng Qian
Pengfei Yu
Jiahao Qiu
Boyang Xue
Mengdi Wang
Heng Ji
Kam-Fai Wong
443
10
0
01 Jun 2025
Measuring Faithfulness and Abstention: An Automated Pipeline for Evaluating LLM-Generated 3-ply Case-Based Legal Arguments
Li Zhang
Morgan A. Gray
Jaromír Šavelka
Kevin D. Ashley
258
1
0
31 May 2025
CausalAbstain: Enhancing Multilingual LLMs with Causal Reasoning for Trustworthy Abstention
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yuxi Sun
Aoqi Zuo
Wei Gao
Jing Ma
357
4
0
31 May 2025
Multiple LLM Agents Debate for Equitable Cultural Alignment
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Dayeon Ki
Rachel Rudinger
Tianyi Zhou
Marine Carpuat
LLMAG
436
12
0
30 May 2025
Do We Know What LLMs Don't Know? A Study of Consistency in Knowledge Probing
Raoyuan Zhao
Abdullatif Köksal
Ali Modarressi
Michael A. Hedderich
Hinrich Schütze
260
5
0
27 May 2025
Automatic Transmission for LLM Tiers: Optimizing Cost and Accuracy in Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Injae Na
Keonwoong Noh
Woohwan Jung
293
0
0
27 May 2025
Deriving Strategic Market Insights with Large Language Models: A Benchmark for Forward Counterfactual Generation
Keane Ong
Rui Mao
Deeksha Varshney
Paul Pu Liang
Erik Cambria
G. Mengaldo
AIFin
OffRL
412
2
0
26 May 2025
InFact: Informativeness Alignment for Improved LLM Factuality
Roi Cohen
Russa Biswas
Gerard de Melo
273
1
0
26 May 2025
GUARDIAN: Safeguarding LLM Multi-Agent Collaborations with Temporal Graph Modeling
Jialong Zhou
L. Wang
Xiao Yang
LLMAG
466
13
0
25 May 2025
Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding
Computer Vision and Pattern Recognition (CVPR), 2025
Feilong Tang
Chengzhi Liu
Zhongxing Xu
Ming Hu
Zelin Peng
...
Minquan Lin
Yifan Peng
Xuelian Cheng
Imran Razzak
Zongyuan Ge
417
33
0
22 May 2025
A Weighted Byzantine Fault Tolerance Consensus Driven Trusted Multiple Large Language Models Network
IEEE Transactions on Cognitive Communications and Networking (TCCN), 2025
Haoxiang Luo
Gang Sun
Yinqiu Liu
Dongcheng Zhao
Dusit Niyato
Hongfang Yu
Schahram Dustdar
284
16
0
08 May 2025
Comparing Uncertainty Measurement and Mitigation Methods for Large Language Models: A Systematic Review
Toghrul Abbasli
Kentaroh Toyoda
Yuan Wang
Leon Witt
Muhammad Asif Ali
Yukai Miao
Dan Li
Qingsong Wei
UQCV
HILM
688
2
0
25 Apr 2025
HalluLens: LLM Hallucination Benchmark
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yejin Bang
Ziwei Ji
Alan Schelten
Anthony Hartshorn
Tara Fowler
Cheng Zhang
Nicola Cancedda
Pascale Fung
HILM
585
68
0
24 Apr 2025
Bottom-Up Synthesis of Knowledge-Grounded Task-Oriented Dialogues with Iteratively Self-Refined Prompts
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Kun Qian
Maximillian Chen
Siyan Li
Arpit Sharma
Zhou Yu
246
1
0
19 Apr 2025
MKA: Leveraging Cross-Lingual Consensus for Model Abstention
Sharad Duwal
385
1
0
31 Mar 2025
FACTS&EVIDENCE: An Interactive Tool for Transparent Fine-Grained Factual Verification of Machine-Generated Text
Varich Boonsanong
Vidhisha Balachandran
Xiaochuang Han
Shangbin Feng
Lucy Lu Wang
Yulia Tsvetkov
411
4
0
19 Mar 2025
MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
David Wan
Justin Chih-Yao Chen
Elias Stengel-Eskin
Joey Tianyi Zhou
LLMAG
LRM
406
8
0
19 Mar 2025
Don't lie to your friends: Learning what you know from collaborative self-play
Jacob Eisenstein
Reza Aghajani
Adam Fisch
Dheeru Dua
Fantine Huot
Mirella Lapata
Vicky Zayats
Jonathan Berant
504
8
0
18 Mar 2025
Calibrating Verbal Uncertainty as a Linear Feature to Reduce Hallucinations
Ziwei Ji
L. Yu
Yeskendir Koishekenov
Yejin Bang
Anthony Hartshorn
Alan Schelten
Cheng Zhang
Pascale Fung
Nicola Cancedda
592
26
0
18 Mar 2025
Unlocking a New Rust Programming Experience: Fast and Slow Thinking with LLMs to Conquer Undefined Behaviors
Design Automation Conference (DAC), 2025
Renshuang Jiang
Pan Dong
Zhenling Duan
Yu Shi
Xiaoxiang Fang
Yan Ding
Jun Ma
Shuai Zhao
Zhe Jiang
224
0
0
04 Mar 2025
Answer, Refuse, or Guess? Investigating Risk-Aware Decision Making in Language Models
Cheng-Kuang Wu
Zhi Rui Tam
Chieh-Yen Lin
Yun-Nung Chen
Hung-yi Lee
353
4
0
03 Mar 2025
Conformal Linguistic Calibration: Trading-off between Factuality and Specificity
Zhengping Jiang
Anqi Liu
Benjamin Van Durme
651
10
0
26 Feb 2025
R2-KG: General-Purpose Dual-Agent Framework for Reliable Reasoning on Knowledge Graphs
Sumin Jo
Junseong Choi
Jiho Kim
Edward Choi
586
4
0
18 Feb 2025
Implicit Communication of Contextual Information in Human-Robot Collaboration
IEEE/ACM International Conference on Human-Robot Interaction (HRI), 2025
Yan Zhang
225
2
0
09 Feb 2025
A Survey on Uncertainty Quantification of Large Language Models: Taxonomy, Open Research Challenges, and Future Directions
ACM Computing Surveys (ACM CSUR), 2024
Ola Shorinwa
Zhiting Mei
Justin Lidard
Allen Z. Ren
Anirudha Majumdar
HILM
LRM
511
19
0
07 Dec 2024
Fact Recall, Heuristics or Pure Guesswork? Precise Interpretations of Language Models for Fact Completion
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Denitsa Saynova
Lovisa Hagström
Moa Johansson
Richard Johansson
Marco Kuhlmann
HILM
679
2
0
18 Oct 2024
ETF: An Entity Tracing Framework for Hallucination Detection in Code Summaries
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Kishan Maharaj
Vitobha Munigala
Srikanth G. Tamilselvam
Praveen Venkateswaran
Sayandeep Sen
Palani Kodeswaran
Abhijit Mishra
Pushpak Bhattacharyya
HILM
485
3
0
17 Oct 2024
Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation
International Conference on Learning Representations (ICLR), 2024
Yiming Wang
Pei Zhang
Baosong Yang
Yang Li
Rui Wang
LRM
430
48
0
17 Oct 2024