Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2304.10513
Cited By
v1
v2
v3 (latest)
Why Does ChatGPT Fall Short in Providing Truthful Answers?
20 April 2023
Shen Zheng
Jie Huang
Kevin Chen-Chuan Chang
HILM
AI4MH
Re-assign community
ArXiv (abs)
PDF
HTML
Github
Papers citing
"Why Does ChatGPT Fall Short in Providing Truthful Answers?"
43 / 43 papers shown
Large Language Models Hallucination: A Comprehensive Survey
Aisha Alansari
Hamzah Luqman
HILM
LRM
601
14
0
05 Oct 2025
LLM-based Agents Suffer from Hallucinations: A Survey of Taxonomy, Methods, and Directions
Xixun Lin
Yucheng Ning
Jingwen Zhang
Yan Dong
Y. Liu
...
Bin Wang
Yanan Cao
Kai-xiang Chen
Songlin Hu
Li Guo
LLMAG
LRM
415
22
0
23 Sep 2025
Exploring and Mitigating Fawning Hallucinations in Large Language Models
Zixuan Shangguan
Yanjie Dong
Lanjun Wang
Xiaoyi Fan
Victor C. M. Leung
Xiping Hu
128
3
0
31 Aug 2025
How Does Response Length Affect Long-Form Factuality
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
James Xu Zhao
Jimmy Z.J. Liu
Bryan Hooi
See-Kiong Ng
HILM
KELM
279
4
0
29 May 2025
GraphEval: A Lightweight Graph-Based LLM Framework for Idea Evaluation
International Conference on Learning Representations (ICLR), 2025
Tao Feng
Yihang Sun
Jiaxuan You
540
19
0
16 Mar 2025
Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators
AAAI Conference on Artificial Intelligence (AAAI), 2024
Jinjie Wei
Dongling Xiao
Jinjie Wei
Mingcheng Li
Zhaoyu Chen
Ke Li
Li Zhang
HILM
652
16
0
28 Jan 2025
AI Assistants for Spaceflight Procedures: Combining Generative Pre-Trained Transformer and Retrieval-Augmented Generation on Knowledge Graphs With Augmented Reality Cues
Oliver Bensch
Leonie Bensch
Tommy Nilsson
Florian Saling
Bernd Bewer
Sophie Jentzsch
Tobias Hecking
J. Nathan Kutz
158
3
0
21 Sep 2024
See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses
Yulong Chen
Yang Liu
Jianhao Yan
X. Bai
Ming Zhong
Yinghao Yang
Ziyi Yang
Chenguang Zhu
Yue Zhang
ALM
ELM
238
20
0
16 Aug 2024
Order Matters in Hallucination: Reasoning Order as Benchmark and Reflexive Prompting for Large-Language-Models
Zikai Xie
HILM
LRM
620
14
0
09 Aug 2024
Improving Faithfulness of Large Language Models in Summarization via Sliding Generation and Self-Consistency
Taiji Li
Zhi Li
Yin Zhang
HILM
378
23
0
31 Jul 2024
How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions
Bojana Bašaragin
Adela Ljajić
Darija Medvecki
Lorenzo Cassano
Milos Kosprdic
Nikola Milosevic
LM&MA
353
11
0
06 Jul 2024
REAL Sampling: Boosting Factuality and Diversity of Open-Ended Generation via Asymptotic Entropy
Haw-Shiuan Chang
Nanyun Peng
Mohit Bansal
Anil Ramakrishna
Tagyoung Chung
HILM
285
9
0
11 Jun 2024
HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level Hallucination Evaluation
Wen Luo
Tianshu Shen
Wei Li
Guangyue Peng
Richeng Xuan
Houfeng Wang
Xi Yang
HILM
353
27
0
11 Jun 2024
Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework
Xiaoxi Sun
Jinpeng Li
Yan Zhong
Dongyan Zhao
Rui Yan
LLMAG
HILM
307
21
0
05 Jun 2024
Luna: An Evaluation Foundation Model to Catch Language Model Hallucinations with High Accuracy and Low Cost
Masha Belyi
Robert Friel
Shuai Shao
Atindriyo Sanyal
HILM
RALM
489
8
0
03 Jun 2024
A Survey of Useful LLM Evaluation
Ji-Lun Peng
Sijia Cheng
Egil Diau
Yung-Yu Shih
Po-Heng Chen
Yen-Ting Lin
Yun-Nung Chen
LLMAG
ELM
324
36
0
03 Jun 2024
Towards Rationality in Language and Multimodal Agents: A Survey
Bowen Jiang
Yangxinyu Xie
Xiaomeng Wang
Yuan Yuan
Camillo J Taylor
Tanwi Mallick
Weijie J. Su
Camillo J. Taylor
Tanwi Mallick
LLMAG
449
4
0
01 Jun 2024
Mitigating Hallucinations in Large Language Models via Self-Refinement-Enhanced Knowledge Retrieval
Mengjia Niu
Hao Li
Jie Shi
Hamed Haddadi
Fan Mo
HILM
235
29
0
10 May 2024
Optimizing Language Augmentation for Multilingual Large Language Models: A Case Study on Korean
Changsu Choi
Yongbin Jeong
Seoyoon Park
Inho Won
HyeonSeok Lim
...
Yiseul Lee
HyeJin Lee
Younggyun Hahm
Hansaem Kim
Kyungtae Lim
340
24
0
16 Mar 2024
Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents
Corby Rosset
Ho-Lam Chung
Guanghui Qin
Ethan C. Chau
Zhuo Feng
Ahmed Hassan Awadallah
Jennifer Neville
Nikhil Rao
330
25
0
27 Feb 2024
Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in Closed-Source LLMs
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2024
Simone Balloccu
Patrícia Schmidtová
Mateusz Lango
Ondrej Dusek
SILM
ELM
PILM
565
300
0
06 Feb 2024
Alignment for Honesty
Neural Information Processing Systems (NeurIPS), 2023
Yuqing Yang
Ethan Chern
Xipeng Qiu
Graham Neubig
Pengfei Liu
318
64
0
12 Dec 2023
Axiomatic Preference Modeling for Longform Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Corby Rosset
Guoqing Zheng
Victor C. Dibia
Ahmed Hassan Awadallah
Paul Bennett
SyDa
229
8
0
02 Dec 2023
On the Calibration of Large Language Models and Alignment
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Chiwei Zhu
Benfeng Xu
Quan Wang
Yongdong Zhang
Zhendong Mao
366
79
0
22 Nov 2023
SAC3: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check Consistency
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jiaxin Zhang
Zhuohang Li
Kamalika Das
Sricharan Kumar
Kumar Sricharan
HILM
LRM
422
103
0
03 Nov 2023
Critical Role of Artificially Intelligent Conversational Chatbot
S. A. Mostafa
Md Z. Islam
Mohammad Z. Islam
Fairose Jeehan
Saujanna Jafreen
Raihan U. Islam
AI4MH
144
0
0
31 Oct 2023
Examining the Potential and Pitfalls of ChatGPT in Science and Engineering Problem-Solving
Frontiers in Education (FIE), 2023
Karen D. Wang
E. Burkholder
Carl E. Wieman
S. Salehi
Nicholas Haber
AI4CE
ELM
248
71
0
12 Oct 2023
Large Language Models can Learn Rules
Zhaocheng Zhu
Yuan Xue
Xinyun Chen
Denny Zhou
Jian Tang
Dale Schuurmans
Hanjun Dai
LRM
ReLM
369
91
0
10 Oct 2023
Chain of Natural Language Inference for Reducing Large Language Model Ungrounded Hallucinations
Deren Lei
Yaxi Li
Mengya Hu
Mingyu Wang
Vincent Yun
Emily Ching
Eslam Kamal
HILM
LRM
364
60
0
06 Oct 2023
Evaluating Hallucinations in Chinese Large Language Models
Qinyuan Cheng
Tianxiang Sun
Wenwei Zhang
Siyin Wang
Xiangyang Liu
...
Junliang He
Mianqiu Huang
Zhangyue Yin
Kai Chen
Xipeng Qiu
HILM
ELM
317
44
0
05 Oct 2023
Dodo: Dynamic Contextual Compression for Decoder-only LMs
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Guanghui Qin
Corby Rosset
Ethan C. Chau
Nikhil Rao
Benjamin Van Durme
264
19
0
03 Oct 2023
Large Language Models Cannot Self-Correct Reasoning Yet
International Conference on Learning Representations (ICLR), 2023
Jie Huang
Xinyun Chen
Swaroop Mishra
Huaixiu Steven Zheng
Adams Wei Yu
Xinying Song
Denny Zhou
ReLM
LRM
742
819
0
03 Oct 2023
AutoHall: Automated Factuality Hallucination Dataset Generation for Large Language Models
IEEE Transactions on Audio, Speech, and Language Processing (IEEE TASLP), 2023
Zouying Cao
Yifei Yang
Hai Zhao
Hai Zhao
HILM
676
12
0
30 Sep 2023
Quantifying and Attributing the Hallucination of Large Language Models via Association Analysis
Li Du
Yequan Wang
Xingrun Xing
Yiqun Ya
Xiang Li
Xin Jiang
Xuezhi Fang
HILM
251
23
0
11 Sep 2023
Are Emergent Abilities in Large Language Models just In-Context Learning?
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Sheng Lu
Irina Bigoulaeva
Rachneet Sachdeva
Harish Tayyar Madabushi
Iryna Gurevych
LRM
ELM
ReLM
484
151
0
04 Sep 2023
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models
Computational Linguistics (CL), 2023
Yue Zhang
Yafu Li
Leyang Cui
Deng Cai
Lemao Liu
...
Longyue Wang
Anh Tuan Luu
Freda Shi
Shuming Shi
Shuming Shi
LRM
RALM
HILM
851
953
0
03 Sep 2023
Leveraging Explainable AI to Analyze Researchers' Aspect-Based Sentiment about ChatGPT
International Conference on Intelligent Human Computer Interaction (IHCI), 2023
S. Lakhanpal
Ajay Gupta
R. Agrawal
264
2
0
16 Aug 2023
RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder Language Models
Jie Huang
Ming-Yu Liu
Peng Xu
Mohammad Shoeybi
Kevin Chen-Chuan Chang
Bryan Catanzaro
RALM
281
45
0
15 Aug 2023
Through the Lens of Core Competency: Survey on Evaluation of Large Language Models
China National Conference on Chinese Computational Linguistics (CNCCL), 2023
Ziyu Zhuang
Qiguang Chen
Longxuan Ma
Mingda Li
Yi Han
Yushan Qian
Haopeng Bai
Zixian Feng
Weinan Zhang
Ting Liu
ELM
221
24
0
15 Aug 2023
The Hitchhiker's Guide to Program Analysis: A Journey with Large Language Models
Haonan Li
Yu Hao
Yizhuo Zhai
Zhiyun Qian
LLMAG
273
37
0
01 Aug 2023
Citation: A Key to Building Responsible and Accountable Large Language Models
Jie Huang
Kevin Chen-Chuan Chang
HILM
402
33
0
05 Jul 2023
Towards Reasoning in Large Language Models: A Survey
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Jie Huang
Kevin Chen-Chuan Chang
LM&MA
ELM
LRM
1.3K
872
0
20 Dec 2022
Can Language Models Be Specific? How?
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Jie Huang
Kevin Chen-Chuan Chang
Jinjun Xiong
Wen-mei W. Hwu
241
9
0
11 Oct 2022
1
Page 1 of 1