ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.10513
  4. Cited By
Why Does ChatGPT Fall Short in Providing Truthful Answers?
v1v2v3 (latest)

Why Does ChatGPT Fall Short in Providing Truthful Answers?

20 April 2023
Shen Zheng
Jie Huang
Kevin Chen-Chuan Chang
    HILMAI4MH
ArXiv (abs)PDFHTMLGithub

Papers citing "Why Does ChatGPT Fall Short in Providing Truthful Answers?"

43 / 43 papers shown
Large Language Models Hallucination: A Comprehensive Survey
Large Language Models Hallucination: A Comprehensive Survey
Aisha Alansari
Hamzah Luqman
HILMLRM
601
14
0
05 Oct 2025
LLM-based Agents Suffer from Hallucinations: A Survey of Taxonomy, Methods, and Directions
LLM-based Agents Suffer from Hallucinations: A Survey of Taxonomy, Methods, and Directions
Xixun Lin
Yucheng Ning
Jingwen Zhang
Yan Dong
Y. Liu
...
Bin Wang
Yanan Cao
Kai-xiang Chen
Songlin Hu
Li Guo
LLMAGLRM
415
22
0
23 Sep 2025
Exploring and Mitigating Fawning Hallucinations in Large Language Models
Exploring and Mitigating Fawning Hallucinations in Large Language Models
Zixuan Shangguan
Yanjie Dong
Lanjun Wang
Xiaoyi Fan
Victor C. M. Leung
Xiping Hu
128
3
0
31 Aug 2025
How Does Response Length Affect Long-Form Factuality
How Does Response Length Affect Long-Form FactualityAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
James Xu Zhao
Jimmy Z.J. Liu
Bryan Hooi
See-Kiong Ng
HILMKELM
279
4
0
29 May 2025
GraphEval: A Lightweight Graph-Based LLM Framework for Idea Evaluation
GraphEval: A Lightweight Graph-Based LLM Framework for Idea EvaluationInternational Conference on Learning Representations (ICLR), 2025
Tao Feng
Yihang Sun
Jiaxuan You
540
19
0
16 Mar 2025
Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators
Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful ComparatorsAAAI Conference on Artificial Intelligence (AAAI), 2024
Jinjie Wei
Dongling Xiao
Jinjie Wei
Mingcheng Li
Zhaoyu Chen
Ke Li
Li Zhang
HILM
652
16
0
28 Jan 2025
AI Assistants for Spaceflight Procedures: Combining Generative
  Pre-Trained Transformer and Retrieval-Augmented Generation on Knowledge
  Graphs With Augmented Reality Cues
AI Assistants for Spaceflight Procedures: Combining Generative Pre-Trained Transformer and Retrieval-Augmented Generation on Knowledge Graphs With Augmented Reality Cues
Oliver Bensch
Leonie Bensch
Tommy Nilsson
Florian Saling
Bernd Bewer
Sophie Jentzsch
Tobias Hecking
J. Nathan Kutz
158
3
0
21 Sep 2024
See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering
  LLM Weaknesses
See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses
Yulong Chen
Yang Liu
Jianhao Yan
X. Bai
Ming Zhong
Yinghao Yang
Ziyi Yang
Chenguang Zhu
Yue Zhang
ALMELM
238
20
0
16 Aug 2024
Order Matters in Hallucination: Reasoning Order as Benchmark and Reflexive Prompting for Large-Language-Models
Order Matters in Hallucination: Reasoning Order as Benchmark and Reflexive Prompting for Large-Language-Models
Zikai Xie
HILMLRM
620
14
0
09 Aug 2024
Improving Faithfulness of Large Language Models in Summarization via
  Sliding Generation and Self-Consistency
Improving Faithfulness of Large Language Models in Summarization via Sliding Generation and Self-Consistency
Taiji Li
Zhi Li
Yin Zhang
HILM
378
23
0
31 Jul 2024
How do you know that? Teaching Generative Language Models to Reference
  Answers to Biomedical Questions
How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions
Bojana Bašaragin
Adela Ljajić
Darija Medvecki
Lorenzo Cassano
Milos Kosprdic
Nikola Milosevic
LM&MA
353
11
0
06 Jul 2024
REAL Sampling: Boosting Factuality and Diversity of Open-Ended
  Generation via Asymptotic Entropy
REAL Sampling: Boosting Factuality and Diversity of Open-Ended Generation via Asymptotic Entropy
Haw-Shiuan Chang
Nanyun Peng
Mohit Bansal
Anil Ramakrishna
Tagyoung Chung
HILM
285
9
0
11 Jun 2024
HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level
  Hallucination Evaluation
HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level Hallucination Evaluation
Wen Luo
Tianshu Shen
Wei Li
Guangyue Peng
Richeng Xuan
Houfeng Wang
Xi Yang
HILM
353
27
0
11 Jun 2024
Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent
  Debate Framework
Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework
Xiaoxi Sun
Jinpeng Li
Yan Zhong
Dongyan Zhao
Rui Yan
LLMAGHILM
307
21
0
05 Jun 2024
Luna: An Evaluation Foundation Model to Catch Language Model
  Hallucinations with High Accuracy and Low Cost
Luna: An Evaluation Foundation Model to Catch Language Model Hallucinations with High Accuracy and Low Cost
Masha Belyi
Robert Friel
Shuai Shao
Atindriyo Sanyal
HILMRALM
489
8
0
03 Jun 2024
A Survey of Useful LLM Evaluation
A Survey of Useful LLM Evaluation
Ji-Lun Peng
Sijia Cheng
Egil Diau
Yung-Yu Shih
Po-Heng Chen
Yen-Ting Lin
Yun-Nung Chen
LLMAGELM
324
36
0
03 Jun 2024
Towards Rationality in Language and Multimodal Agents: A Survey
Towards Rationality in Language and Multimodal Agents: A Survey
Bowen Jiang
Yangxinyu Xie
Xiaomeng Wang
Yuan Yuan
Camillo J Taylor
Tanwi Mallick
Weijie J. Su
Camillo J. Taylor
Tanwi Mallick
LLMAG
449
4
0
01 Jun 2024
Mitigating Hallucinations in Large Language Models via
  Self-Refinement-Enhanced Knowledge Retrieval
Mitigating Hallucinations in Large Language Models via Self-Refinement-Enhanced Knowledge Retrieval
Mengjia Niu
Hao Li
Jie Shi
Hamed Haddadi
Fan Mo
HILM
235
29
0
10 May 2024
Optimizing Language Augmentation for Multilingual Large Language Models:
  A Case Study on Korean
Optimizing Language Augmentation for Multilingual Large Language Models: A Case Study on Korean
Changsu Choi
Yongbin Jeong
Seoyoon Park
Inho Won
HyeonSeok Lim
...
Yiseul Lee
HyeJin Lee
Younggyun Hahm
Hansaem Kim
Kyungtae Lim
340
24
0
16 Mar 2024
Researchy Questions: A Dataset of Multi-Perspective, Decompositional
  Questions for LLM Web Agents
Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents
Corby Rosset
Ho-Lam Chung
Guanghui Qin
Ethan C. Chau
Zhuo Feng
Ahmed Hassan Awadallah
Jennifer Neville
Nikhil Rao
330
25
0
27 Feb 2024
Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in
  Closed-Source LLMs
Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in Closed-Source LLMsConference of the European Chapter of the Association for Computational Linguistics (EACL), 2024
Simone Balloccu
Patrícia Schmidtová
Mateusz Lango
Ondrej Dusek
SILMELMPILM
565
300
0
06 Feb 2024
Alignment for Honesty
Alignment for HonestyNeural Information Processing Systems (NeurIPS), 2023
Yuqing Yang
Ethan Chern
Xipeng Qiu
Graham Neubig
Pengfei Liu
318
64
0
12 Dec 2023
Axiomatic Preference Modeling for Longform Question Answering
Axiomatic Preference Modeling for Longform Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Corby Rosset
Guoqing Zheng
Victor C. Dibia
Ahmed Hassan Awadallah
Paul Bennett
SyDa
229
8
0
02 Dec 2023
On the Calibration of Large Language Models and Alignment
On the Calibration of Large Language Models and AlignmentConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Chiwei Zhu
Benfeng Xu
Quan Wang
Yongdong Zhang
Zhendong Mao
366
79
0
22 Nov 2023
SAC3: Reliable Hallucination Detection in Black-Box Language Models via
  Semantic-aware Cross-check Consistency
SAC3: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check ConsistencyConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jiaxin Zhang
Zhuohang Li
Kamalika Das
Sricharan Kumar
Kumar Sricharan
HILMLRM
422
103
0
03 Nov 2023
Critical Role of Artificially Intelligent Conversational Chatbot
Critical Role of Artificially Intelligent Conversational Chatbot
S. A. Mostafa
Md Z. Islam
Mohammad Z. Islam
Fairose Jeehan
Saujanna Jafreen
Raihan U. Islam
AI4MH
144
0
0
31 Oct 2023
Examining the Potential and Pitfalls of ChatGPT in Science and
  Engineering Problem-Solving
Examining the Potential and Pitfalls of ChatGPT in Science and Engineering Problem-SolvingFrontiers in Education (FIE), 2023
Karen D. Wang
E. Burkholder
Carl E. Wieman
S. Salehi
Nicholas Haber
AI4CEELM
248
71
0
12 Oct 2023
Large Language Models can Learn Rules
Large Language Models can Learn Rules
Zhaocheng Zhu
Yuan Xue
Xinyun Chen
Denny Zhou
Jian Tang
Dale Schuurmans
Hanjun Dai
LRMReLM
369
91
0
10 Oct 2023
Chain of Natural Language Inference for Reducing Large Language Model
  Ungrounded Hallucinations
Chain of Natural Language Inference for Reducing Large Language Model Ungrounded Hallucinations
Deren Lei
Yaxi Li
Mengya Hu
Mingyu Wang
Vincent Yun
Emily Ching
Eslam Kamal
HILMLRM
364
60
0
06 Oct 2023
Evaluating Hallucinations in Chinese Large Language Models
Evaluating Hallucinations in Chinese Large Language Models
Qinyuan Cheng
Tianxiang Sun
Wenwei Zhang
Siyin Wang
Xiangyang Liu
...
Junliang He
Mianqiu Huang
Zhangyue Yin
Kai Chen
Xipeng Qiu
HILMELM
317
44
0
05 Oct 2023
Dodo: Dynamic Contextual Compression for Decoder-only LMs
Dodo: Dynamic Contextual Compression for Decoder-only LMsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Guanghui Qin
Corby Rosset
Ethan C. Chau
Nikhil Rao
Benjamin Van Durme
264
19
0
03 Oct 2023
Large Language Models Cannot Self-Correct Reasoning Yet
Large Language Models Cannot Self-Correct Reasoning YetInternational Conference on Learning Representations (ICLR), 2023
Jie Huang
Xinyun Chen
Swaroop Mishra
Huaixiu Steven Zheng
Adams Wei Yu
Xinying Song
Denny Zhou
ReLMLRM
742
819
0
03 Oct 2023
AutoHall: Automated Factuality Hallucination Dataset Generation for Large Language Models
AutoHall: Automated Factuality Hallucination Dataset Generation for Large Language ModelsIEEE Transactions on Audio, Speech, and Language Processing (IEEE TASLP), 2023
Zouying Cao
Yifei Yang
Hai Zhao
Hai Zhao
HILM
676
12
0
30 Sep 2023
Quantifying and Attributing the Hallucination of Large Language Models
  via Association Analysis
Quantifying and Attributing the Hallucination of Large Language Models via Association Analysis
Li Du
Yequan Wang
Xingrun Xing
Yiqun Ya
Xiang Li
Xin Jiang
Xuezhi Fang
HILM
251
23
0
11 Sep 2023
Are Emergent Abilities in Large Language Models just In-Context
  Learning?
Are Emergent Abilities in Large Language Models just In-Context Learning?Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Sheng Lu
Irina Bigoulaeva
Rachneet Sachdeva
Harish Tayyar Madabushi
Iryna Gurevych
LRMELMReLM
484
151
0
04 Sep 2023
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language ModelsComputational Linguistics (CL), 2023
Yue Zhang
Yafu Li
Leyang Cui
Deng Cai
Lemao Liu
...
Longyue Wang
Anh Tuan Luu
Freda Shi
Shuming Shi
Shuming Shi
LRMRALMHILM
851
953
0
03 Sep 2023
Leveraging Explainable AI to Analyze Researchers' Aspect-Based Sentiment
  about ChatGPT
Leveraging Explainable AI to Analyze Researchers' Aspect-Based Sentiment about ChatGPTInternational Conference on Intelligent Human Computer Interaction (IHCI), 2023
S. Lakhanpal
Ajay Gupta
R. Agrawal
264
2
0
16 Aug 2023
RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder
  Language Models
RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder Language Models
Jie Huang
Ming-Yu Liu
Peng Xu
Mohammad Shoeybi
Kevin Chen-Chuan Chang
Bryan Catanzaro
RALM
281
45
0
15 Aug 2023
Through the Lens of Core Competency: Survey on Evaluation of Large
  Language Models
Through the Lens of Core Competency: Survey on Evaluation of Large Language ModelsChina National Conference on Chinese Computational Linguistics (CNCCL), 2023
Ziyu Zhuang
Qiguang Chen
Longxuan Ma
Mingda Li
Yi Han
Yushan Qian
Haopeng Bai
Zixian Feng
Weinan Zhang
Ting Liu
ELM
221
24
0
15 Aug 2023
The Hitchhiker's Guide to Program Analysis: A Journey with Large
  Language Models
The Hitchhiker's Guide to Program Analysis: A Journey with Large Language Models
Haonan Li
Yu Hao
Yizhuo Zhai
Zhiyun Qian
LLMAG
273
37
0
01 Aug 2023
Citation: A Key to Building Responsible and Accountable Large Language
  Models
Citation: A Key to Building Responsible and Accountable Large Language Models
Jie Huang
Kevin Chen-Chuan Chang
HILM
402
33
0
05 Jul 2023
Towards Reasoning in Large Language Models: A Survey
Towards Reasoning in Large Language Models: A SurveyAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Jie Huang
Kevin Chen-Chuan Chang
LM&MAELMLRM
1.3K
872
0
20 Dec 2022
Can Language Models Be Specific? How?
Can Language Models Be Specific? How?Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Jie Huang
Kevin Chen-Chuan Chang
Jinjun Xiong
Wen-mei W. Hwu
241
9
0
11 Oct 2022
1
Page 1 of 1