arXiv: 2407.04121

Hallucination Detection: Robustly Discerning Reliable Answers in Large Language Models

4 July 2024
Yuyan Chen
Qiang Fu
Yichen Yuan
Zhihao Wen
Ge Fan
Dayiheng Liu
Dongmei Zhang
Zhixu Li
Yanghua Xiao
    HILM

Papers citing "Hallucination Detection: Robustly Discerning Reliable Answers in Large Language Models"

20 / 20 papers shown
Bounded PCTL Model Checking of Large Language Model Outputs
Dennis Gross
Helge Spieker
A. Gotlieb
85
0
0
23 Sep 2025
Confidence-Aware Routing for Large Language Model Reliability Enhancement: A Multi-Signal Approach to Pre-Generation Hallucination Mitigation
Nandakishor M
HILM
60
0
0
23 Sep 2025
Control the Temperature: Selective Sampling for Diverse and High-Quality LLM Outputs
S. Troshin
Wafaa Mohammed
Yan Meng
Christof Monz
Antske Fokkens
Vlad Niculae
84
2
0
20 Sep 2025
Decomposing and Revising What Language Models Generate
Zhichao Yan
Jiaoyan Chen
Jiapu Wang
Xiaoli Li
Ru Li
Jeff Z. Pan
HILM, ReLM
102
1
0
31 Aug 2025
Hallucination Detection and Mitigation with Diffusion in Multi-Variate Time-Series Foundation Models
Vijja Wichitwechkarn
Charles Fox
Ruchi Choudhary
AI4TS
96
0
0
23 Jul 2025
Interpretable LLMs for Credit Risk: A Systematic Review and Taxonomy
Muhammed Golec
Maha AlabdulJalil
183
1
0
04 Jun 2025
Evaluating Evaluation Metrics -- The Mirage of Hallucination Detection
Atharva Kulkarni
Yuan-kang Zhang
Joel Ruben Antony Moniz
Xiou Ge
Bo-Hsiang Tseng
Dhivya Piraviperumal
Siyang Song
Hong-ye Yu
HILM
279
5
0
25 Apr 2025
A Survey on Uncertainty Quantification of Large Language Models: Taxonomy, Open Research Challenges, and Future Directions
ACM Computing Surveys (ACM CSUR), 2024
Ola Shorinwa
Zhiting Mei
Justin Lidard
Allen Z. Ren
Anirudha Majumdar
HILM, LRM
319
19
0
07 Dec 2024
Probing LLM Hallucination from Within: Perturbation-Driven Approach via Internal Knowledge
Seongmin Lee
Hsiang Hsu
Chun-Fu Chen
Duen Horng
LRM
360
2
0
14 Nov 2024
Atomic Fact Decomposition Helps Attributed Question Answering
Zhichao Yan
Jiashuo Wang
Jiaoyan Chen
Xiaoli Li
Ru Li
Jeff Z. Pan
KELM, HILM
261
10
0
22 Oct 2024
Visual Agents as Fast and Slow Thinkers
International Conference on Learning Representations (ICLR), 2024
Guangyan Sun
Haoyang Ling
Zhenting Wang
Cheng-Long Wang
Siqi Ma
Qifan Wang
Ying Nian Wu
Dongfang Liu
LLMAG, LRM
418
40
0
16 Aug 2024
XMeCap: Meme Caption Generation with Sub-Image Adaptability
Yuyan Chen
Songzhou Yan
Zhihong Zhu
Zhixu Li
Yanghua Xiao
VLM
354
16
0
24 Jul 2024
CUPID: Improving Battle Fairness and Position Satisfaction in Online MOBA Games with a Re-matchmaking System
Ge Fan
Chaoyun Zhang
Kai Wang
Yingjie Li
Junyang Chen
Zenglin Xu
169
5
0
28 Jun 2024
A Survey of Language-Based Communication in Robotics
William Hunt
Sarvapali D. Ramchurn
Mohammad D. Soorati
LM&Ro
608
15
0
06 Jun 2024
Hallucination Detection in Foundation Models for Decision-Making: A Flexible Definition and Review of the State of the Art
Neeloy Chakraborty
Melkior Ornik
Katherine Driggs-Campbell
LRM
376
27
0
25 Mar 2024
The Human Factor in Detecting Errors of Large Language Models: A Systematic Literature Review and Future Research Directions
Christian A. Schiller
ALM
50
3
0
13 Mar 2024
AutoAttacker: A Large Language Model Guided System to Implement Automatic Cyber-attacks
Jiacen Xu
Jack W. Stokes
Geoff McDonald
Xuesong Bai
David Marshall
Siyue Wang
Adith Swaminathan
Zhou Li
215
93
0
02 Mar 2024
AuditLLM: A Tool for Auditing Large Language Models Using Multiprobe Approach
Maryam Amirizaniani
Elias Martin
Tanya Roosta
Aman Chadha
Chirag Shah
151
11
0
14 Feb 2024
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models
Yuanjie Lyu
Zhiyu Li
Pengnian Qi
Feiyu Xiong
Simin Niu
Wenjin Wang
Hao Wu
Huan Liu
Tong Xu
Enhong Chen
RALM
241
71
0
30 Jan 2024
Evaluating ChatGPT as a Recommender System: A Rigorous Approach
Dario Di Palma
Giovanni Maria Biancofiore
Vito Walter Anelli
Fedelucio Narducci
Tommaso Di Noia
E. Sciascio
ALM
287
36
0
07 Sep 2023