Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2310.12086
Cited By
FactCHD: Benchmarking Fact-Conflicting Hallucination Detection
18 October 2023
Xiang Chen
Duanzheng Song
Honghao Gui
Chengxi Wang
Ningyu Zhang
Jiang Yong
Fei Huang
Chengfei Lv
Dan Zhang
Huajun Chen
HILM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"FactCHD: Benchmarking Fact-Conflicting Hallucination Detection"
15 / 15 papers shown
Title
SemEval-2025 Task 3: Mu-SHROOM, the Multilingual Shared Task on Hallucinations and Related Observable Overgeneration Mistakes
Raúl Vázquez
Timothee Mickus
Elaine Zosa
Teemu Vahtola
Jörg Tiedemann
...
Liane Guillou
Ona de Gibert
Jaione Bengoetxea
Joseph Attieh
Marianna Apidianaki
HILM
VLM
LRM
80
0
0
16 Apr 2025
C-FAITH: A Chinese Fine-Grained Benchmark for Automated Hallucination Evaluation
Xu Zhang
Zhifei Liu
Jiahao Wang
Huixuan Zhang
Fan Xu
Junzhe Zhang
Xiaojun Wan
HILM
29
0
0
14 Apr 2025
SelfCheckAgent: Zero-Resource Hallucination Detection in Generative Large Language Models
Diyana Muhammed
Gollam Rabby
Sören Auer
LLMAG
HILM
74
0
0
03 Feb 2025
FaithBench: A Diverse Hallucination Benchmark for Summarization by Modern LLMs
F. S. Bao
Miaoran Li
Renyi Qu
Ge Luo
Erana Wan
...
Ruixuan Tu
Chenyu Xu
Matthew Gonzales
Ofer Mendelevitch
Amin Ahmad
VLM
HILM
15
2
0
17 Oct 2024
MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation
Chenxi Wang
Xiang Chen
N. Zhang
Bozhong Tian
Haoming Xu
Shumin Deng
H. Chen
MLLM
LRM
29
4
0
15 Oct 2024
FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"
Yifei Ming
Senthil Purushwalkam
Shrey Pandit
Zixuan Ke
Xuan-Phi Nguyen
Caiming Xiong
Shafiq R. Joty
HILM
110
16
0
30 Sep 2024
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models
Yuzhe Gu
Ziwei Ji
Wenwei Zhang
Chengqi Lyu
Dahua Lin
Kai Chen
HILM
34
5
0
05 Jul 2024
Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs
Junjie Wang
Mingyang Chen
Binbin Hu
Dan Yang
Ziqi Liu
...
Jinjie Gu
Jun Zhou
Jeff Z. Pan
Wen Zhang
Huajun Chen
RALM
26
12
0
20 Jun 2024
AILS-NTUA at SemEval-2024 Task 6: Efficient model tuning for hallucination detection and analysis
Natalia Griogoriadou
Maria Lymperaiou
Giorgos Filandrianos
Giorgos Stamou
VLM
22
0
0
01 Apr 2024
Unified Hallucination Detection for Multimodal Large Language Models
Xiang Chen
Chenxi Wang
Yida Xue
Ningyu Zhang
Xiaoyan Yang
Qian Li
Yue Shen
Lei Liang
Jinjie Gu
Huajun Chen
HILM
28
38
0
05 Feb 2024
C-RAG: Certified Generation Risks for Retrieval-Augmented Language Models
Mintong Kang
Nezihe Merve Gürel
Ning Yu
D. Song
Bo-wen Li
76
20
0
05 Feb 2024
Alleviating Hallucinations of Large Language Models through Induced Hallucinations
Yue Zhang
Leyang Cui
Wei Bi
Shuming Shi
HILM
34
49
0
25 Dec 2023
RJUA-QA: A Comprehensive QA Dataset for Urology
Shiwei Lyu
Chenfei Chi
Hongbo Cai
Lei Shi
Xiaoyan Yang
...
Xiaowei Ma
Yue Shen
Jinjie Gu
Wei Xue
Yiran Huang
LM&MA
26
3
0
15 Dec 2023
Resolving Knowledge Conflicts in Large Language Models
Yike Wang
Shangbin Feng
Heng Wang
Weijia Shi
Vidhisha Balachandran
Tianxing He
Yulia Tsvetkov
48
12
0
02 Oct 2023
Understanding Factuality in Abstractive Summarization with FRANK: A Benchmark for Factuality Metrics
Artidoro Pagnoni
Vidhisha Balachandran
Yulia Tsvetkov
HILM
215
305
0
27 Apr 2021
1