Certainly Uncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness

2 July 2024
Khyathi Raghavi Chandu, Linjie Li, Anas Awadalla, Ximing Lu, Jae Sung Park, Jack Hessel, Lijuan Wang, Yejin Choi

Papers citing "Certainly Uncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness"

9 / 9 papers shown
MAQA: Evaluating Uncertainty Quantification in LLMs Regarding Data Uncertainty
Yongjin Yang, Haneul Yoo, Hwaran Lee
13 Aug 2024

Hallucination of Multimodal Large Language Models: A Survey
Zechen Bai, Pichao Wang, Tianjun Xiao, Tong He, Zongbo Han, Zheng Zhang, Mike Zheng Shou
VLM, LRM
29 Apr 2024

Multi-Modal Hallucination Control by Visual Information Grounding
Alessandro Favero, L. Zancato, Matthew Trager, Siddharth Choudhary, Pramuditha Perera, Alessandro Achille, Ashwin Swaminathan, Stefano Soatto
MLLM
20 Mar 2024

Hallucination Detection and Hallucination Mitigation: An Investigation
Junliang Luo, Tianyu Li, Di Wu, Michael R. M. Jenkin, Steve Liu, Gregory Dudek
HILM, LLMAG
16 Jan 2024

Silkie: Preference Distillation for Large Visual Language Models
Lei Li, Zhihui Xie, Mukai Li, Shunian Chen, Peiyi Wang, Liang Chen, Yazheng Yang, Benyou Wang, Lingpeng Kong
MLLM
17 Dec 2023

Post-Abstention: Towards Reliably Re-Attempting the Abstained Instances in QA
Neeraj Varshney, Chitta Baral
02 May 2023

SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
Potsawee Manakul, Adian Liusie, Mark J. F. Gales
HILM, LRM
15 Mar 2023

Hallucinated but Factual! Inspecting the Factuality of Hallucinations in Abstractive Summarization
Mengyao Cao, Yue Dong, Jackie C.K. Cheung
HILM
30 Aug 2021

A Token-level Reference-free Hallucination Detection Benchmark for Free-form Text Generation
Tianyu Liu, Yizhe Zhang, Chris Brockett, Yi Mao, Zhifang Sui, Weizhu Chen, W. Dolan
HILM
18 Apr 2021