Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models
Versions: v1, v2, v3, v4 (latest)

30 June 2024
Weihong Zhong
Xiaocheng Feng
Liang Zhao
Qiming Li
Lei Huang
Yuxuan Gu
Weitao Ma
Yuan Xu
Bing Qin
    MLLM

Papers citing "Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models"

15 / 15 papers shown
Suppressing VLM Hallucinations with Spectral Representation Filtering
Ameen Ali
Tamim Zoabi
Lior Wolf
177
0
0
15 Nov 2025
ImaGGen: Zero-Shot Generation of Co-Speech Semantic Gestures Grounded in Language and Image Input
Hendric Voss
Stefan Kopp
SLR
330
0
0
20 Oct 2025
MedMMV: A Controllable Multimodal Multi-Agent Framework for Reliable and Verifiable Clinical Reasoning
Hongjun Liu
Yinghao Zhu
Y Samuel Wang
Yitao Long
Zeyu Lai
Lequan Yu
Chen Zhao
LRM
228
3
0
29 Sep 2025
Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow
Xinlei Yu
C. Xu
Guibin Zhang
Yongbo He
Zhangquan Chen
...
Jiangning Zhang
Yue Liao
Xiaobin Hu
Yu-Gang Jiang
Shuicheng Yan
315
10
0
26 Sep 2025
Look Again, Think Slowly: Enhancing Visual Reflection in Vision-Language Models
Pu Jian
Junhong Wu
Wei Sun
Chen Wang
Shuo Ren
Jiajun Zhang
LRM
173
13
0
15 Sep 2025
OmniDPO: A Preference Optimization Framework to Address Omni-Modal Hallucination
Junzhe Chen
Tianshu Zhang
Shiyu Huang
Yuwei Niu
Chao Sun
Rongzhou Zhang
G. Zhou
Lijie Wen
Xuming Hu
MLLM
225
4
0
31 Aug 2025
Empowering Multimodal LLMs with External Tools: A Comprehensive Survey
Wenbin An
Jiahao Nie
Yaqiang Wu
Feng Tian
Shijian Lu
Q. Zheng
MLLM
252
1
0
14 Aug 2025
Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding
Computer Vision and Pattern Recognition (CVPR), 2025
Feilong Tang
Chengzhi Liu
Zhongxing Xu
Ming Hu
Zelin Peng
...
Minquan Lin
Yifan Peng
Xuelian Cheng
Imran Razzak
Zongyuan Ge
423
35
0
22 May 2025
TARAC: Mitigating Hallucination in LVLMs via Temporal Attention Real-time Accumulative Connection
C. Xie
Tongxuan Liu
Lei Jiang
Yuting Zeng
Jinpei Guo
Yunheng Shen
Weizhe Huang
Jing Li
Xiaohua Xu
VLM LRM
280
7
0
05 Apr 2025
TPC: Cross-Temporal Prediction Connection for Vision-Language Model Hallucination Reduction
Chao Wang
Weiwei Fu
Yang Zhou
MLLM VLM
449
4
0
06 Mar 2025
Exploring Causes and Mitigation of Hallucinations in Large Vision Language Models
Yaqi Sun
Kyohei Atarashi
Koh Takeuchi
Hisashi Kashima
MLLM
250
1
0
24 Feb 2025
Visual Attention Never Fades: Selective Progressive Attention ReCalibration for Detailed Image Captioning in Multimodal Large Language Models
Mingi Jung
Saehyung Lee
Eunji Kim
Sungroh Yoon
1.1K
15
0
03 Feb 2025
TEOChat: A Large Vision-Language Assistant for Temporal Earth Observation Data
International Conference on Learning Representations (ICLR), 2024
Jeremy Irvin
Emily Ruoyu Liu
Joyce Chuyi Chen
Ines Dormoy
Jinyoung Kim
Samar Khanna
Zhuo Zheng
Stefano Ermon
MLLM VLM
486
53
0
28 Jan 2025
ICT: Image-Object Cross-Level Trusted Intervention for Mitigating Object Hallucination in Large Vision-Language Models
Computer Vision and Pattern Recognition (CVPR), 2024
Junzhe Chen
Tianshu Zhang
Shijie Huang
Yuwei Niu
Linfeng Zhang
Lijie Wen
Xuming Hu
MLLM VLM
1.1K
18
0
22 Nov 2024
Hallucination of Multimodal Large Language Models: A Survey
Zechen Bai
Pichao Wang
Tianjun Xiao
Tong He
Zongbo Han
Zheng Zhang
Mike Zheng Shou
VLM LRM
805
351
0
29 Apr 2024