Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs
arXiv:2407.21771

31 July 2024
Shiping Liu, Kecheng Zheng, Wei Chen
Tags: MLLM

Papers citing "Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs"

Showing 50 of 71 citing papers.
Through the Magnifying Glass: Adaptive Perception Magnification for Hallucination-Free VLM Decoding
Shunqi Mao, Chaoyi Zhang, Weidong Cai
Tags: MLLM
10 Apr 2026

V-ITI: Mitigating Hallucinations in Multimodal Large Language Models via Visual Inference-Time Intervention
Nan Sun, Zhenyu Zhang, Xixun Lin, Kun Wang, Yanmin Shang, ..., Shuohuan Wang, Yu Sun, H. Wu, Haifeng Wang, Yanan Cao
Tags: MLLM, VLM
03 Dec 2025

Med-VCD: Mitigating Hallucination for Medical Large Vision Language Models through Visual Contrastive Decoding (Computers in Biology and Medicine, 2025)
Zahra Mahdavi, Zahra Khodakaramimaghsoud, Hooman Khaloo, Sina Bakhshandeh Taleshani, Erfan Hashemi, Javad Mirzapour Kaleybar, Omid Nejati Manzari
Tags: MLLM, VLM
01 Dec 2025

Tell Model Where to Look: Mitigating Hallucinations in MLLMs by Vision-Guided Attention
Jianfei Zhao, Feng Zhang, Xin Sun, Chong Feng, Zhixing Tan
Tags: MLLM, LRM
25 Nov 2025

Intervene-All-Paths: Unified Mitigation of LVLM Hallucinations across Alignment Formats
Jiaye Qian, Ge Zheng, Yuchen Zhu, Sibei Yang
Tags: MLLM
21 Nov 2025

Adaptive Residual-Update Steering for Low-Overhead Hallucination Mitigation in Large Vision Language Models
Zhengtao Zou, Ya Gao, Jiarui Guan, Bin Li, Pekka Marttinen
13 Nov 2025

Capturing Gaze Shifts for Guidance: Cross-Modal Fusion Enhancement for VLM Hallucination Mitigation
Zheng Qi, Chao Shang, Evangelia Spiliopoulou, Nikolaos Pappas
24 Oct 2025

Why LVLMs Are More Prone to Hallucinations in Longer Responses: The Role of Context
Ge Zheng, Jiaye Qian, Jiajin Tang, Sibei Yang
23 Oct 2025

PruneHal: Reducing Hallucinations in Multi-modal Large Language Models through Adaptive KV Cache Pruning
Fengyuan Sun, Hui Chen, Xinhao Xu, Dandan Zheng, Jingdong Chen, Jun Zhou, Jungong Han, Guiguang Ding
Tags: VLM
22 Oct 2025

Seeing but Not Believing: Probing the Disconnect Between Visual Attention and Answer Correctness in VLMs
Zhining Liu, Ziyi Chen, Hui Liu, Chen Luo, Xianfeng Tang, ..., Zhenwei Dai, Zhan Shi, Tianxin Wei, Benoit Dumoulin, Hanghang Tong
Tags: LRM
20 Oct 2025

Watermarking for Factuality: Guiding Vision-Language Models Toward Truth via Tri-layer Contrastive Decoding
Kyungryul Back, Seongbeom Park, Milim Kim, Mincheol Kwon, SangHyeok Lee, Hyunyoung Lee, Junhee Cho, Seunghyun Park, Jinkyu Kim
16 Oct 2025

Zero-Shot Fine-Grained Image Classification Using Large Vision-Language Models
Md. Atabuzzaman, Andrew Zhang, Chris Thomas
Tags: MLLM, VLM
04 Oct 2025

MaskCD: Mitigating LVLM Hallucinations by Image Head Masked Contrastive Decoding
Jingyuan Deng, Yujiu Yang
Tags: MLLM
03 Oct 2025

CCD: Mitigating Hallucinations in Radiology MLLMs via Clinical Contrastive Decoding
Xi Zhang, Zaiqiao Meng, Jake Lever, Edmond S. L. Ho
Tags: MedIm
27 Sep 2025

Understanding Language Prior of LVLMs by Contrasting Chain-of-Embedding
Lin Long, Changdae Oh, Seongheon Park, Yixuan Li
Tags: VLM, MLLM
27 Sep 2025

Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow
Xinlei Yu, C. Xu, Guibin Zhang, Yongbo He, Zhangquan Chen, ..., Jiangning Zhang, Yue Liao, Xiaobin Hu, Yu-Gang Jiang, Shuicheng Yan
26 Sep 2025

Pay More Attention To Audio: Mitigating Imbalance of Cross-Modal Attention in Large Audio Language Models
Junyu Wang, Ziyang Ma, Zhengding Luo, Tianrui Wang, Meng Ge, Xiaobao Wang, Longbiao Wang
Tags: AuLLM
23 Sep 2025

Cross-Layer Vision Smoothing: Enhancing Visual Understanding via Sustained Focus on Key Objects in Large Vision-Language Models
Jianfei Zhao, Feng Zhang, Xin Sun, Lingxing Kong, Zhixing Tan
16 Sep 2025

ChartGaze: Enhancing Chart Understanding in LVLMs with Eye-Tracking Guided Attention Refinement
Ali Salamatian, Amirhossein Abaskohi, Wan-Cyuan Fan, Mir Rayat Imtiaz Hossain, Leonid Sigal, Giuseppe Carenini
16 Sep 2025

Tracing and Mitigating Hallucinations in Multimodal LLMs via Dynamic Attention Localization
Tiancheng Yang, L. Zhang, J. Lin, Guimin Hu, Haiyan Zhao, Lijie Hu
09 Sep 2025

SPECS: Specificity-Enhanced CLIP-Score for Long Image Caption Evaluation
Xiaofu Chen, Israfel Salazar, Yova Kementchedjhieva
04 Sep 2025

Unveiling the Response of Large Vision-Language Models to Visually Absent Tokens
Sohee Kim, Soohyun Ryu, Joonhyung Park, Eunho Yang
03 Sep 2025

Mitigating Multimodal Hallucinations via Gradient-based Self-Reflection
Shan Wang, Maying Shen, Nadine Chang, Chuong H. Nguyen, Hongdong Li, J. Álvarez
03 Sep 2025

OmniDPO: A Preference Optimization Framework to Address Omni-Modal Hallucination
Junzhe Chen, Tianshu Zhang, Shiyu Huang, Yuwei Niu, Chao Sun, Rongzhou Zhang, G. Zhou, Lijie Wen, Xuming Hu
Tags: MLLM
31 Aug 2025

GLSim: Detecting Object Hallucinations in LVLMs via Global-Local Similarity
Seongheon Park, Yixuan Li
27 Aug 2025

Benchmarking and Bridging Emotion Conflicts for Multimodal Emotion Reasoning
Zhiyuan Han, Beier Zhu, Yanlong Xu, Peipei Song, Xun Yang
02 Aug 2025

MIHBench: Benchmarking and Mitigating Multi-Image Hallucinations in Multimodal Large Language Models
Jiale Li, Mingrui Wu, Zixiang Jin, Hao Chen, Jinfa Huang, Xiaoshuai Sun, Liujuan Cao, Rongrong Ji
Tags: VLM
01 Aug 2025

TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs
Kejia Zhang, Keda Tao, Zhiming Luo, Chang Liu, Jiasheng Tang, Huan Wang
29 Jul 2025

OW-CLIP: Data-Efficient Visual Supervision for Open-World Object Detection via Human-AI Collaboration
Junwen Duan, Wei Xue, Ziyao Kang, Shixia Liu, Jiazhi Xia
Tags: VLM
26 Jul 2025

MCA-LLaVA: Manhattan Causal Attention for Reducing Hallucination in Large Vision-Language Models
Qiyan Zhao, Xiaofeng Zhang, Yiheng Li, Yun Xing, Xiaosong Yuan, Feilong Tang, Sinan Fan, Xuhang Chen, Xuyao Zhang, Dahan Wang
12 Jul 2025

Not All Attention Heads Are What You Need: Refining CLIP's Image Representation with Attention Ablation
Feng Lin, Marco Chen, Haokui Zhang, Xiaotian Yu, Guangming Lu, Rong Xiao
01 Jul 2025

ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM
Yujun Wang, Aniri, Jinhe Bi, Soeren Pirk, Yunpu Ma
Tags: MLLM
17 Jun 2025

Revisit What You See: Disclose Language Prior in Vision Tokens for LVLM Decoding
Beomsik Cho, Jaehyung Kim
11 Jun 2025

CoMemo: LVLMs Need Image Context with Image Memory
Shi-Qi Liu, Weijie Su, Xizhou Zhu, Wenhai Wang, Jifeng Dai
Tags: VLM
06 Jun 2025

LLMs Can Compensate for Deficiencies in Visual Representations
Sho Takishita, Jay Gala, Abdelrahman Mohamed, Kentaro Inui, Yova Kementchedjhieva
Tags: VLM
05 Jun 2025

Mitigating Hallucinations in Large Vision-Language Models via Entity-Centric Multimodal Preference Optimization
Jiulong Wu, Zhengliang Shi, Shuaiqiang Wang, J. Huang, Dawei Yin, Lingyong Yan, Min Cao, Min Zhang
Tags: MLLM
04 Jun 2025

CLAIM: Mitigating Multilingual Object Hallucination in Large Vision-Language Models with Cross-Lingual Attention Intervention (Annual Meeting of the Association for Computational Linguistics (ACL), 2025)
Zekai Ye, Qiming Li, Xiaocheng Feng, L. Qin, Yichong Huang, ..., Zhirui Zhang, Yunfei Lu, Duyu Tang, Dandan Tu, Bing Qin
Tags: VLM, LRM
03 Jun 2025

BIMA: Bijective Maximum Likelihood Learning Approach to Hallucination Prediction and Mitigation in Large Vision-Language Models
Huu-Thien Tran, Thanh-Dat Truong, Khoa Luu
Tags: MLLM
30 May 2025

Qwen Look Again: Guiding Vision-Language Reasoning Models to Re-attention Visual Information
Xu Chu, Xinrong Chen, Guanyu Wang, Zhijie Tan, Kui Huang, Wenyu Lv, Tong Mo, Weiping Li
Tags: LRM, VLM
29 May 2025

Mitigating Hallucination in Large Vision-Language Models via Adaptive Attention Calibration
Mehrdad Fazli, Bowen Wei, Ahmet Sari, Ziwei Zhu
Tags: VLM
27 May 2025

MLLMs are Deeply Affected by Modality Bias
Xu Zheng, Chenfei Liao, Yuqian Fu, Kaiyu Lei, Yuanhuiyi Lyu, ..., Yu Jiang, Andrii Zadaianchuk, Dacheng Tao, Luc Van Gool, Xuming Hu
24 May 2025

Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding (Computer Vision and Pattern Recognition (CVPR), 2025)
Feilong Tang, Chengzhi Liu, Zhongxing Xu, Ming Hu, Zelin Peng, ..., Minquan Lin, Yifan Peng, Xuelian Cheng, Imran Razzak, Zongyuan Ge
22 May 2025

Mitigating Hallucinations in Vision-Language Models through Image-Guided Head Suppression
Sreetama Sarkar, Yue Che, Alex Gavin, Peter A. Beerel, Souvik Kundu
Tags: MLLM, VLM
22 May 2025

Make LVLMs Focus: Context-Aware Attention Modulation for Better Multimodal In-Context Learning
Yanshu Li, JianJiang Yang, Ziteng Yang, Bozheng Li, Yi Cao, ..., Ligong Han, Yingjie Victor Chen, Songlin Fei, Dongfang Liu, Ruixiang Tang
21 May 2025

How Do Large Vision-Language Models See Text in Image? Unveiling the Distinctive Role of OCR Heads
Ingeol Baek, Hwan Chang, Sunghyun Ryu, Hwanhee Lee
21 May 2025

ZeroTuning: Unlocking the Initial Token's Power to Enhance Large Language Models Without Training
Feijiang Han, Xiaodong Yu, Jianheng Tang, Delip Rao, Weihua Du, Lyle Ungar
16 May 2025

Decoupling Contrastive Decoding: Robust Hallucination Mitigation in Multimodal Large Language Models
Wei Chen, Xin Yan, Bin Wen, Fan Yang, Yan Li, Di Zhang, Long Chen
Tags: MLLM
09 Apr 2025

Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation
Hongcheng Gao, Jiashu Qu, Jingyi Tang, Baolong Bi, Yi Liu, Hongyu Chen, Li Liang, Li Su, Qingming Huang
Tags: MLLM, VLM, LRM
25 Mar 2025

Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations
Shuo Li, Jiajun Sun, Guodong Zheng, Xiaoran Fan, Yujiong Shen, ..., Wenming Tan, Changzhi Sun, Tao Gui, Qi Zhang
Tags: AAML, VLM
19 Mar 2025

Grounded Chain-of-Thought for Multimodal Large Language Models
Qiong Wu, Xiangcong Yang, Weihao Ye, Chenxin Fang, Baiyang Song, Xiaoshuai Sun, Rongrong Ji
Tags: LRM
17 Mar 2025

Page 1 of 2