Hallucination of Multimodal Large Language Models: A Survey

29 April 2024

Tianjun Xiao

Zheng Zhang

Papers citing "Hallucination of Multimodal Large Language Models: A Survey"

50 / 334 papers shown

OViP: Online Vision-Language Preference Learning for VLM Hallucination

317

21 May 2025

Incentivizing Truthful Language Models via Peer Elicitation Games

368

19 May 2025

Mixture of Decoding: An Attention-Inspired Adaptive Decoding Strategy to Mitigate Hallucinations in Large Vision-Language Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

360

17 May 2025

Hallucination-Aware Multimodal Benchmark for Gastrointestinal Image Analysis with Large Vision-Language Models

International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025

317

11 May 2025

Perceiving Beyond Language Priors: Enhancing Visual Comprehension and Attention in Multimodal Models

Aarti Ghatkesar

Uddeshya Upadhyay

VLM

390

08 May 2025

Characterizing the Robustness of Black-Box LLM Planners Under Perturbed Observations with Adaptive Stress Testing

Neeloy Chakraborty

John Pohovey

Melkior Ornik

Katherine Driggs-Campbell

360

08 May 2025

VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations on Synthetic Video Understanding

Jordan Lee Boyd-Graber

MLLM

649

02 May 2025

Multimodal Large Language Models for Medicine: A Comprehensive Survey

Jiarui Ye

Hao Tang

LM&MA

501

29 Apr 2025

HyPerAlign: Interpretable Personalized LLM Alignment via Hypothesis Generation

Cristina Garbacea

Chenhao Tan

444

29 Apr 2025

TRACE: Textual Relevance Augmentation and Contextual Encoding for Multimodal Hate Detection

339

24 Apr 2025

Multimodal Large Language Models for Enhanced Traffic Safety: A Comprehensive Review and Future Trends

M. Tami

Mohammed Elhenawy

Huthaifa I. Ashqar

334

21 Apr 2025

Hydra: An Agentic Reasoning Approach for Enhancing Adversarial Robustness and Mitigating Hallucinations in Vision-Language Models

293

19 Apr 2025

Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training

515

17 Apr 2025

AeroLite: Tag-Guided Lightweight Generation of Aerial Image Captions

161

13 Apr 2025

Data Metabolism: An Efficient Data Design Schema For Vision Language Model

386

10 Apr 2025

Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception

Computer Vision and Pattern Recognition (CVPR), 2025

209

09 Apr 2025

Decoupling Contrastive Decoding: Robust Hallucination Mitigation in Multimodal Large Language Models

459

09 Apr 2025

Explaining Low Perception Model Competency with High-Competency Counterfactuals

Sara Pohland

Claire Tomlin

DiffM AAML

293

07 Apr 2025

TARAC: Mitigating Hallucination in LVLMs via Temporal Attention Real-time Accumulative Connection

238

05 Apr 2025

Towards Trustworthy GUI Agents: A Survey

291

30 Mar 2025

AutoComPose: Automatic Generation of Pose Transition Descriptions for Composed Pose Retrieval Using Multimodal LLMs

Shuvra S. Bhattacharyya

370

28 Mar 2025

Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation

438

25 Mar 2025

Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations

...

388

19 Mar 2025

Do Multimodal Large Language Models Understand Welding?

Information Fusion (Inf. Fusion), 2025

229

18 Mar 2025

Can Large Vision Language Models Read Maps Like a Human?

391

18 Mar 2025

Uncertainty Distillation: Teaching Language Models to Express Semantic Confidence

464

18 Mar 2025

RAG-KG-IL: A Multi-Agent Hybrid Framework for Reducing Hallucinations and Enhancing LLM Reasoning through RAG and Incremental Knowledge Graph Learning Integration

Hong Qing Yu

Frank McQuade

282

14 Mar 2025

Taxonomic Reasoning for Rare Arthropods: Combining Dense Image Captioning and RAG for Interpretable Classification

322

13 Mar 2025

ExtremeAIGC: Benchmarking LMM Vulnerability to AI-Generated Extremist Content

Bhavik Chandna

Mariam Aboujenane

Usman Naseem

276

13 Mar 2025

Attention Hijackers: Detect and Disentangle Attention Hijacking in LVLMs for Hallucination Mitigation

544

11 Mar 2025

Hallucinatory Image Tokens: A Training-free EAZY Approach on Detecting and Mitigating Object Hallucinations in LVLMs

411

10 Mar 2025

PerturboLLaVA: Reducing Multimodal Hallucinations with Perturbative Visual Training

International Conference on Learning Representations (ICLR), 2025

293

09 Mar 2025

TPC: Cross-Temporal Prediction Connection for Vision-Language Model Hallucination Reduction

348

06 Mar 2025

MCiteBench: A Multimodal Benchmark for Generating Text with Citations

456

04 Mar 2025

Evaluating and Predicting Distorted Human Body Parts for Generated Images

345

02 Mar 2025

HalCECE: A Framework for Explainable Hallucination Detection through Conceptual Counterfactuals in Image Captioning

Maria Lymperaiou

Giorgos Filandrianos

Angeliki Dimitriou

Athanasios Voulodimos

Giorgos Stamou

MLLM

210

01 Mar 2025

Octopus: Alleviating Hallucination via Dynamic Contrastive Decoding

Computer Vision and Pattern Recognition (CVPR), 2025

295

01 Mar 2025

Towards Statistical Factuality Guarantee for Large Vision-Language Models

347

27 Feb 2025

FilterRAG: Zero-Shot Informed Retrieval-Augmented Generation to Mitigate Hallucinations in VQA

S M Sarwar

463

25 Feb 2025

Mitigating Hallucinations in Diffusion Models through Adaptive Attention Modulation

Trevine Oorloff

Yaser Yacoob

Abhinav Shrivastava

196

24 Feb 2025

LOVA3: Learning to Visual Question Answering, Asking and Assessment

Neural Information Processing Systems (NeurIPS), 2024

417

21 Feb 2025

Mitigating Hallucinations in Large Vision-Language Models via Summary-Guided Decoding

North American Chapter of the Association for Computational Linguistics (NAACL), 2024

465

20 Feb 2025

Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

...

567

20 Feb 2025

Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization

520

18 Feb 2025

Mitigating Visual Knowledge Forgetting in MLLM Instruction-tuning via Modality-decoupled Gradient Descent

...

346

17 Feb 2025

Valuable Hallucinations: Realizable Non-realistic Propositions

Qiucheng Chen

Bo Wang

LRM

309

16 Feb 2025

MRAMG-Bench: A Comprehensive Benchmark for Advancing Multimodal Retrieval-Augmented Multimodal Generation

Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025

905

06 Feb 2025

Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning

573

05 Feb 2025

The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering

259

05 Feb 2025

Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Models

488

03 Feb 2025