2411.10436
Mitigating Hallucination in Multimodal Large Language Model via Hallucination-targeted Direct Preference Optimization
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
15 November 2024
Yuhan Fu
Ruobing Xie
Xingwu Sun
Zhanhui Kang
Xirong Li
MLLM
Papers citing
"Mitigating Hallucination in Multimodal Large Language Model via Hallucination-targeted Direct Preference Optimization"
9 / 9 papers shown
RL makes MLLMs see better than SFT
Junha Song
Sangdoo Yun
Dongyoon Han
Jaegul Choo
Byeongho Heo
OffRL
171
0
0
18 Oct 2025
Empowering Multimodal LLMs with External Tools: A Comprehensive Survey
Wenbin An
Jiahao Nie
Yaqiang Wu
Feng Tian
Shijian Lu
Q. Zheng
MLLM
166
1
0
14 Aug 2025
Fine-Grained Preference Optimization Improves Spatial Reasoning in VLMs
Yifan Shen
Yuanzhe Liu
Jingyuan Zhu
Xu Cao
Xiaofeng Zhang
Yixiao He
Wenming Ye
James M. Rehg
Ismini Lourentzou
LRM
122
3
0
26 Jun 2025
Preemptive Hallucination Reduction: An Input-Level Approach for Multimodal Language Model
Nokimul Hasan Arif
Shadman Rabby
Md Hefzul Hossain Papon
Sabbir Ahmed
MLLM
VLM
275
0
0
29 May 2025
NEXT: Multi-Grained Mixture of Experts via Text-Modulation for Multi-Modal Object Re-Identification
Shihao Li
Chenglong Li
Aihua Zheng
Andong Lu
Jin Tang
510
1
0
26 May 2025
Aligning Multimodal LLM with Human Preference: A Survey
Tao Yu
Yujiao Shi
Chaoyou Fu
Junkang Wu
Jinda Lu
...
Qingsong Wen
Zheng Zhang
Yan Huang
Liang Wang
Tieniu Tan
781
12
0
18 Mar 2025
Grounded Chain-of-Thought for Multimodal Large Language Models
Qiong Wu
Xiangcong Yang
Weihao Ye
Chenxin Fang
Baiyang Song
Xiaoshuai Sun
Rongrong Ji
LRM
409
23
0
17 Mar 2025
Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization
Shuo Xing
Peiran Li
Ruizheng Bai
Longji Xu
Chan-wei Hu
Chengxuan Qian
Huaxiu Yao
Zhengzhong Tu
449
18
0
18 Feb 2025
Hallucination of Multimodal Large Language Models: A Survey
Zechen Bai
Pichao Wang
Tianjun Xiao
Tong He
Zongbo Han
Zheng Zhang
Mike Zheng Shou
VLM
LRM
576
303
0
29 Apr 2024