arXiv:2410.07167
Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate
9 October 2024
Qidong Huang, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Jiaqi Wang, Dahua Lin, Weiming Zhang, Nenghai Yu
Papers citing
"Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate"
2 / 2 papers shown
Title
A Comprehensive Analysis for Visual Object Hallucination in Large Vision-Language Models
Liqiang Jing, Guiming Hardy Chen, Ehsan Aghazadeh, Xin Eric Wang, Xinya Du
48
0
0
04 May 2025
PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction
Long Xing, Qidong Huang, Xiaoyi Dong, Jiajie Lu, Pan Zhang, ..., Yuhang Cao, Conghui He, Jiaqi Wang, Feng Wu, Dahua Lin
VLM
40
25
0
22 Oct 2024