Communities
Connect sessions
AI calendar
Organizations
Contact Sales
Search
Open menu
Home
Papers
2505.24840
Cited By
Vision LLMs Are Bad at Hierarchical Visual Understanding, and LLMs Are the Bottleneck
30 May 2025
Yuwen Tan
Yuan Qing
Boqing Gong
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Vision LLMs Are Bad at Hierarchical Visual Understanding, and LLMs Are the Bottleneck"
1 / 1 papers shown
Title
Object Detection with Multimodal Large Vision-Language Models: An In-depth Review
Ranjan Sapkota
Manoj Karkee
ObjD
VLM
145
9
0
25 Aug 2025
1