ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.24840
  4. Cited By
Vision LLMs Are Bad at Hierarchical Visual Understanding, and LLMs Are the Bottleneck

Vision LLMs Are Bad at Hierarchical Visual Understanding, and LLMs Are the Bottleneck

30 May 2025
Yuwen Tan
Yuan Qing
Boqing Gong
ArXiv (abs)PDFHTML

Papers citing "Vision LLMs Are Bad at Hierarchical Visual Understanding, and LLMs Are the Bottleneck"

1 / 1 papers shown
Title
Object Detection with Multimodal Large Vision-Language Models: An In-depth Review
Object Detection with Multimodal Large Vision-Language Models: An In-depth Review
Ranjan Sapkota
Manoj Karkee
ObjDVLM
141
9
0
25 Aug 2025
1