Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2505.13788
Cited By

Ground-V: Teaching VLMs to Ground Complex Instructions in Pixels

Ground-V: Teaching VLMs to Ground Complex Instructions in Pixels

Computer Vision and Pattern Recognition (CVPR), 2025

20 May 2025

ArXiv (abs)PDF HTML

Papers citing "Ground-V: Teaching VLMs to Ground Complex Instructions in Pixels"

2 / 2 papers shown

MINGLE: VLMs for Semantically Complex Region Detection in Urban Scenes

MINGLE: VLMs for Semantically Complex Region Detection in Urban Scenes

Alexandra Kudaeva

Fatimeh Al Ghannam

244

0

0

16 Sep 2025

Multimodal Referring Segmentation: A Survey

Multimodal Referring Segmentation: A Survey

387

11

0

01 Aug 2025

Page 1 of 1