ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.13788
  4. Cited By
Ground-V: Teaching VLMs to Ground Complex Instructions in Pixels

Ground-V: Teaching VLMs to Ground Complex Instructions in Pixels

Computer Vision and Pattern Recognition (CVPR), 2025
20 May 2025
Yongshuo Zong
Qin Zhang
Dongsheng An
Zhihua Li
Xiang Xu
Linghan Xu
Zhuowen Tu
Yifan Xing
Onkar Dabeer
    ObjD
ArXiv (abs)PDFHTML

Papers citing "Ground-V: Teaching VLMs to Ground Complex Instructions in Pixels"

2 / 2 papers shown
MINGLE: VLMs for Semantically Complex Region Detection in Urban Scenes
MINGLE: VLMs for Semantically Complex Region Detection in Urban Scenes
Liu Liu
Alexandra Kudaeva
Marco Cipriano
Fatimeh Al Ghannam
Freya Tan
Gerard de Melo
Andres Sevtsuk
245
0
0
16 Sep 2025
Multimodal Referring Segmentation: A Survey
Multimodal Referring Segmentation: A Survey
Henghui Ding
Song Tang
Shuting He
Chang-rui Liu
Zuxuan Wu
Yu-Gang Jiang
387
11
0
01 Aug 2025
1
Page 1 of 1