GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models

2 January 2025

Papers citing "GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models"


Title: LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness
Authors: Chenming Zhu, Tai Wang, Wenwei Zhang, Jiangmiao Pang, Xihui Liu
Date: 26 Sep 2024