Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2501.01428
Cited By
GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models
2 January 2025
Zhangyang Qi
Zhixiong Zhang
Ye Fang
Jiaqi Wang
Hengshuang Zhao
Re-assign community
ArXiv
PDF
HTML
Papers citing
"GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models"
1 / 1 papers shown
Title
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness
Chenming Zhu
Tai Wang
Wenwei Zhang
Jiangmiao Pang
Xihui Liu
63
29
0
26 Sep 2024
1