2501.00599
Cited By
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM
31 December 2024
Yuqian Yuan, Hang Zhang, Wentong Li, Zesen Cheng, Boqiang Zhang, Long Li, Xin Li, Deli Zhao, Wenqiao Zhang, Yueting Zhuang, Jianke Zhu, Lidong Bing
Papers citing "VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM"
3 / 3 papers shown
Title
VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models
Haojian Huang, Haodong Chen, Shengqiong Wu, Meng Luo, Jinlan Fu, Xinya Du, H. Zhang, Hao Fei
AI4TS
58
0
0
17 Apr 2025
FocusedAD: Character-centric Movie Audio Description
Xiaojun Ye, C. Wang, Yiren Song, Sheng Zhou, Liangcheng Li, Jiajun Bu
VGen
37
0
0
16 Apr 2025
V2P-Bench: Evaluating Video-Language Understanding with Visual Prompts for Better Human-Model Interaction
Yiming Zhao, Y. Zeng, Yukun Qi, Y. Liu, Lin Yen-Chen, Zehui Chen, Xikun Bao, Jie Zhao, Feng Zhao
VLM
53
2
0
22 Mar 2025