Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2407.07577
Cited By
IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
10 July 2024
Yatai Ji
Shilong Zhang
Jie Wu
Peize Sun
Weifeng Chen
Xuefeng Xiao
Sidi Yang
Y. Yang
Ping Luo
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model"
4 / 4 papers shown
Title
Unhackable Temporal Rewarding for Scalable Video MLLMs
En Yu
Kangheng Lin
Liang Zhao
Yana Wei
Zining Zhu
...
Jianjian Sun
Zheng Ge
X. Zhang
Jingyu Wang
Wenbing Tao
52
4
0
17 Feb 2025
StoryTeller: Improving Long Video Description through Global Audio-Visual Character Identification
Yichen He
Yuan Lin
Jianchao Wu
Hanchong Zhang
Yuchen Zhang
Ruicheng Le
VGen
VLM
39
2
0
11 Nov 2024
Domain-invariant Representation Learning via Segment Anything Model for Blood Cell Classification
Yongcheng Li
Lingcong Cai
Ying Lu
Cheng Lin
Yupeng Zhang
...
Genan Dai
Bowen Zhang
Jingzhou Cao
Xiangzhong Zhang
Xiaomao Fan
33
1
0
14 Aug 2024
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model
Xiao-wen Dong
Pan Zhang
Yuhang Zang
Yuhang Cao
Bin Wang
...
Conghui He
Xingcheng Zhang
Yu Qiao
Dahua Lin
Jiaqi Wang
VLM
MLLM
73
89
0
29 Jan 2024
1