ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.07577
  4. Cited By
IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language
  Model

IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model

10 July 2024
Yatai Ji
Shilong Zhang
Jie Wu
Peize Sun
Weifeng Chen
Xuefeng Xiao
Sidi Yang
Y. Yang
Ping Luo
    VLM
ArXivPDFHTML

Papers citing "IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model"

4 / 4 papers shown
Title
Unhackable Temporal Rewarding for Scalable Video MLLMs
Unhackable Temporal Rewarding for Scalable Video MLLMs
En Yu
Kangheng Lin
Liang Zhao
Yana Wei
Zining Zhu
...
Jianjian Sun
Zheng Ge
X. Zhang
Jingyu Wang
Wenbing Tao
52
4
0
17 Feb 2025
StoryTeller: Improving Long Video Description through Global Audio-Visual Character Identification
StoryTeller: Improving Long Video Description through Global Audio-Visual Character Identification
Yichen He
Yuan Lin
Jianchao Wu
Hanchong Zhang
Yuchen Zhang
Ruicheng Le
VGen
VLM
39
2
0
11 Nov 2024
Domain-invariant Representation Learning via Segment Anything Model for
  Blood Cell Classification
Domain-invariant Representation Learning via Segment Anything Model for Blood Cell Classification
Yongcheng Li
Lingcong Cai
Ying Lu
Cheng Lin
Yupeng Zhang
...
Genan Dai
Bowen Zhang
Jingzhou Cao
Xiangzhong Zhang
Xiaomao Fan
33
1
0
14 Aug 2024
InternLM-XComposer2: Mastering Free-form Text-Image Composition and
  Comprehension in Vision-Language Large Model
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model
Xiao-wen Dong
Pan Zhang
Yuhang Zang
Yuhang Cao
Bin Wang
...
Conghui He
Xingcheng Zhang
Yu Qiao
Dahua Lin
Jiaqi Wang
VLM
MLLM
70
89
0
29 Jan 2024
1