Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.05541
Cited By
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting
7 April 2025
Yunlong Tang
Jing Bi
Chao Huang
Susan Liang
Daiki Shimada
Hang Hua
Yunzhong Xiao
Yizhi Song
Pinxin Liu
Mingqian Feng
Junjia Guo
Z. Liu
Luchuan Song
A. Vosoughi
Jinxi He
Liu He
Zeliang Zhang
Jiebo Luo
Chenliang Xu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting"
1 / 1 papers shown
Title
WikiVideo: Article Generation from Multiple Videos
Alexander Martin
Reno Kriz
William Walden
Kate Sanders
Hannah Recknor
Eugene Yang
Francis Ferraro
Benjamin Van Durme
DiffM
VGen
42
1
0
01 Apr 2025
1