Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.19221
Cited By
Towards Multimodal Video Paragraph Captioning Models Robust to Missing Modality
28 March 2024
Sishuo Chen
Lei Li
Shuhuai Ren
Rundong Gao
Yuanxin Liu
Xiaohan Bi
Xu Sun
Lu Hou
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Towards Multimodal Video Paragraph Captioning Models Robust to Missing Modality"
2 / 2 papers shown
Title
Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
Chaoyou Fu
Yuhan Dai
Yondong Luo
Lei Li
Shuhuai Ren
...
Tong Bill Xu
Xiawu Zheng
Enhong Chen
Rongrong Ji
Xing Sun
VLM
MLLM
41
216
0
31 May 2024
A Comprehensive Review of Knowledge Distillation in Computer Vision
Sheikh Musa Kaleem
Tufail Rouf
Gousia Habib
Tausifa Jan Saleem
Brejesh Lall
VLM
17
12
0
01 Apr 2024
1