
Title |
|---|
![]() Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data PerspectivesAnnual Meeting of the Association for Computational Linguistics (ACL), 2024 |
![]() End-to-end Concept Word Detection for Video Captioning, Retrieval, and
Question AnsweringComputer Vision and Pattern Recognition (CVPR), 2016 |