Integrating both Visual and Audio Cues for Enhanced Video Caption

22 November 2017

Papers citing "Integrating both Visual and Audio Cues for Enhanced Video Caption"

3 / 3 papers shown

Title
Visual Sensation and Perception Computational Models for Deep Learning: State of the art, Challenges and Prospects Bing Wei Yudi Zhao K. Hao Lei Gao 33 5 0 08 Sep 2021
Watch, Listen and Tell: Multi-modal Weakly Supervised Dense Event Captioning Tanzila Rahman Bicheng Xu Leonid Sigal 25 77 0 22 Sep 2019
Temporal Deformable Convolutional Encoder-Decoder Networks for Video Captioning Jingwen Chen Yingwei Pan Yehao Li Ting Yao Hongyang Chao Tao Mei 17 104 0 03 May 2019