ResearchTrend.AI

arXiv: 1711.08097

Integrating both Visual and Audio Cues for Enhanced Video Caption

22 November 2017
Wangli Hao
Zhaoxiang Zhang
He Guan
Guibo Zhu

Papers citing "Integrating both Visual and Audio Cues for Enhanced Video Caption"

3 / 3 papers shown
Visual Sensation and Perception Computational Models for Deep Learning: State of the art, Challenges and Prospects
Bing Wei
Yudi Zhao
K. Hao
Lei Gao
33
5
0
08 Sep 2021
Watch, Listen and Tell: Multi-modal Weakly Supervised Dense Event Captioning
Tanzila Rahman
Bicheng Xu
Leonid Sigal
25
77
0
22 Sep 2019
Temporal Deformable Convolutional Encoder-Decoder Networks for Video Captioning
Jingwen Chen
Yingwei Pan
Yehao Li
Ting Yao
Hongyang Chao
Tao Mei
17
104
0
03 May 2019