Multi-modal Feature Fusion with Feature Attention for VATEX Captioning Challenge 2020

5 June 2020
Ke Lin
Zhuoxin Gan
Liwei Wang

Papers citing "Multi-modal Feature Fusion with Feature Attention for VATEX Captioning Challenge 2020"

3 / 3 papers shown
Synchronized Audio-Visual Frames with Fractional Positional Encoding for Transformers in Video-to-Text Translation
International Conference on Information Photonics (ICIP), 2021
Philipp Harzig
Moritz Einfalt
Rainer Lienhart
ViT
28 Dec 2021
A Comprehensive Review of the Video-to-Text Problem
Artificial Intelligence Review (AIR), 2021
Jesus Perez-Martin
B. Bustos
S. Guimarães
I. Sipiran
Jorge A. Pérez
Grethel Coello Said
27 Mar 2021
A Comprehensive Review on Recent Methods and Challenges of Video Description
Ashutosh Kumar Singh
Thoudam Doren Singh
Sivaji Bandyopadhyay
3DV
VLM
30 Nov 2020