Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2108.06458
Cited By
Cross-Modal Graph with Meta Concepts for Video Captioning
14 August 2021
Hao Wang
Guosheng Lin
S. Hoi
C. Miao
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Cross-Modal Graph with Meta Concepts for Video Captioning"
4 / 4 papers shown
Title
Accommodating Audio Modality in CLIP for Multimodal Processing
Ludan Ruan
Anwen Hu
Yuqing Song
Liang Zhang
S. Zheng
Qin Jin
VLM
16
10
0
12 Mar 2023
Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
Bairui Wang
Lin Ma
Wei Zhang
Wenhao Jiang
Jingwen Wang
Wei Liu
63
158
0
27 Aug 2019
ECO: Efficient Convolutional Network for Online Video Understanding
Mohammadreza Zolfaghari
Kamaljeet Singh
Thomas Brox
119
495
0
24 Apr 2018
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Z. Tu
Kaiming He
261
10,106
0
16 Nov 2016
1