Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2109.07951
Cited By
Overview of Tencent Multi-modal Ads Video Understanding Challenge
16 September 2021
Zhenzhi Wang
Liyu Wu
Zhimin Li
Jiangfeng Xiong
Qinglin Lu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Overview of Tencent Multi-modal Ads Video Understanding Challenge"
3 / 3 papers shown
Title
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
Hassan Akbari
Liangzhe Yuan
Rui Qian
Wei-Hong Chuang
Shih-Fu Chang
Yin Cui
Boqing Gong
ViT
231
573
0
22 Apr 2021
Generic Event Boundary Detection: A Benchmark for Event Segmentation
Mike Zheng Shou
Stan Weixian Lei
Weiyao Wang
Deepti Ghadiyaram
Matt Feiszli
VOS
71
70
0
26 Jan 2021
Multi-modal Transformer for Video Retrieval
Valentin Gabeur
Chen Sun
Alahari Karteek
Cordelia Schmid
ViT
398
532
0
21 Jul 2020
1