2111.05610
Cited By
CLIP2TV: Align, Match and Distill for Video-Text Retrieval
10 November 2021
Zijian Gao
J. Liu
Weiqi Sun
S. Chen
Dedan Chang
Lili Zhao
VLM
CLIP
Papers citing "CLIP2TV: Align, Match and Distill for Video-Text Retrieval"
5 / 5 papers shown
Title
RGNet: A Unified Clip Retrieval and Grounding Network for Long Videos
Tanveer Hannan
Md. Mohaiminul Islam
Thomas Seidl
Gedas Bertasius
26
3
0
11 Dec 2023
STOA-VLP: Spatial-Temporal Modeling of Object and Action for Video-Language Pre-training
Weihong Zhong
Mao Zheng
Duyu Tang
Xuan Luo
Heng Gong
Xiaocheng Feng
Bing Qin
25
8
0
20 Feb 2023
CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval
Huaishao Luo
Lei Ji
Ming Zhong
Yang Chen
Wen Lei
Nan Duan
Tianrui Li
CLIP
VLM
309
778
0
18 Apr 2021
Multi-modal Transformer for Video Retrieval
Valentin Gabeur
Chen Sun
Karteek Alahari
Cordelia Schmid
ViT
410
594
0
21 Jul 2020
TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Jie Lei
Licheng Yu
Tamara L. Berg
Mohit Bansal
106
275
0
24 Jan 2020