CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval

18 April 2021

Tianrui Li

Papers citing "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Title | Authors | Tags | Counts | Date
ShapeSpeak: Body Shape-Aware Textual Alignment for Visible-Infrared Person Re-Identification | Shuanglin Yan, Neng Dong, Shuang Li, Rui Yan, Hao Tang, Jing Qin | | 22 0 0 | 25 Apr 2025
Perception Encoder: The best visual embeddings are not at the output of the network | Daniel Bolya, Po-Yao (Bernie) Huang, Peize Sun, Jang Hyun Cho, Andrea Madotto, ..., Shiyu Dong, Nikhila Ravi, Daniel Li, Piotr Dollár, Christoph Feichtenhofer | ObjD, VOS | 62 0 0 | 17 Apr 2025
Generalized Contrastive Learning for Multi-Modal Retrieval and Ranking | Tianyu Zhu, M. Jung, Jesse Clark | | 55 1 0 | 12 Apr 2024
A Straightforward Framework For Video Retrieval Using CLIP | Jesús Andrés Portillo-Quintero, J. C. Ortíz-Bayliss, Hugo Terashima-Marín | CLIP | 281 106 0 | 24 Feb 2021
Is Space-Time Attention All You Need for Video Understanding? | Gedas Bertasius, Heng Wang, Lorenzo Torresani | ViT | 264 1,486 0 | 09 Feb 2021
Multi-modal Transformer for Video Retrieval | Valentin Gabeur, Chen Sun, Alahari Karteek, Cordelia Schmid | ViT | 381 532 0 | 21 Jul 2020