2407.12798
Multi-Granularity and Multi-modal Feature Interaction Approach for Text Video Retrieval
21 June 2024
Wenjun Li
Shudong Wang
Dong Zhao
Shenghui Xu
Zhaoming Pan
Zhimin Zhang
Papers citing "Multi-Granularity and Multi-modal Feature Interaction Approach for Text Video Retrieval"
2 / 2 papers shown
Title
CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval
Huaishao Luo
Lei Ji
Ming Zhong
Yang Chen
Wen Lei
Nan Duan
Tianrui Li
CLIP
VLM
309
771
0
18 Apr 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
293
3,683
0
11 Feb 2021