Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2104.08271
Cited By
v1
v2 (latest)
TEACHTEXT: CrossModal Generalized Distillation for Text-Video Retrieval
IEEE International Conference on Computer Vision (ICCV), 2021
16 April 2021
Ioana Croitoru
Simion-Vlad Bogolin
Marius Leordeanu
Hailin Jin
Andrew Zisserman
Samuel Albanie
Yang Liu
VGen
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"TEACHTEXT: CrossModal Generalized Distillation for Text-Video Retrieval"
27 / 77 papers shown
PRVR: Partially Relevant Video Retrieval
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Jianfeng Dong
Xianke Chen
Minsong Zhang
Xun Yang
Shujie Chen
Xirong Li
Xun Wang
241
49
0
26 Aug 2022
CrossA11y: Identifying Video Accessibility Issues via Cross-modal Grounding
ACM Symposium on User Interface Software and Technology (UIST), 2022
Xingyu Bruce Liu
Ruolin Wang
Dingzeyu Li
Xiang Ánthony' Chen
Amy Pavel
148
36
0
23 Aug 2022
M2HF: Multi-level Multi-modal Hybrid Fusion for Text-Video Retrieval
Shuo Liu
Weize Quan
Mingyuan Zhou
Sihong Chen
Jian Kang
Zhenlan Zhao
Chen Chen
Dong-Ming Yan
139
3
0
16 Aug 2022
Boosting Video-Text Retrieval with Explicit High-Level Semantics
ACM Multimedia (ACM MM), 2022
Haoran Wang
Di Xu
Dongliang He
Fu Li
Zhong Ji
Jungong Han
Errui Ding
223
16
0
08 Aug 2022
A Feature-space Multimodal Data Augmentation Technique for Text-video Retrieval
ACM Multimedia (ACM MM), 2022
Alex Falcon
G. Serra
Oswald Lanz
VGen
203
29
0
03 Aug 2022
TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval
European Conference on Computer Vision (ECCV), 2022
Yuqi Liu
Pengfei Xiong
Luhui Xu
Shengming Cao
Qin Jin
265
170
0
16 Jul 2022
X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval
ACM Multimedia (ACM MM), 2022
Yiwei Ma
Guohai Xu
Xiaoshuai Sun
Ming Yan
Ji Zhang
Rongrong Ji
CLIP
VLM
267
400
0
15 Jul 2022
RoME: Role-aware Mixture-of-Expert Transformer for Text-to-Video Retrieval
Burak Satar
Erik Cambria
Hanwang Zhang
J. Lim
173
13
0
26 Jun 2022
A CLIP-Hitchhiker's Guide to Long Video Retrieval
Max Bain
Arsha Nagrani
Gül Varol
Andrew Zisserman
CLIP
419
73
0
17 May 2022
Learning to Retrieve Videos by Asking Questions
ACM Multimedia (ACM MM), 2022
Avinash Madasu
Junier Oliva
Gedas Bertasius
VGen
317
19
0
11 May 2022
CenterCLIP: Token Clustering for Efficient Text-Video Retrieval
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2022
Shuai Zhao
Linchao Zhu
Xiaohan Wang
Yi Yang
VLM
CLIP
195
152
0
02 May 2022
Relevance-based Margin for Contrastively-trained Video Retrieval Models
International Conference on Multimedia Retrieval (ICMR), 2022
Alex Falcon
Swathikiran Sudhakaran
G. Serra
Sergio Escalera
Oswald Lanz
365
10
0
27 Apr 2022
Tencent Text-Video Retrieval: Hierarchical Cross-Modal Interactions with Multi-Level Representations
IEEE Access (IEEE Access), 2022
Jie Jiang
Shaobo Min
Weijie Kong
Dihong Gong
Hongfa Wang
Zhifeng Li
Wei Liu
VLM
343
31
0
07 Apr 2022
ECLIPSE: Efficient Long-range Video Retrieval using Sight and Sound
European Conference on Computer Vision (ECCV), 2022
Yan-Bo Lin
Jie Lei
Joey Tianyi Zhou
Gedas Bertasius
391
53
0
06 Apr 2022
X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval
Computer Vision and Pattern Recognition (CVPR), 2022
S. Gorti
Noël Vouitsis
Junwei Ma
Keyvan Golestan
Anthony L. Caterini
Animesh Garg
Guangwei Yu
303
226
0
28 Mar 2022
Learning video retrieval models with relevance-aware online mining
International Conference on Image Analysis and Processing (ICIAP), 2022
Alex Falcon
G. Serra
Oswald Lanz
AI4TS
134
7
0
16 Mar 2022
Disentangled Representation Learning for Text-Video Retrieval
Qiang Wang
Yanhao Zhang
Yun Zheng
Pan Pan
Xiansheng Hua
215
99
0
14 Mar 2022
Multi-Query Video Retrieval
European Conference on Computer Vision (ECCV), 2022
Zeyu Wang
Yu Wu
Karthik Narasimhan
Olga Russakovsky
285
23
0
10 Jan 2022
Sign Language Video Retrieval with Free-Form Textual Queries
Computer Vision and Pattern Recognition (CVPR), 2022
A. Duarte
Samuel Albanie
Xavier Giró-i-Nieto
Gül Varol
SLR
222
36
0
07 Jan 2022
Cross Modal Retrieval with Querybank Normalisation
Computer Vision and Pattern Recognition (CVPR), 2021
Simion-Vlad Bogolin
Ioana Croitoru
Hailin Jin
Yang Liu
Samuel Albanie
293
115
0
23 Dec 2021
Align and Prompt: Video-and-Language Pre-training with Entity Prompts
Computer Vision and Pattern Recognition (CVPR), 2021
Dongxu Li
Junnan Li
Hongdong Li
Juan Carlos Niebles
Guosheng Lin
361
214
0
17 Dec 2021
Audio Retrieval with Natural Language Queries: A Benchmark Study
A. Sophia Koepke
Andreea-Maria Oncescu
João F. Henriques
Zeynep Akata
Samuel Albanie
207
118
0
17 Dec 2021
Prompting Visual-Language Models for Efficient Video Understanding
Chen Ju
Tengda Han
Kunhao Zheng
Ya Zhang
Weidi Xie
VPVLM
VLM
374
460
0
08 Dec 2021
Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval
Fan Hu
Aozhu Chen
Ziyu Wang
Fangming Zhou
Jianfeng Dong
Xirong Li
212
44
0
03 Dec 2021
Object-aware Video-language Pre-training for Retrieval
Alex Jinpeng Wang
Yixiao Ge
Guanyu Cai
Rui Yan
Xudong Lin
Ying Shan
Xiaohu Qie
Mike Zheng Shou
ViT
VLM
280
86
0
01 Dec 2021
Cross-Modal Discrete Representation Learning
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Alexander H. Liu
SouYoung Jin
Cheng-I Jeff Lai
Andrew Rouditchenko
A. Oliva
James R. Glass
SSL
140
52
0
10 Jun 2021
CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval
Huaishao Luo
Lei Ji
Ming Zhong
Yang Chen
Wen Lei
Nan Duan
Tianrui Li
CLIP
VLM
1.5K
1,001
0
18 Apr 2021
Previous
1
2