ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.08271
  4. Cited By
TEACHTEXT: CrossModal Generalized Distillation for Text-Video Retrieval
v1v2 (latest)

TEACHTEXT: CrossModal Generalized Distillation for Text-Video Retrieval

IEEE International Conference on Computer Vision (ICCV), 2021
16 April 2021
Ioana Croitoru
Simion-Vlad Bogolin
Marius Leordeanu
Hailin Jin
Andrew Zisserman
Samuel Albanie
Yang Liu
    VGen
ArXiv (abs)PDFHTML

Papers citing "TEACHTEXT: CrossModal Generalized Distillation for Text-Video Retrieval"

27 / 77 papers shown
PRVR: Partially Relevant Video Retrieval
PRVR: Partially Relevant Video RetrievalIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Jianfeng Dong
Xianke Chen
Minsong Zhang
Xun Yang
Shujie Chen
Xirong Li
Xun Wang
241
49
0
26 Aug 2022
CrossA11y: Identifying Video Accessibility Issues via Cross-modal
  Grounding
CrossA11y: Identifying Video Accessibility Issues via Cross-modal GroundingACM Symposium on User Interface Software and Technology (UIST), 2022
Xingyu Bruce Liu
Ruolin Wang
Dingzeyu Li
Xiang Ánthony' Chen
Amy Pavel
148
36
0
23 Aug 2022
M2HF: Multi-level Multi-modal Hybrid Fusion for Text-Video Retrieval
M2HF: Multi-level Multi-modal Hybrid Fusion for Text-Video Retrieval
Shuo Liu
Weize Quan
Mingyuan Zhou
Sihong Chen
Jian Kang
Zhenlan Zhao
Chen Chen
Dong-Ming Yan
139
3
0
16 Aug 2022
Boosting Video-Text Retrieval with Explicit High-Level Semantics
Boosting Video-Text Retrieval with Explicit High-Level SemanticsACM Multimedia (ACM MM), 2022
Haoran Wang
Di Xu
Dongliang He
Fu Li
Zhong Ji
Jungong Han
Errui Ding
223
16
0
08 Aug 2022
A Feature-space Multimodal Data Augmentation Technique for Text-video
  Retrieval
A Feature-space Multimodal Data Augmentation Technique for Text-video RetrievalACM Multimedia (ACM MM), 2022
Alex Falcon
G. Serra
Oswald Lanz
VGen
203
29
0
03 Aug 2022
TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval
TS2-Net: Token Shift and Selection Transformer for Text-Video RetrievalEuropean Conference on Computer Vision (ECCV), 2022
Yuqi Liu
Pengfei Xiong
Luhui Xu
Shengming Cao
Qin Jin
265
170
0
16 Jul 2022
X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text
  Retrieval
X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text RetrievalACM Multimedia (ACM MM), 2022
Yiwei Ma
Guohai Xu
Xiaoshuai Sun
Ming Yan
Ji Zhang
Rongrong Ji
CLIPVLM
267
400
0
15 Jul 2022
RoME: Role-aware Mixture-of-Expert Transformer for Text-to-Video
  Retrieval
RoME: Role-aware Mixture-of-Expert Transformer for Text-to-Video Retrieval
Burak Satar
Erik Cambria
Hanwang Zhang
J. Lim
173
13
0
26 Jun 2022
A CLIP-Hitchhiker's Guide to Long Video Retrieval
A CLIP-Hitchhiker's Guide to Long Video Retrieval
Max Bain
Arsha Nagrani
Gül Varol
Andrew Zisserman
CLIP
419
73
0
17 May 2022
Learning to Retrieve Videos by Asking Questions
Learning to Retrieve Videos by Asking QuestionsACM Multimedia (ACM MM), 2022
Avinash Madasu
Junier Oliva
Gedas Bertasius
VGen
317
19
0
11 May 2022
CenterCLIP: Token Clustering for Efficient Text-Video Retrieval
CenterCLIP: Token Clustering for Efficient Text-Video RetrievalAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2022
Shuai Zhao
Linchao Zhu
Xiaohan Wang
Yi Yang
VLMCLIP
195
152
0
02 May 2022
Relevance-based Margin for Contrastively-trained Video Retrieval Models
Relevance-based Margin for Contrastively-trained Video Retrieval ModelsInternational Conference on Multimedia Retrieval (ICMR), 2022
Alex Falcon
Swathikiran Sudhakaran
G. Serra
Sergio Escalera
Oswald Lanz
365
10
0
27 Apr 2022
Tencent Text-Video Retrieval: Hierarchical Cross-Modal Interactions with
  Multi-Level Representations
Tencent Text-Video Retrieval: Hierarchical Cross-Modal Interactions with Multi-Level RepresentationsIEEE Access (IEEE Access), 2022
Jie Jiang
Shaobo Min
Weijie Kong
Dihong Gong
Hongfa Wang
Zhifeng Li
Wei Liu
VLM
343
31
0
07 Apr 2022
ECLIPSE: Efficient Long-range Video Retrieval using Sight and Sound
ECLIPSE: Efficient Long-range Video Retrieval using Sight and SoundEuropean Conference on Computer Vision (ECCV), 2022
Yan-Bo Lin
Jie Lei
Joey Tianyi Zhou
Gedas Bertasius
391
53
0
06 Apr 2022
X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval
X-Pool: Cross-Modal Language-Video Attention for Text-Video RetrievalComputer Vision and Pattern Recognition (CVPR), 2022
S. Gorti
Noël Vouitsis
Junwei Ma
Keyvan Golestan
Anthony L. Caterini
Animesh Garg
Guangwei Yu
303
226
0
28 Mar 2022
Learning video retrieval models with relevance-aware online mining
Learning video retrieval models with relevance-aware online miningInternational Conference on Image Analysis and Processing (ICIAP), 2022
Alex Falcon
G. Serra
Oswald Lanz
AI4TS
134
7
0
16 Mar 2022
Disentangled Representation Learning for Text-Video Retrieval
Disentangled Representation Learning for Text-Video Retrieval
Qiang Wang
Yanhao Zhang
Yun Zheng
Pan Pan
Xiansheng Hua
215
99
0
14 Mar 2022
Multi-Query Video Retrieval
Multi-Query Video RetrievalEuropean Conference on Computer Vision (ECCV), 2022
Zeyu Wang
Yu Wu
Karthik Narasimhan
Olga Russakovsky
285
23
0
10 Jan 2022
Sign Language Video Retrieval with Free-Form Textual Queries
Sign Language Video Retrieval with Free-Form Textual QueriesComputer Vision and Pattern Recognition (CVPR), 2022
A. Duarte
Samuel Albanie
Xavier Giró-i-Nieto
Gül Varol
SLR
222
36
0
07 Jan 2022
Cross Modal Retrieval with Querybank Normalisation
Cross Modal Retrieval with Querybank NormalisationComputer Vision and Pattern Recognition (CVPR), 2021
Simion-Vlad Bogolin
Ioana Croitoru
Hailin Jin
Yang Liu
Samuel Albanie
293
115
0
23 Dec 2021
Align and Prompt: Video-and-Language Pre-training with Entity Prompts
Align and Prompt: Video-and-Language Pre-training with Entity PromptsComputer Vision and Pattern Recognition (CVPR), 2021
Dongxu Li
Junnan Li
Hongdong Li
Juan Carlos Niebles
Guosheng Lin
361
214
0
17 Dec 2021
Audio Retrieval with Natural Language Queries: A Benchmark Study
Audio Retrieval with Natural Language Queries: A Benchmark Study
A. Sophia Koepke
Andreea-Maria Oncescu
João F. Henriques
Zeynep Akata
Samuel Albanie
207
118
0
17 Dec 2021
Prompting Visual-Language Models for Efficient Video Understanding
Prompting Visual-Language Models for Efficient Video Understanding
Chen Ju
Tengda Han
Kunhao Zheng
Ya Zhang
Weidi Xie
VPVLMVLM
374
460
0
08 Dec 2021
Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video
  Retrieval
Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval
Fan Hu
Aozhu Chen
Ziyu Wang
Fangming Zhou
Jianfeng Dong
Xirong Li
212
44
0
03 Dec 2021
Object-aware Video-language Pre-training for Retrieval
Object-aware Video-language Pre-training for Retrieval
Alex Jinpeng Wang
Yixiao Ge
Guanyu Cai
Rui Yan
Xudong Lin
Ying Shan
Xiaohu Qie
Mike Zheng Shou
ViTVLM
280
86
0
01 Dec 2021
Cross-Modal Discrete Representation Learning
Cross-Modal Discrete Representation LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Alexander H. Liu
SouYoung Jin
Cheng-I Jeff Lai
Andrew Rouditchenko
A. Oliva
James R. Glass
SSL
140
52
0
10 Jun 2021
CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip
  Retrieval
CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval
Huaishao Luo
Lei Ji
Ming Zhong
Yang Chen
Wen Lei
Nan Duan
Tianrui Li
CLIPVLM
1.5K
1,001
0
18 Apr 2021
Previous
12