ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2007.02503
  4. Cited By
Tree-Augmented Cross-Modal Encoding for Complex-Query Video Retrieval

Tree-Augmented Cross-Modal Encoding for Complex-Query Video Retrieval

6 July 2020
Xun Yang
Jianfeng Dong
Yixin Cao
Xun Wang
Meng Wang
Tat-Seng Chua
ArXiv (abs)PDFHTML

Papers citing "Tree-Augmented Cross-Modal Encoding for Complex-Query Video Retrieval"

47 / 47 papers shown
Dual Learning with Dynamic Knowledge Distillation and Soft Alignment for Partially Relevant Video Retrieval
Dual Learning with Dynamic Knowledge Distillation and Soft Alignment for Partially Relevant Video Retrieval
Jianfeng Dong
Lei Huang
Daizong Liu
Xianke Chen
Xun Yang
Changting Lin
Xun Wang
Meng Wang
168
0
0
14 Oct 2025
Leveraging Auxiliary Information in Text-to-Video Retrieval: A Review
Leveraging Auxiliary Information in Text-to-Video Retrieval: A Review
A. Fragomeni
Dima Damen
Michael Wray
268
0
0
29 May 2025
CMMLoc: Advancing Text-to-PointCloud Localization with Cauchy-Mixture-Model Based Framework
CMMLoc: Advancing Text-to-PointCloud Localization with Cauchy-Mixture-Model Based FrameworkComputer Vision and Pattern Recognition (CVPR), 2025
Yanlong Xu
Haoxuan Qu
Qingbin Liu
Wenxiao Zhang
Xun Yang
1.1K
6
0
04 Mar 2025
Dual-stream Feature Augmentation for Domain Generalization
Dual-stream Feature Augmentation for Domain GeneralizationACM Multimedia (MM), 2024
Shanshan Wang
ALuSi
Xun Yang
Ke Xu
H. Tan
Xingyi Zhang
AAMLOOD
201
8
0
07 Sep 2024
Hire: Hybrid-modal Interaction with Multiple Relational Enhancements for
  Image-Text Matching
Hire: Hybrid-modal Interaction with Multiple Relational Enhancements for Image-Text Matching
Xuri Ge
Fuhai Chen
Songpei Xu
Fuxiang Tao
Jie Wang
Joemon M. Jose
262
3
0
05 Jun 2024
An Empirical Study of Excitation and Aggregation Design Adaptions in
  CLIP4Clip for Video-Text Retrieval
An Empirical Study of Excitation and Aggregation Design Adaptions in CLIP4Clip for Video-Text Retrieval
Xiaolun Jing
Genke Yang
Jian Chu
CLIP
301
3
0
25 May 2024
Improving Video Corpus Moment Retrieval with Partial Relevance
  Enhancement
Improving Video Corpus Moment Retrieval with Partial Relevance Enhancement
Danyang Hou
Liang Pang
Huawei Shen
Xueqi Cheng
377
9
0
21 Feb 2024
CL2CM: Improving Cross-Lingual Cross-Modal Retrieval via Cross-Lingual
  Knowledge Transfer
CL2CM: Improving Cross-Lingual Cross-Modal Retrieval via Cross-Lingual Knowledge TransferAAAI Conference on Artificial Intelligence (AAAI), 2023
Yabing Wang
Fan Wang
Jianfeng Dong
Hao Luo
VLM
259
20
0
14 Dec 2023
Finding and Editing Multi-Modal Neurons in Pre-Trained Transformers
Finding and Editing Multi-Modal Neurons in Pre-Trained TransformersAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Haowen Pan
Yixin Cao
Xiaozhi Wang
Xun Yang
Meng Wang
KELM
378
41
0
13 Nov 2023
Unified Multi-modal Unsupervised Representation Learning for
  Skeleton-based Action Understanding
Unified Multi-modal Unsupervised Representation Learning for Skeleton-based Action UnderstandingACM Multimedia (ACM MM), 2023
Shengkai Sun
Daizong Liu
Jianfeng Dong
Xiaoye Qu
Junyu Gao
Xun Yang
Xun Wang
Meng Wang
OffRL
329
31
0
06 Nov 2023
DFIL: Deepfake Incremental Learning by Exploiting Domain-invariant
  Forgery Clues
DFIL: Deepfake Incremental Learning by Exploiting Domain-invariant Forgery CluesACM Multimedia (ACM MM), 2023
Kun Pan
Yifang Yin
Yao Wei
Feng Lin
Zhongjie Ba
Zhenguang Liu
Peng Kuang
Lorenzo Cavallaro
Kui Ren
CLL
387
39
0
18 Sep 2023
Video Infringement Detection via Feature Disentanglement and Mutual
  Information Maximization
Video Infringement Detection via Feature Disentanglement and Mutual Information MaximizationACM Multimedia (ACM MM), 2023
Zhenguang Liu
Xinyang Yu
Ruili Wang
Shuai Ye
Zhe Ma
...
Sifeng He
Feng Qian
Xiao-Yong Zhang
Roger Zimmermann
Lei Yang
346
1
0
13 Sep 2023
Class-level Structural Relation Modelling and Smoothing for Visual
  Representation Learning
Class-level Structural Relation Modelling and Smoothing for Visual Representation LearningACM Multimedia (ACM MM), 2023
Zitan Chen
Zhuang Qi
Xiao Cao
Xiangxian Li
Xiangxu Meng
Lei Meng
312
12
0
08 Aug 2023
Cross-Silo Prototypical Calibration for Federated Learning with Non-IID
  Data
Cross-Silo Prototypical Calibration for Federated Learning with Non-IID DataACM Multimedia (ACM MM), 2023
Zhuang Qi
Lei Meng
Zitan Chen
Han Hu
Hui Lin
Xiangxu Meng
FedML
311
52
0
07 Aug 2023
From Region to Patch: Attribute-Aware Foreground-Background Contrastive
  Learning for Fine-Grained Fashion Retrieval
From Region to Patch: Attribute-Aware Foreground-Background Contrastive Learning for Fine-Grained Fashion RetrievalAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2023
Jianfeng Dong
Xi Peng
Zhe Ma
Daizong Liu
Xiaoye Qu
Xun Yang
Jixiang Zhu
Baolong Liu
243
18
0
17 May 2023
Transform-Equivariant Consistency Learning for Temporal Sentence
  Grounding
Transform-Equivariant Consistency Learning for Temporal Sentence Grounding
Daizong Liu
Xiaoye Qu
Jianfeng Dong
Pan Zhou
Zichuan Xu
Yining Qi
Xing Di
Weining Lu
Yu Cheng
327
12
0
06 May 2023
A Review of Deep Learning for Video Captioning
A Review of Deep Learning for Video CaptioningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Moloud Abdar
Meenakshi Kollati
Swaraja Kuraparthi
Farhad Pourpanah
Daniel J. McDuff
...
Shuicheng Yan
Abduallah A. Mohamed
Abbas Khosravi
Xiaoshi Zhong
Fatih Porikli
3DV
273
48
0
22 Apr 2023
Improving Video Retrieval by Adaptive Margin
Improving Video Retrieval by Adaptive MarginAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2021
Feng He
Qi Wang
Zhifan Feng
Wenbin Jiang
Yajuan Lü
Yong Zhu
Xiao Tan
340
25
0
09 Mar 2023
Deep Learning for Video-Text Retrieval: a Review
Deep Learning for Video-Text Retrieval: a ReviewInternational Journal of Multimedia Information Retrieval (IJMIR), 2023
Cunjuan Zhu
Qi Jia
Wei Chen
Yanming Guo
Yu Liu
254
35
0
24 Feb 2023
Multi-video Moment Ranking with Multimodal Clue
Multi-video Moment Ranking with Multimodal Clue
Danyang Hou
Liang Pang
Yanyan Lan
Huawei Shen
Xueqi Cheng
169
1
0
29 Jan 2023
Rethinking the Video Sampling and Reasoning Strategies for Temporal
  Sentence Grounding
Rethinking the Video Sampling and Reasoning Strategies for Temporal Sentence GroundingConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jiahao Zhu
Daizong Liu
Pan Zhou
Xing Di
Yu Cheng
...
Wenzheng Xu
Zichuan Xu
Yao Wan
Lichao Sun
Zeyu Xiong
237
35
0
02 Jan 2023
VLG: General Video Recognition with Web Textual Knowledge
VLG: General Video Recognition with Web Textual KnowledgeInternational Journal of Computer Vision (IJCV), 2022
Jintao Lin
Zhaoyang Liu
Wenhai Wang
Wayne Wu
Limin Wang
380
4
0
03 Dec 2022
Are All Combinations Equal? Combining Textual and Visual Features with
  Multiple Space Learning for Text-Based Video Retrieval
Are All Combinations Equal? Combining Textual and Visual Features with Multiple Space Learning for Text-Based Video Retrieval
Damianos Galanopoulos
Vasileios Mezaris
270
7
0
21 Nov 2022
Efficient Cross-Modal Video Retrieval with Meta-Optimized Frames
Efficient Cross-Modal Video Retrieval with Meta-Optimized FramesIEEE transactions on multimedia (IEEE TMM), 2022
Ning Han
Xun Yang
Ee-Peng Lim
Hao Chen
Qianru Sun
271
9
0
16 Oct 2022
Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning
Cross-Lingual Cross-Modal Retrieval with Noise-Robust LearningACM Multimedia (ACM MM), 2022
Yabing Wang
Jianfeng Dong
Tianxiang Liang
Minsong Zhang
Rui Cai
Xun Wang
315
31
0
26 Aug 2022
PRVR: Partially Relevant Video Retrieval
PRVR: Partially Relevant Video RetrievalIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Jianfeng Dong
Xianke Chen
Minsong Zhang
Xun Yang
Shujie Chen
Xirong Li
Xun Wang
340
49
0
26 Aug 2022
Semantic Data Augmentation based Distance Metric Learning for Domain
  Generalization
Semantic Data Augmentation based Distance Metric Learning for Domain GeneralizationACM Multimedia (ACM MM), 2022
Mengzhu Wang
Jianlong Yuan
Qi Qian
Zhibin Wang
Hao Li
384
41
0
02 Aug 2022
X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text
  Retrieval
X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text RetrievalACM Multimedia (ACM MM), 2022
Yiwei Ma
Guohai Xu
Xiaoshuai Sun
Ming Yan
Ji Zhang
Rongrong Ji
CLIPVLM
309
432
0
15 Jul 2022
Learn to Understand Negation in Video Retrieval
Learn to Understand Negation in Video RetrievalACM Multimedia (ACM MM), 2022
Ziyue Wang
Aozhu Chen
Fan Hu
Xirong Li
SSL
299
18
0
30 Apr 2022
BP-Triplet Net for Unsupervised Domain Adaptation: A Bayesian
  Perspective
BP-Triplet Net for Unsupervised Domain Adaptation: A Bayesian PerspectivePattern Recognition (Pattern Recogn.), 2022
Shanshan Wang
Lei Zhang
Pichao Wang
182
32
0
19 Feb 2022
Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval
Hybrid Contrastive Quantization for Efficient Cross-View Video RetrievalThe Web Conference (WWW), 2022
Jinpeng Wang
Bin Chen
Dongliang Liao
Ziyun Zeng
Gongfu Li
Shutao Xia
Jin Xu
267
10
0
07 Feb 2022
Explore-And-Match: Bridging Proposal-Based and Proposal-Free With
  Transformer for Sentence Grounding in Videos
Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding in Videos
Sangmin Woo
Jinyoung Park
Inyong Koo
Sumin Lee
Minki Jeong
Changick Kim
504
6
0
25 Jan 2022
Reading-strategy Inspired Visual Representation Learning for
  Text-to-Video Retrieval
Reading-strategy Inspired Visual Representation Learning for Text-to-Video Retrieval
Jianfeng Dong
Yabing Wang
Xianke Chen
Xiaoye Qu
Xirong Li
Y. He
Xun Wang
456
80
0
23 Jan 2022
Classification-Then-Grounding: Reformulating Video Scene Graphs as
  Temporal Bipartite Graphs
Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs
Kaifeng Gao
Long Chen
Yulei Niu
Jian Shao
Jun Xiao
244
38
0
08 Dec 2021
Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video
  Retrieval
Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval
Fan Hu
Aozhu Chen
Ziyu Wang
Fangming Zhou
Jianfeng Dong
Xirong Li
280
52
0
03 Dec 2021
BiC-Net: Learning Efficient Spatio-Temporal Relation for Text-Video
  Retrieval
BiC-Net: Learning Efficient Spatio-Temporal Relation for Text-Video Retrieval
Ning Han
Jingjing Chen
Chuhao Shi
Yawen Zeng
Guangyi Xiao
Hao Chen
366
17
0
29 Oct 2021
A Novel Patch Convolutional Neural Network for View-based 3D Model
  Retrieval
A Novel Patch Convolutional Neural Network for View-based 3D Model RetrievalACM Multimedia (ACM MM), 2021
Zan Gao
Yuxiang Shao
Weili Guan
Meng Liu
Zhiyong Cheng
Shengyong Chen
3DV3DPC
154
11
0
25 Sep 2021
Adaptive Proposal Generation Network for Temporal Sentence Localization
  in Videos
Adaptive Proposal Generation Network for Temporal Sentence Localization in Videos
Daizong Liu
Xiaoye Qu
Jianfeng Dong
Pan Zhou
265
67
0
14 Sep 2021
HANet: Hierarchical Alignment Networks for Video-Text Retrieval
HANet: Hierarchical Alignment Networks for Video-Text RetrievalACM Multimedia (ACM MM), 2021
Peng Wu
Xiangteng He
Mingqian Tang
Yiliang Lv
Jing Liu
254
71
0
26 Jul 2021
Interventional Video Grounding with Dual Contrastive Learning
Interventional Video Grounding with Dual Contrastive LearningComputer Vision and Pattern Recognition (CVPR), 2021
Guoshun Nan
Rui Qiao
Yao Xiao
Jun Liu
Sicong Leng
H. Zhang
Wei Lu
383
164
0
21 Jun 2021
Deconfounded Video Moment Retrieval with Causal Intervention
Deconfounded Video Moment Retrieval with Causal InterventionAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2021
Xun Yang
Fuli Feng
Wei Ji
Meng Wang
Tat-Seng Chua
CMLVGen
224
224
0
03 Jun 2021
Fine-Grained Fashion Similarity Prediction by Attribute-Specific
  Embedding Learning
Fine-Grained Fashion Similarity Prediction by Attribute-Specific Embedding LearningIEEE Transactions on Image Processing (TIP), 2021
Jianfeng Dong
Zhe Ma
Xiaofeng Mao
Xun Yang
Yuan He
Richang Hong
S. Ji
OOD
339
46
0
06 Apr 2021
Neural ranking models for document retrieval
Neural ranking models for document retrieval
M. Trabelsi
Zhiyu Zoey Chen
Brian D. Davison
J. Heflin
FedML
252
39
0
23 Feb 2021
Hierarchical Similarity Learning for Language-based Product Image
  Retrieval
Hierarchical Similarity Learning for Language-based Product Image RetrievalIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Zhe Ma
Fenghao Liu
Jianfeng Dong
Xiaoye Qu
Yuan He
S. Ji
VLM
187
7
0
18 Feb 2021
Progressive Localization Networks for Language-based Moment Localization
Progressive Localization Networks for Language-based Moment Localization
Qi Zheng
Jianfeng Dong
Xiaoye Qu
Xun Yang
Yabing Wang
Pan Zhou
Baolong Liu
Xun Wang
325
40
0
02 Feb 2021
SEA: Sentence Encoder Assembly for Video Retrieval by Textual Queries
SEA: Sentence Encoder Assembly for Video Retrieval by Textual QueriesIEEE transactions on multimedia (TMM), 2020
Xirong Li
Fangming Zhou
Chaoxi Xu
Jiaqi Ji
Gang Yang
244
62
0
24 Nov 2020
Dual Encoding for Video Retrieval by Text
Dual Encoding for Video Retrieval by Text
Jianfeng Dong
Xirong Li
Chaoxi Xu
Xun Yang
Gang Yang
Xun Wang
Meng Wang
348
2
0
10 Sep 2020
1
Page 1 of 1