TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval

European Conference on Computer Vision (ECCV), 2020
24 January 2020
Jie Lei
Licheng Yu
Tamara L. Berg
Joey Tianyi Zhou

Papers citing "TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval"

50 / 185 papers shown
IVCR-200K: A Large-Scale Multi-turn Dialogue Benchmark for Interactive Video Corpus Retrieval
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Ning Han
Yawen Zeng
Shaohua Long
Chengqing Li
Sijie Yang
Dun Tan
Jianfeng Dong
Jingjing Chen
VGen
195
2
0
01 Dec 2025
SilhouetteTell: Practical Video Identification Leveraging Blurred Recordings of Video Subtitles
Guanchong Huang
Song Fang
142
0
0
31 Oct 2025
Mitigating Semantic Collapse in Partially Relevant Video Retrieval
WonJun Moon
MinSeok Jung
Gilhan Park
Tae-Young Kim
Cheol-Ho Cho
Woojin Jun
Jae-Pil Heo
211
2
0
31 Oct 2025
When One Moment Isn't Enough: Multi-Moment Retrieval with Cross-Moment Interactions
Zhuo Cao
Heming Du
Bingqing Zhang
Xin Yu
Xue Li
Sen Wang
166
1
0
20 Oct 2025
Dual Learning with Dynamic Knowledge Distillation and Soft Alignment for Partially Relevant Video Retrieval
Jianfeng Dong
Lei Huang
Daizong Liu
Xianke Chen
Xun Yang
Changting Lin
Xun Wang
Meng Wang
173
0
0
14 Oct 2025
Enhancing Partially Relevant Video Retrieval with Robust Alignment Learning
Long Zhang
Peipei Song
Jianfeng Dong
Kun Li
Xun Yang
248
5
0
01 Sep 2025
ProPy: Building Interactive Prompt Pyramids upon CLIP for Partially Relevant Video Retrieval
Yi Pan
Yujia Zhang
Michael C. Kampffmeyer
Xiaoguang Zhao
178
0
0
26 Aug 2025
Aligning Moments in Time using Video Queries
Yogesh Kumar
Uday Agarwal
Manish Gupta
Anand Mishra
362
1
0
21 Aug 2025
Denoise-then-Retrieve: Text-Conditioned Video Denoising for Video Moment Retrieval
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Weijia Liu
Jiuxin Cao
Bo Miao
Zhiheng Fu
Xuelin Zhu
Jiawei Ge
Bo Liu
Mehwish Nasim
Lin Wang
DiffM
VGen
198
0
0
15 Aug 2025
HLFormer: Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning
Jun Li
Jinpeng Wang
Chaolei Tan
Niu Lian
Long Chen
Yaowei Wang
Min Zhang
Shu-Tao Xia
Bin Chen
328
7
0
23 Jul 2025
Can Video Large Multimodal Models Think Like Doubters-or Double-Down: A Study on Defeasible Video Entailment
Yue Zhang
Jilei Sun
Yunhui Guo
Vibhav Gogate
LRM
258
2
0
27 Jun 2025
MLVTG: Mamba-Based Feature Alignment and LLM-Driven Purification for Multi-Modal Video Temporal Grounding
Zhiyi Zhu
Xiaoyu Wu
Zihao Liu
Linlin Yang
295
0
0
10 Jun 2025
Ambiguity-Restrained Text-Video Representation Learning for Partially Relevant Video Retrieval
AAAI Conference on Artificial Intelligence (AAAI), 2025
CH Cho
WJ Moon
W Jun
MS Jung
JP Heo
216
9
0
09 Jun 2025
MamFusion: Multi-Mamba with Temporal Fusion for Partially Relevant Video Retrieval
Xinru Ying
Jiaqi Mo
Jingyang Lin
Canghong Jin
Fangfang Wang
Lina Wei
247
0
0
04 Jun 2025
Uneven Event Modeling for Partially Relevant Video Retrieval
Sa Zhu
Huashan Chen
Wanqian Zhang
Jinchao Zhang
Zexian Yang
Xiaoshuai Hao
Bo Li
318
3
0
01 Jun 2025
MaCP: Minimal yet Mighty Adaptation via Hierarchical Cosine Projection
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Yixian Shen
Qi Bi
Jia-Hong Huang
Hongyi Zhu
Andy D. Pimentel
Anuj Pathania
464
4
0
29 May 2025
Robust Relevance Feedback for Interactive Known-Item Video Search
International Conference on Multimedia Retrieval (ICMR), 2025
Zhixin Ma
Chong-Wah Ngo
210
0
0
21 May 2025
Enhanced Partially Relevant Video Retrieval through Inter- and Intra-Sample Analysis with Coherence Prediction
Junlong Ren
Gangjian Zhang
Yitao Hu
Jian Shu
Hui Xiong
508
2
0
28 Apr 2025
Prototypes are Balanced Units for Efficient and Effective Partially Relevant Video Retrieval
WonJun Moon
Cheol-Ho Cho
Woojin Jun
Minho Shim
Taeoh Kim
Inwoong Lee
Dongyoon Wee
Jae-Pil Heo
382
4
0
17 Apr 2025
Towards Efficient Partially Relevant Video Retrieval with Active Moment Discovering
IEEE Transactions on Multimedia (TMM), 2025
Peipei Song
Li Zhang
Long Lan
Weidong Chen
D. Guo
Xun Yang
Meng Wang
255
12
0
15 Apr 2025
Towards Efficient and Robust Moment Retrieval System: A Unified Framework for Multi-Granularity Models and Temporal Reranking
H. Tran
Tinh-Anh Nguyen-Nhu
Huu-Phong Phan-Nguyen
T. Nguyen
Nhat-Minh Nguyen-Dich
Anh Dao
Huy-Duc Do
Quan Nguyen
Hoang M. Le
Quang-Vinh Dinh
279
3
0
11 Apr 2025
SVLTA: Benchmarking Vision-Language Temporal Alignment via Synthetic Video Situation
Computer Vision and Pattern Recognition (CVPR), 2025
Hao Du
Bo Wu
Yan Lu
Zhendong Mao
276
2
0
08 Apr 2025
Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation
Junyu Xie
Tengda Han
Max Bain
Arsha Nagrani
Eshika Khandelwal
Gül Varol
Weidi Xie
Andrew Zisserman
DiffM
VGen
490
6
0
01 Apr 2025
Fair Dynamic Spectrum Access via Fully Decentralized Multi-Agent Reinforcement Learning
International Symposium on Modeling and Optimization in Mobile, Ad-Hoc and Wireless Networks (WiOpt), 2025
Yubo Zhang
Pedro Botelho
Trevor Gordon
Gil Zussman
I. Kadota
340
1
0
31 Mar 2025
MMMORRF: Multimodal Multilingual Modularized Reciprocal Rank Fusion
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Saron Samuel
Dan DeGenaro
Jimena Guallar-Blasco
Kate Sanders
Oluwaseun Eisape
...
David Etter
Efsun Kayi
Matthew Wiesner
Kenton W. Murray
Reno Kriz
518
6
0
26 Mar 2025
4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object Understanding
Wenxuan Zhu
Bing Li
Cheng Zheng
Jinjie Mai
Jun-Cheng Chen
...
Abdullah Hamdi
Sara Rojas Martinez
Chia-Wen Lin
Mohamed Elhoseiny
Bernard Ghanem
VLM
362
3
0
22 Mar 2025
VideoMind: A Chain-of-LoRA Agent for Temporal-Grounded Video Reasoning
Wenshu Fan
Kevin Qinghong Lin
C. Chen
Mike Zheng Shou
LM&Ro
LRM
1.1K
41
0
17 Mar 2025
OmniSTVG: Toward Spatio-Temporal Omni-Object Video Grounding
Jiali Yao
Xinran Deng
Xin Gu
Mengrui Dai
Bing Fan
Zhipeng Zhang
Yan Huang
Heng Fan
L. Zhang
448
5
0
13 Mar 2025
Towards Fine-Grained Video Question Answering
Wei Dai
Alan Luo
Zane Durante
Debadutta Dash
Arnold Milstein
Kevin Schulman
Ehsan Adeli
L. Fei-Fei
329
1
0
10 Mar 2025
Towards Data-Efficient Language Models: A Child-Inspired Approach to Language Learning
Mohammad Amin Ghanizadeh
Mohammad Javad Dousti
235
8
0
06 Mar 2025
MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval
AAAI Conference on Artificial Intelligence (AAAI), 2024
Haoran Tang
Meng Cao
Jinfa Huang
Ruyang Liu
Peng Jin
Ge Li
Xiaodan Liang
Mamba
417
10
0
24 Feb 2025
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection
Pengcheng Zhao
Zhixian He
Fuwei Zhang
Shujin Lin
Fan Zhou
413
5
0
18 Jan 2025
Audio-Language Datasets of Scenes and Events: A Survey
IEEE Access, 2024
Gijs Wijngaard
Elia Formisano
Michele Esposito
M. Dumontier
626
10
0
10 Jan 2025
Detection, Retrieval, and Explanation Unified: A Violence Detection System Based on Knowledge Graphs and GAT
Wen-Dong Jiang
Chih-Yung Chang
Diptendu Sinha Roy
638
2
0
07 Jan 2025
Query-centric Audio-Visual Cognition Network for Moment Retrieval, Segmentation and Step-Captioning
AAAI Conference on Artificial Intelligence (AAAI), 2024
Yunbin Tu
Liang-Sheng Li
Li Su
Qingming Huang
355
1
0
18 Dec 2024
Do Language Models Understand Time?
The Web Conference (WWW), 2024
Xi Ding
Lei Wang
1.0K
13
0
18 Dec 2024
VideoLights: Feature Refinement and Cross-Task Alignment Transformer for Joint Video Highlight Detection and Moment Retrieval
Dhiman Paul
Md Rizwan Parvez
Nabeel Mohammed
Shafin Rahman
VGen
406
5
0
02 Dec 2024
Dual-task Mutual Reinforcing Embedded Joint Video Paragraph Retrieval and Grounding
IEEE Transactions on Multimedia (IEEE TMM), 2024
Ming Wang
Huafeng Li
Yafei Zhang
Jinxing Li
Minghong Xie
Dapeng Tao
AI4TS
294
5
0
26 Nov 2024
Grounded Video Caption Generation
Evangelos Kazakos
Cordelia Schmid
Josef Sivic
328
0
0
12 Nov 2024
Let Me Finish My Sentence: Video Temporal Grounding with Holistic Text Understanding
ACM Multimedia (MM), 2024
Jongbhin Woo
H. Ryu
Youngjoon Jang
Jae-Won Cho
Joon Son Chung
262
5
0
17 Oct 2024
MultiVENT 2.0: A Massive Multilingual Benchmark for Event-Centric Video Retrieval
Computer Vision and Pattern Recognition (CVPR), 2024
Reno Kriz
Kate Sanders
David Etter
Kenton W. Murray
Cameron Carpenter
...
Alexander Martin
Ronald Colaianni
Nolan King
Eugene Yang
Benjamin Van Durme
VGen
599
10
0
15 Oct 2024
Audio Description Generation in the Era of LLMs and VLMs: A Review of Transferable Generative AI Technologies
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Yingqiang Gao
Lukas Fischer
Alexa Lintner
Sarah Ebling
304
8
0
11 Oct 2024
VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understanding
Neural Information Processing Systems (NeurIPS), 2024
Houlun Chen
Xin Wang
Hong Chen
Zeyang Zhang
Wei Feng
Bin Huang
Jia Jia
Wenwu Zhu
VGen
388
10
0
11 Oct 2024
Realizing Video Summarization from the Path of Language-based Semantic Understanding
Kuan-Chen Mu
Zhi-Yi Chin
Wei-Chen Chiu
216
1
0
06 Oct 2024
Language-based Audio Moment Retrieval
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Hokuto Munakata
Taichi Nishimura
Shota Nakada
Tatsuya Komatsu
607
3
0
24 Sep 2024
Beyond Uncertainty: Evidential Deep Learning for Robust Video Temporal Grounding
Kaijing Ma
Haojian Huang
Jin Chen
Haodong Chen
Pengliang Ji
...
Han Fang
Chao Ban
Hao Sun
Mulin Chen
Xuelong Li
284
11
0
29 Aug 2024
QD-VMR: Query Debiasing with Contextual Understanding Enhancement for Video Moment Retrieval
Chenghua Gao
Min Li
Jianshuo Liu
Junxing Ren
Lin Chen
Haoyu Liu
Bo Meng
Jitao Fu
Wenwen Su
192
2
0
23 Aug 2024
Disentangle and denoise: Tackling context misalignment for video moment retrieval
Kaijing Ma
Han Fang
Xianghao Zang
Chao Ban
Lanxiang Zhou
Zhongjiang He
Yongxiang Li
Hao Sun
Zerun Feng
Xingsong Hou
272
2
0
14 Aug 2024
ActPrompt: In-Domain Feature Adaptation via Action Cues for Video Temporal Grounding
Yubin Wang
Xinyang Jiang
De Cheng
Dongsheng Li
Cairong Zhao
VLM
279
2
0
13 Aug 2024
SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses
ACM Multimedia (MM), 2024
Chaolei Tan
Zihang Lin
Junfu Pu
Chen Ma
Wei-Yi Pei
Zhi Qu
Yexin Wang
Ying Shan
Wei-Shi Zheng
Jianfang Hu
AI4TS
478
3
0
03 Aug 2024