ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2001.09308
  4. Cited By
Look Closer to Ground Better: Weakly-Supervised Temporal Grounding of
  Sentence in Video

Look Closer to Ground Better: Weakly-Supervised Temporal Grounding of Sentence in Video

25 January 2020
Zhenfang Chen
Lin Ma
Tong Lu
Peng Tang
Kwan-Yee K. Wong
ArXiv (abs)PDFHTML

Papers citing "Look Closer to Ground Better: Weakly-Supervised Temporal Grounding of Sentence in Video"

29 / 29 papers shown
ResidualViT for Efficient Temporally Dense Video Encoding
ResidualViT for Efficient Temporally Dense Video Encoding
Mattia Soldan
Fabian Caba Heilbron
Bernard Ghanem
Josef Sivic
Bryan C. Russell
224
1
0
16 Sep 2025
Weakly Supervised Temporal Sentence Grounding via Positive Sample Mining
Weakly Supervised Temporal Sentence Grounding via Positive Sample Mining
Yi Liu
Han Zhang
Hongjie Zhang
Yuanmin Huang
Z. Ling
Yu Qiao
Limin Wang
Yun Wang
AI4TS
492
2
0
10 May 2025
Cross-modal Causal Relation Alignment for Video Question Grounding
Cross-modal Causal Relation Alignment for Video Question GroundingComputer Vision and Pattern Recognition (CVPR), 2025
Weixing Chen
Wenshu Fan
Binglin Chen
Jiandong Su
Yongsen Zheng
Guanbin Li
BDLVGenCML
343
12
0
05 Mar 2025
TimeRefine: Temporal Grounding with Time Refining Video LLM
TimeRefine: Temporal Grounding with Time Refining Video LLM
Xizi Wang
Feng Cheng
Ziyang Wang
Huiyu Wang
Md. Mohaiminul Islam
Lorenzo Torresani
Joey Tianyi Zhou
Gedas Bertasius
David J. Crandall
606
10
0
12 Dec 2024
Commonsense for Zero-Shot Natural Language Video Localization
Commonsense for Zero-Shot Natural Language Video LocalizationAAAI Conference on Artificial Intelligence (AAAI), 2023
Meghana Holla
Ismini Lourentzou
401
6
0
29 Dec 2023
Grounding-Prompter: Prompting LLM with Multimodal Information for
  Temporal Sentence Grounding in Long Videos
Grounding-Prompter: Prompting LLM with Multimodal Information for Temporal Sentence Grounding in Long Videos
Houlun Chen
Xin Wang
Hong Chen
Zihan Song
Jia Jia
Wenwu Zhu
LRM
278
19
0
28 Dec 2023
Multi-Modal Domain Adaptation Across Video Scenes for Temporal Video
  Grounding
Multi-Modal Domain Adaptation Across Video Scenes for Temporal Video Grounding
Haifeng Huang
Yang Zhao
Zehan Wang
Yan Xia
Zhou Zhao
303
1
0
21 Dec 2023
EtC: Temporal Boundary Expand then Clarify for Weakly Supervised Video
  Grounding with Multimodal Large Language Model
EtC: Temporal Boundary Expand then Clarify for Weakly Supervised Video Grounding with Multimodal Large Language ModelIEEE transactions on multimedia (IEEE TMM), 2023
Guozhang Li
Xinpeng Ding
De Cheng
Jie Li
Nannan Wang
Xinbo Gao
506
5
0
05 Dec 2023
Learning Temporal Sentence Grounding From Narrated EgoVideos
Learning Temporal Sentence Grounding From Narrated EgoVideosBritish Machine Vision Conference (BMVC), 2023
Kevin Flanagan
Dima Damen
Michael Wray
244
3
0
26 Oct 2023
SCANet: Scene Complexity Aware Network for Weakly-Supervised Video Moment Retrieval
SCANet: Scene Complexity Aware Network for Weakly-Supervised Video Moment RetrievalIEEE International Conference on Computer Vision (ICCV), 2023
Sunjae Yoon
Gwanhyeong Koo
Dahyun Kim
Changdong Yoo
427
22
0
08 Oct 2023
Counterfactual Cross-modality Reasoning for Weakly Supervised Video
  Moment Localization
Counterfactual Cross-modality Reasoning for Weakly Supervised Video Moment LocalizationACM Multimedia (ACM MM), 2023
Zezhong Lv
Fuchun Sun
Ji-Rong Wen
319
23
0
10 Aug 2023
D3G: Exploring Gaussian Prior for Temporal Sentence Grounding with
  Glance Annotation
D3G: Exploring Gaussian Prior for Temporal Sentence Grounding with Glance AnnotationIEEE International Conference on Computer Vision (ICCV), 2023
Hanjun Li
Xiujun Shu
Su He
Ruizhi Qiao
Wei Wen
Taian Guo
Bei Gan
Xing Sun
251
20
0
08 Aug 2023
Constraint and Union for Partially-Supervised Temporal Sentence
  Grounding
Constraint and Union for Partially-Supervised Temporal Sentence Grounding
Chen Ju
Haicheng Wang
Jinxian Liu
Chaofan Ma
Ya Zhang
Peisen Zhao
Jianlong Chang
Qi Tian
227
18
0
20 Feb 2023
Hypotheses Tree Building for One-Shot Temporal Sentence Localization
Hypotheses Tree Building for One-Shot Temporal Sentence LocalizationAAAI Conference on Artificial Intelligence (AAAI), 2023
Daizong Liu
Xiang Fang
Pan Zhou
Xing Di
Weining Lu
Yu Cheng
282
29
0
05 Jan 2023
Language-free Training for Zero-shot Video Grounding
Language-free Training for Zero-shot Video GroundingIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Dahye Kim
Jungin Park
Jiyoung Lee
S. Park
Kwanghoon Sohn
251
33
0
24 Oct 2022
Weakly-Supervised Temporal Article Grounding
Weakly-Supervised Temporal Article GroundingConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Long Chen
Yulei Niu
Brian Chen
Xudong Lin
G. Han
Christopher Thomas
Hammad A. Ayyubi
Heng Ji
Shih-Fu Chang
AI4TS
235
14
0
22 Oct 2022
Masked Motion Encoding for Self-Supervised Video Representation Learning
Masked Motion Encoding for Self-Supervised Video Representation LearningComputer Vision and Pattern Recognition (CVPR), 2022
Xinyu Sun
Peihao Chen
Liang-Chieh Chen
Chan Li
Thomas H. Li
Zhuliang Yu
Chuang Gan
402
47
0
12 Oct 2022
Dilated Context Integrated Network with Cross-Modal Consensus for
  Temporal Emotion Localization in Videos
Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in VideosACM Multimedia (ACM MM), 2022
Juncheng Billy Li
Junlin Xie
Linchao Zhu
Long Qian
Siliang Tang
...
Haochen Shi
Shengyu Zhang
Longhui Wei
Qi Tian
Yueting Zhuang
288
16
0
03 Aug 2022
Tragedy Plus Time: Capturing Unintended Human Activities from
  Weakly-labeled Videos
Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos
Arnav Chakravarthy
Zhiyuan Fang
Yezhou Yang
185
2
0
28 Apr 2022
Multi-Scale Self-Contrastive Learning with Hard Negative Mining for
  Weakly-Supervised Query-based Video Grounding
Multi-Scale Self-Contrastive Learning with Hard Negative Mining for Weakly-Supervised Query-based Video Grounding
Shentong Mo
Daizong Liu
Wei Hu
SSL
171
8
0
08 Mar 2022
Explore-And-Match: Bridging Proposal-Based and Proposal-Free With
  Transformer for Sentence Grounding in Videos
Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding in Videos
Sangmin Woo
Jinyoung Park
Inyong Koo
Sumin Lee
Minki Jeong
Changick Kim
506
6
0
25 Jan 2022
Temporal Sentence Grounding in Videos: A Survey and Future Directions
Temporal Sentence Grounding in Videos: A Survey and Future DirectionsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Hao Zhang
Aixin Sun
Wei Jing
Qiufeng Wang
3DGS
478
59
0
20 Jan 2022
A Survey on Temporal Sentence Grounding in Videos
A Survey on Temporal Sentence Grounding in Videos
Xiaohan Lan
Yitian Yuan
Xin Eric Wang
Zhi Wang
Wenwu Zhu
401
59
0
16 Sep 2021
Zero-shot Natural Language Video Localization
Zero-shot Natural Language Video LocalizationIEEE International Conference on Computer Vision (ICCV), 2021
Jinwoo Nam
Daechul Ahn
Luan Tuyen Chau
S. Ha
Jonghyun Choi
431
58
0
29 Aug 2021
COOT: Cooperative Hierarchical Transformer for Video-Text Representation
  Learning
COOT: Cooperative Hierarchical Transformer for Video-Text Representation LearningNeural Information Processing Systems (NeurIPS), 2020
Simon Ging
Mohammadreza Zolfaghari
Hamed Pirsiavash
Thomas Brox
ViTCLIP
279
178
0
01 Nov 2020
Regularized Two-Branch Proposal Networks for Weakly-Supervised Moment
  Retrieval in Videos
Regularized Two-Branch Proposal Networks for Weakly-Supervised Moment Retrieval in Videos
Zhu Zhang
Zhijie Lin
Zhou Zhao
Jieming Zhu
Xiuqiang He
187
78
0
19 Aug 2020
Weak Supervision and Referring Attention for Temporal-Textual
  Association Learning
Weak Supervision and Referring Attention for Temporal-Textual Association Learning
Zhiyuan Fang
Shu Kong
Zhe Wang
Charless C. Fowlkes
Yezhou Yang
169
20
0
21 Jun 2020
Weakly-Supervised Multi-Level Attentional Reconstruction Network for
  Grounding Textual Queries in Videos
Weakly-Supervised Multi-Level Attentional Reconstruction Network for Grounding Textual Queries in Videos
Yijun Song
Jingwen Wang
Lin Ma
Zhou Yu
Jun Yu
219
72
0
16 Mar 2020
Cops-Ref: A new Dataset and Task on Compositional Referring Expression
  Comprehension
Cops-Ref: A new Dataset and Task on Compositional Referring Expression ComprehensionComputer Vision and Pattern Recognition (CVPR), 2020
Zhenfang Chen
Peng Wang
Lin Ma
Kwan-Yee K. Wong
Qi Wu
ObjD
299
84
0
01 Mar 2020
1
Page 1 of 1