ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1901.06829
  4. Cited By
Read, Watch, and Move: Reinforcement Learning for Temporally Grounding
  Natural Language Descriptions in Videos

Read, Watch, and Move: Reinforcement Learning for Temporally Grounding Natural Language Descriptions in Videos

21 January 2019
Dongliang He
Xiang Zhao
Jizhou Huang
Fu Li
Xiao-Chang Liu
Shilei Wen
ArXiv (abs)PDFHTML

Papers citing "Read, Watch, and Move: Reinforcement Learning for Temporally Grounding Natural Language Descriptions in Videos"

21 / 71 papers shown
Title
A Survey on Natural Language Video Localization
A Survey on Natural Language Video Localization
Xinfang Liu
Xiushan Nie
Zhifang Tan
Jie Guo
Yilong Yin
189
9
0
01 Apr 2021
Embracing Uncertainty: Decoupling and De-bias for Robust Temporal
  Grounding
Embracing Uncertainty: Decoupling and De-bias for Robust Temporal GroundingComputer Vision and Pattern Recognition (CVPR), 2021
Hao Zhou
Chongyang Zhang
Yan Luo
Yanjun Chen
Chuanping Hu
101
54
0
31 Mar 2021
Context-aware Biaffine Localizing Network for Temporal Sentence
  Grounding
Context-aware Biaffine Localizing Network for Temporal Sentence GroundingComputer Vision and Pattern Recognition (CVPR), 2021
Daizong Liu
Xiaoye Qu
Jianfeng Dong
Pan Zhou
Yu Cheng
Wei Wei
Zichuan Xu
Yulai Xie
117
172
0
22 Mar 2021
Boundary Proposal Network for Two-Stage Natural Language Video
  Localization
Boundary Proposal Network for Two-Stage Natural Language Video LocalizationAAAI Conference on Artificial Intelligence (AAAI), 2021
Shaoning Xiao
Long Chen
Songyang Zhang
Wei Ji
Jian Shao
Lu Ye
Jun Xiao
131
174
0
15 Mar 2021
Natural Language Video Localization: A Revisit in Span-based Question
  Answering Framework
Natural Language Video Localization: A Revisit in Span-based Question Answering FrameworkIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Hao Zhang
Aixin Sun
Wei Jing
Liangli Zhen
Qiufeng Wang
Rick Siow Mong Goh
293
101
0
26 Feb 2021
A Closer Look at Temporal Sentence Grounding in Videos: Dataset and
  Metric
A Closer Look at Temporal Sentence Grounding in Videos: Dataset and Metric
Yitian Yuan
Xiaohan Lan
Xin Wang
Long Chen
Zhi Wang
Wenwu Zhu
154
63
0
22 Jan 2021
Multi-Scale 2D Temporal Adjacent Networks for Moment Localization with
  Natural Language
Multi-Scale 2D Temporal Adjacent Networks for Moment Localization with Natural LanguageIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Songyang Zhang
Houwen Peng
Jianlong Fu
Yijuan Lu
Jiebo Luo
135
62
0
04 Dec 2020
Data-efficient Alignment of Multimodal Sequences by Aligning Gradient
  Updates and Internal Feature Distributions
Data-efficient Alignment of Multimodal Sequences by Aligning Gradient Updates and Internal Feature DistributionsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020
Jianan Wang
Boyang Albert Li
Xiangyu Fan
Jing-Hua Lin
Yanwei Fu
86
3
0
15 Nov 2020
Actor and Action Modular Network for Text-based Video Segmentation
Actor and Action Modular Network for Text-based Video SegmentationIEEE Transactions on Image Processing (TIP), 2020
Jianhua Yang
Yan Huang
K. Niu
Linjiang Huang
Zhanyu Ma
Liang Wang
144
12
0
02 Nov 2020
A Simple Yet Effective Method for Video Temporal Grounding with
  Cross-Modality Attention
A Simple Yet Effective Method for Video Temporal Grounding with Cross-Modality Attention
Binjie Zhang
Yu Li
Chun Yuan
D. Xu
Pin Jiang
Ying Shan
79
5
0
23 Sep 2020
Reinforcement Learning for Weakly Supervised Temporal Grounding of
  Natural Language in Untrimmed Videos
Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed VideosACM Multimedia (ACM MM), 2020
Jie Wu
Guanbin Li
Xiaoguang Han
Liang Lin
OffRLAI4TS
124
61
0
18 Sep 2020
Text-based Localization of Moments in a Video Corpus
Text-based Localization of Moments in a Video Corpus
Sudipta Paul
Niluthpol Chowdhury Mithun
Amit K. Roy-Chowdhury
83
20
0
20 Aug 2020
Regularized Two-Branch Proposal Networks for Weakly-Supervised Moment
  Retrieval in Videos
Regularized Two-Branch Proposal Networks for Weakly-Supervised Moment Retrieval in Videos
Zhu Zhang
Zhijie Lin
Zhou Zhao
Jieming Zhu
Xiuqiang He
132
76
0
19 Aug 2020
Language Guided Networks for Cross-modal Moment Retrieval
Language Guided Networks for Cross-modal Moment Retrieval
Kun Liu
Huadong Ma
Chuang Gan
59
2
0
18 Jun 2020
Span-based Localizing Network for Natural Language Video Localization
Span-based Localizing Network for Natural Language Video LocalizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Hao Zhang
Aixin Sun
Wei Jing
Qiufeng Wang
218
352
0
29 Apr 2020
Local-Global Video-Text Interactions for Temporal Grounding
Local-Global Video-Text Interactions for Temporal GroundingComputer Vision and Pattern Recognition (CVPR), 2020
Jonghwan Mun
Minsu Cho
Bohyung Han
137
307
0
16 Apr 2020
Dense Regression Network for Video Grounding
Dense Regression Network for Video GroundingComputer Vision and Pattern Recognition (CVPR), 2020
Runhao Zeng
Haoming Xu
Wenbing Huang
Peihao Chen
Zhuliang Yu
Chuang Gan
234
313
0
07 Apr 2020
Tree-Structured Policy based Progressive Reinforcement Learning for
  Temporally Language Grounding in Video
Tree-Structured Policy based Progressive Reinforcement Learning for Temporally Language Grounding in VideoAAAI Conference on Artificial Intelligence (AAAI), 2020
Jie Wu
Guanbin Li
Si Liu
Liang Lin
OffRL
112
113
0
18 Jan 2020
Exploiting Temporal Relationships in Video Moment Localization with
  Natural Language
Exploiting Temporal Relationships in Video Moment Localization with Natural LanguageACM Multimedia (ACM MM), 2019
Songyang Zhang
Jinsong Su
Jiebo Luo
100
77
0
11 Aug 2019
Multi-Agent Reinforcement Learning Based Frame Sampling for Effective
  Untrimmed Video Recognition
Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video RecognitionIEEE International Conference on Computer Vision (ICCV), 2019
Wenhao Wu
Dongliang He
Xiao Tan
Shifeng Chen
Shilei Wen
133
132
0
31 Jul 2019
Tripping through time: Efficient Localization of Activities in Videos
Tripping through time: Efficient Localization of Activities in Videos
Meera Hahn
Asim Kadav
James M. Rehg
H. Graf
261
90
0
22 Apr 2019
Previous
12