1901.06829
Read, Watch, and Move: Reinforcement Learning for Temporally Grounding Natural Language Descriptions in Videos
21 January 2019
Dongliang He
Xiang Zhao
Jizhou Huang
Fu Li
Xiao-Chang Liu
Shilei Wen
Papers citing
"Read, Watch, and Move: Reinforcement Learning for Temporally Grounding Natural Language Descriptions in Videos"
21 / 71 papers shown
Title
A Survey on Natural Language Video Localization
Xinfang Liu
Xiushan Nie
Zhifang Tan
Jie Guo
Yilong Yin
189
9
0
01 Apr 2021
Embracing Uncertainty: Decoupling and De-bias for Robust Temporal Grounding
Computer Vision and Pattern Recognition (CVPR), 2021
Hao Zhou
Chongyang Zhang
Yan Luo
Yanjun Chen
Chuanping Hu
101
54
0
31 Mar 2021
Context-aware Biaffine Localizing Network for Temporal Sentence Grounding
Computer Vision and Pattern Recognition (CVPR), 2021
Daizong Liu
Xiaoye Qu
Jianfeng Dong
Pan Zhou
Yu Cheng
Wei Wei
Zichuan Xu
Yulai Xie
117
172
0
22 Mar 2021
Boundary Proposal Network for Two-Stage Natural Language Video Localization
AAAI Conference on Artificial Intelligence (AAAI), 2021
Shaoning Xiao
Long Chen
Songyang Zhang
Wei Ji
Jian Shao
Lu Ye
Jun Xiao
131
174
0
15 Mar 2021
Natural Language Video Localization: A Revisit in Span-based Question Answering Framework
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Hao Zhang
Aixin Sun
Wei Jing
Liangli Zhen
Qiufeng Wang
Rick Siow Mong Goh
293
101
0
26 Feb 2021
A Closer Look at Temporal Sentence Grounding in Videos: Dataset and Metric
Yitian Yuan
Xiaohan Lan
Xin Wang
Long Chen
Zhi Wang
Wenwu Zhu
154
63
0
22 Jan 2021
Multi-Scale 2D Temporal Adjacent Networks for Moment Localization with Natural Language
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Songyang Zhang
Houwen Peng
Jianlong Fu
Yijuan Lu
Jiebo Luo
135
62
0
04 Dec 2020
Data-efficient Alignment of Multimodal Sequences by Aligning Gradient Updates and Internal Feature Distributions
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020
Jianan Wang
Boyang Albert Li
Xiangyu Fan
Jing-Hua Lin
Yanwei Fu
86
3
0
15 Nov 2020
Actor and Action Modular Network for Text-based Video Segmentation
IEEE Transactions on Image Processing (TIP), 2020
Jianhua Yang
Yan Huang
K. Niu
Linjiang Huang
Zhanyu Ma
Liang Wang
144
12
0
02 Nov 2020
A Simple Yet Effective Method for Video Temporal Grounding with Cross-Modality Attention
Binjie Zhang
Yu Li
Chun Yuan
D. Xu
Pin Jiang
Ying Shan
79
5
0
23 Sep 2020
Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed Videos
ACM Multimedia (ACM MM), 2020
Jie Wu
Guanbin Li
Xiaoguang Han
Liang Lin
OffRL
AI4TS
124
61
0
18 Sep 2020
Text-based Localization of Moments in a Video Corpus
Sudipta Paul
Niluthpol Chowdhury Mithun
Amit K. Roy-Chowdhury
83
20
0
20 Aug 2020
Regularized Two-Branch Proposal Networks for Weakly-Supervised Moment Retrieval in Videos
Zhu Zhang
Zhijie Lin
Zhou Zhao
Jieming Zhu
Xiuqiang He
132
76
0
19 Aug 2020
Language Guided Networks for Cross-modal Moment Retrieval
Kun Liu
Huadong Ma
Chuang Gan
59
2
0
18 Jun 2020
Span-based Localizing Network for Natural Language Video Localization
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Hao Zhang
Aixin Sun
Wei Jing
Qiufeng Wang
218
352
0
29 Apr 2020
Local-Global Video-Text Interactions for Temporal Grounding
Computer Vision and Pattern Recognition (CVPR), 2020
Jonghwan Mun
Minsu Cho
Bohyung Han
137
307
0
16 Apr 2020
Dense Regression Network for Video Grounding
Computer Vision and Pattern Recognition (CVPR), 2020
Runhao Zeng
Haoming Xu
Wenbing Huang
Peihao Chen
Zhuliang Yu
Chuang Gan
234
313
0
07 Apr 2020
Tree-Structured Policy based Progressive Reinforcement Learning for Temporally Language Grounding in Video
AAAI Conference on Artificial Intelligence (AAAI), 2020
Jie Wu
Guanbin Li
Si Liu
Liang Lin
OffRL
112
113
0
18 Jan 2020
Exploiting Temporal Relationships in Video Moment Localization with Natural Language
ACM Multimedia (ACM MM), 2019
Songyang Zhang
Jinsong Su
Jiebo Luo
100
77
0
11 Aug 2019
Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition
IEEE International Conference on Computer Vision (ICCV), 2019
Wenhao Wu
Dongliang He
Xiao Tan
Shifeng Chen
Shilei Wen
133
132
0
31 Jul 2019
Tripping through time: Efficient Localization of Activities in Videos
Meera Hahn
Asim Kadav
James M. Rehg
H. Graf
261
90
0
22 Apr 2019