1803.11438
Reconstruction Network for Video Captioning
30 March 2018
Bairui Wang
Lin Ma
Wei Zhang
W. Liu
Papers citing
"Reconstruction Network for Video Captioning"
35 / 135 papers shown
Title
Hierarchical Memory Decoding for Video Captioning
Aming Wu
Yahong Han
14
2
0
27 Feb 2020
Object Relational Graph with Teacher-Recommended Learning for Video Captioning
Ziqi Zhang
Yaya Shi
Chunfeng Yuan
Bing Li
Peijin Wang
Weiming Hu
Zhengjun Zha
VLM
18
271
0
26 Feb 2020
Meaning guided video captioning
Rushi J. Babariya
Toru Tamaki
16
3
0
12 Dec 2019
Non-Autoregressive Coarse-to-Fine Video Captioning
Bang-ju Yang
Yuexian Zou
Fenglin Liu
Can Zhang
11
11
0
27 Nov 2019
Video Captioning with Text-based Dynamic Attention and Step-by-Step Learning
Huanhou Xiao
Jinglun Shi
6
24
0
05 Nov 2019
Low-Rank HOCA: Efficient High-Order Cross-Modal Attention for Video Captioning
Tao Jin
Siyu Huang
Yingming Li
Zhongfei Zhang
14
20
0
01 Nov 2019
Diverse Video Captioning Through Latent Variable Expansion
Huanhou Xiao
Jinglun Shi
DiffM
24
15
0
26 Oct 2019
Label-Conditioned Next-Frame Video Generation with Neural Flows
Sergey Tarasenko
VGen
21
1
0
16 Oct 2019
Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
Bairui Wang
Lin Ma
Wei Zhang
Wenhao Jiang
Jingwen Wang
Wei Liu
66
163
0
27 Aug 2019
SF-Net: Structured Feature Network for Continuous Sign Language Recognition
Zhaoyang Yang
Zhenmei Shi
Xiaoyong Shen
Yu-Wing Tai
SLR
27
63
0
04 Aug 2019
Learning Visual Actions Using Multiple Verb-Only Labels
Michael Wray
Dima Damen
17
7
0
25 Jul 2019
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
15
132
0
22 Jul 2019
Watch It Twice: Video Captioning with a Refocused Video Encoder
Xiangxi Shi
Jianfei Cai
Shafiq R. Joty
Jiuxiang Gu
6
29
0
21 Jul 2019
Object-aware Aggregation with Bidirectional Temporal Graph for Video Captioning
Junchao Zhang
Yuxin Peng
8
170
0
11 Jun 2019
Weakly-Supervised Spatio-Temporally Grounding Natural Sentence in Video
Zhenfang Chen
Lin Ma
Wenhan Luo
Kwan-Yee Kenneth Wong
15
101
0
06 Jun 2019
Reconstruct and Represent Video Contents for Captioning via Reinforcement Learning
Wei Zhang
Bairui Wang
Lin Ma
Wei Liu
15
67
0
03 Jun 2019
Learning to Generate Grounded Visual Captions without Localization Supervision
Chih-Yao Ma
Yannis Kalantidis
Ghassan AlRegib
Peter Vajda
Marcus Rohrbach
Z. Kira
SSL
8
10
0
01 Jun 2019
Hallucinating Optical Flow Features for Video Classification
Yongyi Tang
Lin Ma
Lianqiang Zhou
11
19
0
28 May 2019
Memory-Attended Recurrent Network for Video Captioning
Wenjie Pei
Jiyuan Zhang
Xiangrong Wang
Lei Ke
Xiaoyong Shen
Yu-Wing Tai
9
200
0
10 May 2019
Spatio-temporal Video Re-localization by Warp LSTM
Yang Feng
Lin Ma
Wei Liu
Jiebo Luo
16
38
0
10 May 2019
Streamlined Dense Video Captioning
Jonghwan Mun
L. Yang
Zhou Ren
N. Xu
Bohyung Han
14
136
0
08 Apr 2019
Self-supervised Spatio-temporal Representation Learning for Videos by Predicting Motion and Appearance Statistics
Jiangliu Wang
Jianbo Jiao
Linchao Bao
Shengfeng He
Yunhui Liu
W. Liu
SSL
13
204
0
07 Apr 2019
End-to-End Video Captioning
Silvio Olivastri
Gurkirt Singh
Fabio Cuzzolin
16
18
0
04 Apr 2019
Spatio-Temporal Dynamics and Semantic Attribute Enriched Visual Encoding for Video Captioning
Nayyer Aafaq
Naveed Akhtar
W. Liu
Syed Zulqarnain Gilani
Ajmal Saeed Mian
18
204
0
27 Feb 2019
Hierarchical Photo-Scene Encoder for Album Storytelling
Bairui Wang
Lin Ma
Wei Zhang
Wenhao Jiang
Feng-Li Zhang
11
28
0
02 Feb 2019
Adversarial Inference for Multi-Sentence Video Description
J. S. Park
Marcus Rohrbach
Trevor Darrell
Anna Rohrbach
14
79
0
13 Dec 2018
An Attempt towards Interpretable Audio-Visual Video Captioning
Yapeng Tian
Chenxiao Guan
Justin Goodman
Marc Moore
Chenliang Xu
22
20
0
07 Dec 2018
Multi-granularity Generator for Temporal Action Proposal
Yuan Liu
Lin Ma
Yifeng Zhang
W. Liu
Shih-Fu Chang
16
193
0
28 Nov 2018
Y^2Seq2Seq: Cross-Modal Representation Learning for 3D Shape and Text by Joint Reconstruction and Prediction of View and Word Sequences
Simon Denman
Mingyang Shang
Sabesan Sivapalan
Yu-Shen Liu
Matthias Zwicker
3DV
6
53
0
07 Nov 2018
Non-local NetVLAD Encoding for Video Classification
Yongyi Tang
Xing Zhang
Jingwen Wang
Shaoxiang Chen
Lin Ma
Yu-Gang Jiang
11
41
0
29 Sep 2018
Video Re-localization
Yang Feng
Lin Ma
W. Liu
Tong Zhang
Jiebo Luo
13
71
0
05 Aug 2018
Recurrent Fusion Network for Image Captioning
Wenhao Jiang
Lin Ma
Yu-Gang Jiang
W. Liu
Tong Zhang
ObjD
19
233
0
26 Jul 2018
Video Captioning with Boundary-aware Hierarchical Language Decoding and Joint Video Prediction
Xiangxi Shi
Jianfei Cai
Jiuxiang Gu
Shafiq R. Joty
8
18
0
08 Jul 2018
Video Description: A Survey of Methods, Datasets and Evaluation Metrics
Nayyer Aafaq
Ajmal Saeed Mian
W. Liu
Syed Zulqarnain Gilani
Mubarak Shah
6
91
0
01 Jun 2018
Less Is More: Picking Informative Frames for Video Captioning
Yangyu Chen
Shuhui Wang
W. Zhang
Qingming Huang
12
200
0
05 Mar 2018