ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.11438
  4. Cited By
Reconstruction Network for Video Captioning

Reconstruction Network for Video Captioning

30 March 2018
Bairui Wang
Lin Ma
Wei Zhang
W. Liu
ArXivPDFHTML

Papers citing "Reconstruction Network for Video Captioning"

35 / 135 papers shown
Title
Hierarchical Memory Decoding for Video Captioning
Hierarchical Memory Decoding for Video Captioning
Aming Wu
Yahong Han
14
2
0
27 Feb 2020
Object Relational Graph with Teacher-Recommended Learning for Video
  Captioning
Object Relational Graph with Teacher-Recommended Learning for Video Captioning
Ziqi Zhang
Yaya Shi
Chunfen Yuan
Bing Li
Peijin Wang
Weiming Hu
Zhengjun Zha
VLM
18
271
0
26 Feb 2020
Meaning guided video captioning
Meaning guided video captioning
Rushi J. Babariya
Toru Tamaki
16
3
0
12 Dec 2019
Non-Autoregressive Coarse-to-Fine Video Captioning
Non-Autoregressive Coarse-to-Fine Video Captioning
Bang-ju Yang
Yuexian Zou
Fenglin Liu
Can Zhang
11
11
0
27 Nov 2019
Video Captioning with Text-based Dynamic Attention and Step-by-Step
  Learning
Video Captioning with Text-based Dynamic Attention and Step-by-Step Learning
Huanhou Xiao
Jinglun Shi
6
24
0
05 Nov 2019
Low-Rank HOCA: Efficient High-Order Cross-Modal Attention for Video
  Captioning
Low-Rank HOCA: Efficient High-Order Cross-Modal Attention for Video Captioning
Tao Jin
Siyu Huang
Yingming Li
Zhongfei Zhang
14
20
0
01 Nov 2019
Diverse Video Captioning Through Latent Variable Expansion
Diverse Video Captioning Through Latent Variable Expansion
Huanhou Xiao
Jinglun Shi
DiffM
24
15
0
26 Oct 2019
Label-Conditioned Next-Frame Video Generation with Neural Flows
Label-Conditioned Next-Frame Video Generation with Neural Flows
Sergey Tarasenko
VGen
21
1
0
16 Oct 2019
Controllable Video Captioning with POS Sequence Guidance Based on Gated
  Fusion Network
Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
Bairui Wang
Lin Ma
Wei Zhang
Wenhao Jiang
Jingwen Wang
Wei Liu
66
163
0
27 Aug 2019
SF-Net: Structured Feature Network for Continuous Sign Language
  Recognition
SF-Net: Structured Feature Network for Continuous Sign Language Recognition
Zhaoyang Yang
Zhenmei Shi
Xiaoyong Shen
Yu-Wing Tai
SLR
27
63
0
04 Aug 2019
Learning Visual Actions Using Multiple Verb-Only Labels
Learning Visual Actions Using Multiple Verb-Only Labels
Michael Wray
Dima Damen
17
7
0
25 Jul 2019
Trends in Integration of Vision and Language Research: A Survey of
  Tasks, Datasets, and Methods
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
15
132
0
22 Jul 2019
Watch It Twice: Video Captioning with a Refocused Video Encoder
Watch It Twice: Video Captioning with a Refocused Video Encoder
Xiangxi Shi
Jianfei Cai
Shafiq R. Joty
Jiuxiang Gu
6
29
0
21 Jul 2019
Object-aware Aggregation with Bidirectional Temporal Graph for Video
  Captioning
Object-aware Aggregation with Bidirectional Temporal Graph for Video Captioning
Junchao Zhang
Yuxin Peng
8
170
0
11 Jun 2019
Weakly-Supervised Spatio-Temporally Grounding Natural Sentence in Video
Weakly-Supervised Spatio-Temporally Grounding Natural Sentence in Video
Zhenfang Chen
Lin Ma
Wenhan Luo
Kwan-Yee Kenneth Wong
15
101
0
06 Jun 2019
Reconstruct and Represent Video Contents for Captioning via
  Reinforcement Learning
Reconstruct and Represent Video Contents for Captioning via Reinforcement Learning
Wei Zhang
Bairui Wang
Lin Ma
Wei Liu
15
67
0
03 Jun 2019
Learning to Generate Grounded Visual Captions without Localization
  Supervision
Learning to Generate Grounded Visual Captions without Localization Supervision
Chih-Yao Ma
Yannis Kalantidis
Ghassan AlRegib
Peter Vajda
Marcus Rohrbach
Z. Kira
SSL
8
10
0
01 Jun 2019
Hallucinating Optical Flow Features for Video Classification
Hallucinating Optical Flow Features for Video Classification
Yongyi Tang
Lin Ma
Lianqiang Zhou
11
19
0
28 May 2019
Memory-Attended Recurrent Network for Video Captioning
Memory-Attended Recurrent Network for Video Captioning
Wenjie Pei
Jiyuan Zhang
Xiangrong Wang
Lei Ke
Xiaoyong Shen
Yu-Wing Tai
9
200
0
10 May 2019
Spatio-temporal Video Re-localization by Warp LSTM
Spatio-temporal Video Re-localization by Warp LSTM
Yang Feng
Lin Ma
Wei Liu
Jiebo Luo
16
38
0
10 May 2019
Streamlined Dense Video Captioning
Streamlined Dense Video Captioning
Jonghwan Mun
L. Yang
Zhou Ren
N. Xu
Bohyung Han
14
136
0
08 Apr 2019
Self-supervised Spatio-temporal Representation Learning for Videos by
  Predicting Motion and Appearance Statistics
Self-supervised Spatio-temporal Representation Learning for Videos by Predicting Motion and Appearance Statistics
Jiangliu Wang
Jianbo Jiao
Linchao Bao
Shengfeng He
Yunhui Liu
W. Liu
SSL
13
204
0
07 Apr 2019
End-to-End Video Captioning
End-to-End Video Captioning
Silvio Olivastri
Gurkirt Singh
Fabio Cuzzolin
16
18
0
04 Apr 2019
Spatio-Temporal Dynamics and Semantic Attribute Enriched Visual Encoding
  for Video Captioning
Spatio-Temporal Dynamics and Semantic Attribute Enriched Visual Encoding for Video Captioning
Nayyer Aafaq
Naveed Akhtar
W. Liu
Syed Zulqarnain Gilani
Ajmal Saeed Mian
18
204
0
27 Feb 2019
Hierarchical Photo-Scene Encoder for Album Storytelling
Hierarchical Photo-Scene Encoder for Album Storytelling
Bairui Wang
Lin Ma
Wei Zhang
Wenhao Jiang
Feng-Li Zhang
11
28
0
02 Feb 2019
Adversarial Inference for Multi-Sentence Video Description
Adversarial Inference for Multi-Sentence Video Description
J. S. Park
Marcus Rohrbach
Trevor Darrell
Anna Rohrbach
14
79
0
13 Dec 2018
An Attempt towards Interpretable Audio-Visual Video Captioning
An Attempt towards Interpretable Audio-Visual Video Captioning
Yapeng Tian
Chenxiao Guan
Justin Goodman
Marc Moore
Chenliang Xu
22
20
0
07 Dec 2018
Multi-granularity Generator for Temporal Action Proposal
Multi-granularity Generator for Temporal Action Proposal
Yuan Liu
Lin Ma
Yifeng Zhang
W. Liu
Shih-Fu Chang
16
193
0
28 Nov 2018
Y^2Seq2Seq: Cross-Modal Representation Learning for 3D Shape and Text by
  Joint Reconstruction and Prediction of View and Word Sequences
Y^2Seq2Seq: Cross-Modal Representation Learning for 3D Shape and Text by Joint Reconstruction and Prediction of View and Word Sequences
Simon Denman
Mingyang Shang
Sabesan Sivapalan
Yu-Shen Liu
Matthias Zwicker
3DV
6
53
0
07 Nov 2018
Non-local NetVLAD Encoding for Video Classification
Non-local NetVLAD Encoding for Video Classification
Yongyi Tang
Xing Zhang
Jingwen Wang
Shaoxiang Chen
Lin Ma
Yu-Gang Jiang
11
41
0
29 Sep 2018
Video Re-localization
Video Re-localization
Yang Feng
Lin Ma
W. Liu
Tong Zhang
Jiebo Luo
13
71
0
05 Aug 2018
Recurrent Fusion Network for Image Captioning
Recurrent Fusion Network for Image Captioning
Wenhao Jiang
Lin Ma
Yu-Gang Jiang
W. Liu
Tong Zhang
ObjD
19
233
0
26 Jul 2018
Video Captioning with Boundary-aware Hierarchical Language Decoding and
  Joint Video Prediction
Video Captioning with Boundary-aware Hierarchical Language Decoding and Joint Video Prediction
Xiangxi Shi
Jianfei Cai
Jiuxiang Gu
Shafiq R. Joty
8
18
0
08 Jul 2018
Video Description: A Survey of Methods, Datasets and Evaluation Metrics
Video Description: A Survey of Methods, Datasets and Evaluation Metrics
Nayyer Aafaq
Ajmal Saeed Mian
W. Liu
Syed Zulqarnain Gilani
Mubarak Shah
6
91
0
01 Jun 2018
Less Is More: Picking Informative Frames for Video Captioning
Less Is More: Picking Informative Frames for Video Captioning
Yangyu Chen
Shuhui Wang
W. Zhang
Qingming Huang
12
200
0
05 Mar 2018
Previous
123