1611.07675
Video Captioning with Transferred Semantic Attributes
23 November 2016
Yingwei Pan
Ting Yao
Houqiang Li
Tao Mei
Papers citing "Video Captioning with Transferred Semantic Attributes"
50 / 115 papers shown
Learning to Discretely Compose Reasoning Module Networks for Video Captioning
International Joint Conference on Artificial Intelligence (IJCAI), 2020
Ganchao Tan
Daqing Liu
Meng Wang
Zhengjun Zha
LRM
236
78
0
17 Jul 2020
Bifurcated backbone strategy for RGB-D salient object detection
Yingjie Zhai
Deng-Ping Fan
Jufeng Yang
Ali Borji
Ling Shao
Junwei Han
Liang Wang
248
139
0
06 Jul 2020
Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training
Yingwei Pan
Yehao Li
Jianjie Luo
Jun Xu
Ting Yao
Tao Mei
210
61
0
05 Jul 2020
A Transformer-based Audio Captioning Model with Keyword Estimation
Yuma Koizumi
Ryo Masumura
Kyosuke Nishida
Masahiro Yasuda
Shoichiro Saito
295
56
0
01 Jul 2020
SACT: Self-Aware Multi-Space Feature Composition Transformer for Multinomial Attention for Video Captioning
C. Sur
129
7
0
25 Jun 2020
Language Guided Networks for Cross-modal Moment Retrieval
Kun Liu
Huadong Ma
Chuang Gan
149
2
0
18 Jun 2020
A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer
Vladimir E. Iashin
Esa Rahtu
224
128
0
17 May 2020
Consistent Multiple Sequence Decoding
Bicheng Xu
Leonid Sigal
171
0
0
02 Apr 2020
Detection and Description of Change in Visual Streams
Davis Gilton
Ruotian Luo
Rebecca Willett
Gregory Shakhnarovich
AI4TS
181
4
0
27 Mar 2020
Multi-modal Dense Video Captioning
Vladimir E. Iashin
Esa Rahtu
325
199
0
17 Mar 2020
Video Caption Dataset for Describing Human Actions in Japanese
International Conference on Language Resources and Evaluation (LREC), 2020
Yutaro Shigeto
Yuya Yoshikawa
Jiaqing Lin
A. Takeuchi
110
3
0
10 Mar 2020
Better Captioning with Sequence-Level Exploration
Computer Vision and Pattern Recognition (CVPR), 2020
Jia Chen
Qin Jin
143
12
0
08 Mar 2020
On the Evaluation of Intelligent Process Automation
AAAI Conference on Artificial Intelligence (AAAI), 2019
Deborah Ferreira
Julia Rozanova
K. Dubba
Dell Zhang
André Freitas
178
11
0
08 Jan 2020
Vision and Language: from Visual Perception to Content Creation
APSIPA Transactions on Signal and Information Processing (APSIPA TSIP), 2019
Tao Mei
Wei Zhang
Ting Yao
VLM
182
8
0
26 Dec 2019
Action Modifiers: Learning from Adverbs in Instructional Videos
Computer Vision and Pattern Recognition (CVPR), 2019
Hazel Doughty
Ivan Laptev
W. Mayol-Cuevas
Dima Damen
343
38
0
13 Dec 2019
Non-Autoregressive Coarse-to-Fine Video Captioning
Bang-ju Yang
Yuexian Zou
Fenglin Liu
Can Zhang
437
11
0
27 Nov 2019
Characterizing the impact of using features extracted from pre-trained models on the quality of video captioning sequence-to-sequence models
International Conferences on Pattern Recognition and Artificial Intelligence (ICCPRAI), 2019
Menatallh Hammad
May Hammad
Mohamed Elshenawy
105
2
0
22 Nov 2019
Empirical Autopsy of Deep Video Captioning Frameworks
Nayyer Aafaq
Naveed Akhtar
Wei Liu
Lin Wang
119
6
0
21 Nov 2019
Low-Rank HOCA: Efficient High-Order Cross-Modal Attention for Video Captioning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Tao Jin
Siyu Huang
Yingming Li
Zhongfei Zhang
205
21
0
01 Nov 2019
Diverse Video Captioning Through Latent Variable Expansion
Pattern Recognition Letters (PR), 2019
Huanhou Xiao
Jinglun Shi
DiffM
317
15
0
26 Oct 2019
ViP: Video Platform for PyTorch
Madan Ravi Ganesh
Eric Hofesmann
Nathan Louis
Jason J. Corso
ViT
103
0
0
07 Oct 2019
Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
IEEE International Conference on Computer Vision (ICCV), 2019
Bairui Wang
Lin Ma
Wei Zhang
Wenhao Jiang
Jingwen Wang
Wei Liu
227
177
0
27 Aug 2019
Mocycle-GAN: Unpaired Video-to-Video Translation
ACM Multimedia (ACM MM), 2019
Yang Chen
Yingwei Pan
Ting Yao
Xinmei Tian
Tao Mei
GAN
177
95
0
26 Aug 2019
3-D Scene Graph: A Sparse and Semantic Representation of Physical Environments for Intelligent Agents
IEEE Transactions on Cybernetics (IEEE Trans. Cybern.), 2019
Ue-Hwan Kim
Jin-Man Park
Taek-jin Song
Jong-hwan Kim
3DV
173
124
0
14 Aug 2019
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Journal of Artificial Intelligence Research (JAIR), 2019
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
407
142
0
22 Jul 2019
Trimmed Action Recognition, Dense-Captioning Events in Videos, and Spatio-temporal Action Localization with Focus on ActivityNet Challenge 2019
Zhaofan Qiu
Dong Li
Yehao Li
Qi Cai
Yingwei Pan
Ting Yao
127
8
0
14 Jun 2019
Attention is all you need for Videos: Self-attention based Video Summarization using Universal Transformers
Manjot Bilkhu
Siyang Wang
Tushar Dobhal
ViT
98
18
0
06 Jun 2019
Reconstruct and Represent Video Contents for Captioning via Reinforcement Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019
Wei Zhang
Bairui Wang
Lin Ma
Wei Liu
184
72
0
03 Jun 2019
Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations
Neural Information Processing Systems (NeurIPS), 2019
Fenglin Liu
Yuanxin Liu
Xuancheng Ren
Xiaodong He
Xu Sun
VLM
228
90
0
15 May 2019
Multimodal Semantic Attention Network for Video Captioning
IEEE International Conference on Multimedia and Expo (ICME), 2019
Liang Sun
Bing Li
Chunfen Yuan
Zhengjun Zha
Weiming Hu
166
11
0
08 May 2019
Temporal Deformable Convolutional Encoder-Decoder Networks for Video Captioning
AAAI Conference on Artificial Intelligence (AAAI), 2019
Jingwen Chen
Yingwei Pan
Yehao Li
Ting Yao
Hongyang Chao
Tao Mei
177
104
0
03 May 2019
Pointing Novel Objects in Image Captioning
Yehao Li
Ting Yao
Yingwei Pan
Hongyang Chao
Tao Mei
197
73
0
25 Apr 2019
Streamlined Dense Video Captioning
Jonghwan Mun
L. Yang
Zhou Ren
N. Xu
Bohyung Han
256
160
1
08 Apr 2019
Snap and Find: Deep Discrete Cross-domain Garment Image Retrieval
Yadan Luo
Ziwei Wang
Zi Huang
Yang Yang
Huimin Lu
109
7
0
05 Apr 2019
End-to-End Video Captioning
Silvio Olivastri
Gurkirt Singh
Fabio Cuzzolin
146
20
0
04 Apr 2019
Scene Understanding for Autonomous Manipulation with Deep Learning
A. Nguyen
135
6
0
23 Mar 2019
V2CNet: A Deep Learning Framework to Translate Videos to Commands for Robotic Manipulation
A. Nguyen
Thanh-Toan Do
Ian Reid
D. Caldwell
Nikos G. Tsagarakis
119
21
0
23 Mar 2019
Spatio-Temporal Dynamics and Semantic Attribute Enriched Visual Encoding for Video Captioning
Computer Vision and Pattern Recognition (CVPR), 2019
Nayyer Aafaq
Naveed Akhtar
Wen Liu
Syed Zulqarnain Gilani
Lin Wang
212
220
0
27 Feb 2019
Hierarchical Photo-Scene Encoder for Album Storytelling
AAAI Conference on Artificial Intelligence (AAAI), 2019
Bairui Wang
Lin Ma
Wei Zhang
Wenhao Jiang
Feng-Li Zhang
150
29
0
02 Feb 2019
Not All Words are Equal: Video-specific Information Loss for Video Captioning
Jiarong Dong
Ke Gao
Xiaokai Chen
Junbo Guo
Juan Cao
Yongdong Zhang
122
8
0
01 Jan 2019
DART: Domain-Adversarial Residual-Transfer Networks for Unsupervised Cross-Domain Image Classification
Xianghong Fang
Haoli Bai
Ziyi Guo
Bin Shen
Guosheng Lin
Zenglin Xu
OOD
80
44
0
30 Dec 2018
Hierarchical LSTMs with Adaptive Attention for Visual Captioning
Jingkuan Song
Xiangpeng Li
Lianli Gao
Heng Tao Shen
162
231
0
26 Dec 2018
Grounded Video Description
Luowei Zhou
Yannis Kalantidis
Xinlei Chen
Jason J. Corso
Marcus Rohrbach
307
203
0
17 Dec 2018
Real-Time Referring Expression Comprehension by Single-Stage Grounding Network
Xinpeng Chen
Lin Ma
Jingyuan Chen
Zequn Jie
Wen Liu
Jiebo Luo
ObjD
181
125
0
09 Dec 2018
MTLE: A Multitask Learning Encoder of Visual Feature Representations for Video and Movie Description
Oliver A. Nina
Washington Garcia
Scott Clouse
Alper Yilmaz
177
4
0
19 Sep 2018
Exploring Visual Relationship for Image Captioning
Ting Yao
Yingwei Pan
Yehao Li
Tao Mei
338
893
0
19 Sep 2018
The ActivityNet Large-Scale Activity Recognition Challenge 2018 Summary
Guohao Li
Juan Carlos Niebles
Cees G. M. Snoek
Fabian Caba Heilbron
Humam Alwassel
Victor Escorcia
Ranjay Krishna
S. Buch
Cuong Duc Dao
233
66
0
11 Aug 2018
Move Forward and Tell: A Progressive Generator of Video Descriptions
Yilei Xiong
Bo Dai
Dahua Lin
173
115
0
26 Jul 2018
Video Captioning with Boundary-aware Hierarchical Language Decoding and Joint Video Prediction
Xiangxi Shi
Jianfei Cai
Jiuxiang Gu
Shafiq Joty
110
19
0
08 Jul 2018
YH Technologies at ActivityNet Challenge 2018
Ting Yao
Xue Li
107
11
0
29 Jun 2018