v1v2v3 (latest)

Translating Videos to Natural Language Using Deep Recurrent Neural Networks

North American Chapter of the Association for Computational Linguistics (NAACL), 2014

15 December 2014

Subhashini Venugopalan

Papers citing "Translating Videos to Natural Language Using Deep Recurrent Neural Networks"

50 / 334 papers shown

Title
Generating Descriptions from Structured Data Using a Bifocal Attention Mechanism and Gated OrthogonalizationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2018 Preksha Nema Shreyas Shetty Parag Jain Anirban Laha Karthik Sankaranarayanan Mitesh M. Khapra 145 40 0 20 Apr 2018
Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning Xinze Wang Yuan-fang Wang William Yang Wang 118 79 0 15 Apr 2018
Multilevel Language and Vision Integration for Text-to-Clip Retrieval Huijuan Xu Kun He Bryan A. Plummer Leonid Sigal Stan Sclaroff Kate Saenko CLIP 198 345 0 13 Apr 2018
Deep Differential Recurrent Neural Networks Naifan Zhuang T. Kieu Guo-Jun Qi K. Hua 80 3 0 11 Apr 2018
End-to-End Dense Video Captioning with Masked Transformer Luowei Zhou Yingbo Zhou Jason J. Corso R. Socher Caiming Xiong 221 574 0 03 Apr 2018
Seeing Voices and Hearing Faces: Cross-modal biometric matching Arsha Nagrani Samuel Albanie Andrew Zisserman CVBM 357 241 0 01 Apr 2018
Reconstruction Network for Video Captioning Bairui Wang Lin Ma Wei Zhang Wen Liu 192 338 0 30 Mar 2018
End-to-End Video Captioning with Multitask Reinforcement Learning Lijun Li Boqing Gong 126 59 0 21 Mar 2018
Joint Event Detection and Description in Continuous Video Streams Huijuan Xu Boyang Albert Li Vasili Ramanishka Leonid Sigal Kate Saenko 128 58 0 28 Feb 2018
A Neural Multi-sequence Alignment TeCHnique (NeuMATCH) Pelin Dogan Boyang Albert Li Leonid Sigal Markus Gross AI4TS 206 24 0 19 Feb 2018
Probabilistic Recurrent State-Space Models Andreas Doerr Christian Daniel Martin Schiegg D. Nguyen-Tuong S. Schaal Marc Toussaint Sebastian Trimpe 256 134 0 31 Jan 2018
Video-based Sign Language Recognition without Temporal Segmentation Jie Huang Wen-gang Zhou Qilin Zhang Houqiang Li Weiping Li SLR 248 446 0 30 Jan 2018
Deep Neural Networks In Fully Connected CRF For Image Labeling With Social Network Metadata Chengjiang Long Roddy Collins E. Swears A. Hoogs 149 24 0 27 Jan 2018
Foreground Segmentation Using a Triplet Convolutional Neural Network for Multiscale Feature Encoding Long Ang Lim H. Keles 141 206 0 07 Jan 2018
Recent Advances in Recurrent Neural Networks Hojjat Salehinejad Sharan Sankar Joseph Barfett E. Colak S. Valaee AI4TS 448 678 0 29 Dec 2017
Visual Features for Context-Aware Speech Recognition Abhinav Gupta Yajie Miao Leonardo Neves Florian Metze 137 43 0 01 Dec 2017
Video Captioning via Hierarchical Reinforcement Learning Xin Eric Wang Wenhu Chen Jiawei Wu Yuan-fang Wang William Yang Wang 194 248 0 29 Nov 2017
Convolutional Image Captioning J. Aneja Aditya Deshpande Alex Schwing VLM 218 388 0 24 Nov 2017
Integrating both Visual and Audio Cues for Enhanced Video Caption Wangli Hao Zhaoxiang Zhang He Guan Guibo Zhu 164 37 0 22 Nov 2017
Excitation Backprop for RNNs Sarah Adel Bargal Andrea Zunino Donghyun Kim Jianming Zhang Vittorio Murino Stan Sclaroff 270 48 0 18 Nov 2017
Grounded Objects and Interactions for Video Captioning Chih-Yao Ma Asim Kadav I. Melvin Z. Kira G. Al-Regib H. Graf 109 6 0 16 Nov 2017
Attend and Interact: Higher-Order Object Interactions for Video Understanding Chih-Yao Ma Asim Kadav I. Melvin Z. Kira G. Al-Regib H. Graf 172 149 0 16 Nov 2017
Whodunnit? Crime Drama as a Case for Natural Language Understanding Lea Frermann Shay B. Cohen Mirella Lapata 100 27 0 31 Oct 2017
Describing Natural Images Containing Novel Objects with Knowledge Guided Assitance Aditya Mogadala Umanga Bista Lexing Xie Achim Rettinger 152 7 0 17 Oct 2017
Translating Videos to Commands for Robotic Manipulation with Deep Recurrent Neural Networks Anh Nguyen Dimitrios Kanoulas L. Muratore D. Caldwell Nikos G. Tsagarakis 145 73 0 01 Oct 2017
Predicting Visual Features from Text for Image and Video Caption Retrieval Jianfeng Dong Xirong Li Cees G. M. Snoek 188 238 0 05 Sep 2017
A Comprehensive Survey of Deep Learning in Remote Sensing: Theories, Tools and Challenges for the Community John E. Ball Derek T. Anderson Chee Seng Chan 177 521 0 01 Sep 2017
Video Captioning with Guidance of Multimodal Latent Topics Shizhe Chen Jia Chen Qin Jin Alexander G. Hauptmann 187 71 0 31 Aug 2017
Generating Video Descriptions with Topic Guidance Shizhe Chen Jia Chen Qin Jin 135 21 0 31 Aug 2017
Going Deeper with Semantics: Video Activity Interpretation using Semantic Contextualization Sathyanarayanan N. Aakur F. Souza Sudeep Sarkar 91 10 0 11 Aug 2017
From Deterministic to Generative: Multi-Modal Stochastic RNNs for Video CaptioningIEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2017 Jingkuan Song Yuyu Guo Lianli Gao Xuelong Li Alan Hanjalic Heng Tao Shen 170 228 0 08 Aug 2017
Reinforced Video Captioning with Entailment RewardsConference on Empirical Methods in Natural Language Processing (EMNLP), 2017 Ramakanth Pasunuru Joey Tianyi Zhou 145 118 0 07 Aug 2017
Localizing Moments in Video with Natural Language Lisa Anne Hendricks Oliver Wang Eli Shechtman Josef Sivic Trevor Darrell Bryan C. Russell 381 1,097 0 04 Aug 2017
Supervising Neural Attention Models for Video Captioning by Human Gaze Data Youngjae Yu Jongwook Choi Yeonhwa Kim Kyung Yoo Sang-Hun Lee Gunhee Kim 188 70 0 19 Jul 2017
Large-scale Video Classification guided by Batch Normalized LSTM Translator Jae Hyeon Yoo VLM 65 13 0 13 Jul 2017
Automated Audio Captioning with Recurrent Neural Networks Konstantinos Drossos Sharath Adavanne Maria Sandsten 166 139 0 30 Jun 2017
Hierarchical Label Inference for Video Classification Nelson Nauata Jonathan Smith Greg Mori BDL 183 2 0 15 Jun 2017
Hierarchical LSTM with Adjusted Temporal Attention for Video CaptioningInternational Joint Conference on Artificial Intelligence (IJCAI), 2017 Jingkuan Song Zhao Guo Lianli Gao Wu Liu Dongxiang Zhang Heng Tao Shen 175 168 0 05 Jun 2017
Learning by Association - A versatile semi-supervised training method for neural networksComputer Vision and Pattern Recognition (CVPR), 2017 Philip Häusser A. Mordvintsev Zorah Lähner BDL 120 122 0 03 Jun 2017
Multimodal Machine Learning: A Survey and Taxonomy T. Baltrušaitis Chaitanya Ahuja Louis-Philippe Morency 468 3,535 0 26 May 2017
Learning a bidirectional mapping between human whole-body motion and natural language using deep recurrent neural networks Matthias Plappert Christian Mandery Tamim Asfour 3DH 307 138 0 18 May 2017
Dense-Captioning Events in Videos Ranjay Krishna Kenji Hata F. Ren Li Fei-Fei Juan Carlos Niebles 384 1,429 0 02 May 2017
Multi-Task Video Captioning with Video and Entailment Generation Ramakanth Pasunuru Joey Tianyi Zhou 182 120 0 24 Apr 2017
Video Fill In the Blank using LR/RL LSTMs with Spatial-Temporal Attentions Amir Mazaheri Dong Zhang M. Shah 91 12 0 15 Apr 2017
Learning Two-Branch Neural Networks for Image-Text Matching Tasks Liwei Wang Yin Li Jing-ling Huang Svetlana Lazebnik VLM 205 530 0 11 Apr 2017
Weakly Supervised Dense Video Captioning Zhiqiang Shen Jianguo Li Zhou Su Minjun Li Yurong Chen Yu-Gang Jiang Xiangyang Xue 183 140 0 05 Apr 2017
Survey of the State of the Art in Natural Language Generation: Core tasks, applications and evaluation Albert Gatt E. Krahmer LM&MA ELM 381 872 0 29 Mar 2017
Improving Classification by Improving Labelling: Introducing Probabilistic Multi-Label Object Interaction Recognition Michael Wray Davide Moltisanti W. Mayol-Cuevas Dima Damen 138 2 0 24 Mar 2017
Joint Intermodal and Intramodal Label Transfers for Extremely Rare or Unseen Classes Guo-Jun Qi Wen Liu Charu C. Aggarwal Thomas Huang VLM 162 54 0 22 Mar 2017
Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning Abhishek Das Satwik Kottur J. M. F. Moura Stefan Lee Dhruv Batra OffRL 298 429 0 20 Mar 2017