1412.4729

Translating Videos to Natural Language Using Deep Recurrent Neural Networks

North American Chapter of the Association for Computational Linguistics (NAACL), 2014

15 December 2014

Subhashini Venugopalan

Papers citing "Translating Videos to Natural Language Using Deep Recurrent Neural Networks"

50 / 334 papers shown

Title
Improving Interpretability of Deep Neural Networks with Semantic Information Yinpeng Dong Hang Su Jun Zhu Bo Zhang 178 130 0 12 Mar 2017
A Survey on Content-Aware Video Analysis for Sports H. Shih 142 210 0 03 Mar 2017
Image-Grounded Conversations: Multimodal Context for Natural Question and Response Generation International Joint Conference on Natural Language Processing (IJCNLP), 2017 N. Mostafazadeh Chris Brockett W. Dolan Michel Galley Jianfeng Gao Georgios P. Spithourakis Lucy Vanderwende 295 190 0 28 Jan 2017
Attention-Based Multimodal Fusion for Video Description IEEE International Conference on Computer Vision (ICCV), 2017 Chiori Hori Takaaki Hori Teng-Yok Lee Kazuhiro Sumi J. Hershey Tim K. Marks 308 376 0 11 Jan 2017
Top-down Visual Saliency Guided by Captions Computer Vision and Pattern Recognition (CVPR), 2016 Vasili Ramanishka Abir Das Jianming Zhang Kate Saenko 163 147 0 21 Dec 2016
Video Captioning with Multi-Faceted Attention Xiang Long Chuang Gan Gerard de Melo 161 88 0 01 Dec 2016
Hierarchical Boundary-Aware Neural Encoder for Video Captioning Lorenzo Baraldi C. Grana Rita Cucchiara 263 196 0 28 Nov 2016
Bidirectional Multirate Reconstruction for Temporal Modeling in Videos Linchao Zhu Zhongwen Xu Yi Yang 150 78 0 28 Nov 2016
Visual Dialog Abhishek Das Satwik Kottur Khushi Gupta Avi Singh Deshraj Yadav José M. F. Moura Devi Parikh Dhruv Batra 318 1,056 0 26 Nov 2016
Semantic Compositional Networks for Visual Captioning Zhe Gan Chuang Gan Xiaodong He Yunchen Pu Kenneth Tran Jianfeng Gao Lawrence Carin Li Deng CoGe 261 443 0 23 Nov 2016
Adaptive Feature Abstraction for Translating Video to Text Yunchen Pu Martin Renqiang Min Zhe Gan Lawrence Carin 186 14 0 23 Nov 2016
A dataset and exploration of models for understanding video data through fill-in-the-blank question-answering Tegan Maharaj Nicolas Ballas Anna Rohrbach Aaron Courville C. Pal VGen 157 113 0 23 Nov 2016
Video Captioning with Transferred Semantic Attributes Yingwei Pan Ting Yao Houqiang Li Tao Mei 138 336 0 23 Nov 2016
Recurrent Memory Addressing for describing videos A. Jain Abhinav Agarwalla Kumar Krishna Agrawal Pabitra Mitra 128 10 0 20 Nov 2016
SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning Long Chen Hanwang Zhang Jun Xiao Liqiang Nie Jian Shao Wei Liu Tat-Seng Chua 432 1,776 0 17 Nov 2016
Multimodal Memory Modelling for Video Captioning Junbo Wang Wei Wang Yan Huang Liang Wang Tieniu Tan 191 146 0 17 Nov 2016
Learning long-term dependencies for action recognition with a biologically-inspired deep network Yemin Shi Yonghong Tian Yaowei Wang Tiejun Huang 159 65 0 16 Nov 2016
Memory-augmented Attention Modelling for Videos Rasool Fakoor Abdel-rahman Mohamed Margaret Mitchell S. B. Kang Pushmeet Kohli 260 20 0 07 Nov 2016
Inference Compilation and Universal Probabilistic Programming T. Le A. G. Baydin Frank Wood UQCV 442 142 0 31 Oct 2016
Spatio-Temporal Attention Models for Grounded Video Captioning M. Zanfir Elisabeta Marinoiu C. Sminchisescu 204 51 0 17 Oct 2016
Video Fill in the Blank with Merging LSTMs Amir Mazaheri Dong Zhang M. Shah 118 18 0 13 Oct 2016
End-to-end Concept Word Detection for Video Captioning, Retrieval, and Question Answering Computer Vision and Pattern Recognition (CVPR), 2016 Youngjae Yu Hyungjin Ko Jongwook Choi Gunhee Kim 376 239 0 10 Oct 2016
Prediction of Manipulation Actions Cornelia Fermuller Fang Wang Yezhou Yang Konstantinos Zampogiannis Yi Zhang Francisco Barranco Michael Pfeiffer 172 51 0 03 Oct 2016
A Survey of Multi-View Representation Learning Yingming Li Ming Yang Zhongfei Zhang AI4TS 3DV 558 580 0 03 Oct 2016
Recurrent Convolutional Networks for Pulmonary Nodule Detection in CT Imaging P. Ypsilantis Giovanni Montana MedIm 123 32 0 28 Sep 2016
Pose-Selective Max Pooling for Measuring Similarity Xiang Xiang T. Tran CVBM 196 5 0 22 Sep 2016
Deep Learning for Video Classification and Captioning Zuxuan Wu Ting Yao Yanwei Fu Yu-Gang Jiang 3DV VLM 131 139 0 22 Sep 2016
Learning to generalize to new compositions in image understanding Yuval Atzmon Jonathan Berant Vahid Kezami Amir Globerson Gal Chechik 151 70 0 27 Aug 2016
Title Generation for User Generated Videos European Conference on Computer Vision (ECCV), 2016 Kuo-Hao Zeng Tseng-Hung Chen Juan Carlos Niebles Min Sun 160 71 0 25 Aug 2016
Learning Joint Representations of Videos and Sentences with Web Image Search Mayu Otani Yuta Nakashima Esa Rahtu J. Heikkilä N. Yokoya 150 95 0 08 Aug 2016
A Comprehensive Survey on Cross-modal Retrieval Jen-tse Huang Qiyue Yin Wei Wang Shu Wu Liang Wang 170 318 0 21 Jul 2016
Hierarchical Deep Temporal Models for Group Activity Recognition Mostafa S. Ibrahim S. Muralidharan Zhiwei Deng Arash Vahdat Greg Mori 296 484 0 09 Jul 2016
Domain Adaptation for Neural Networks by Parameter Augmentation Yusuke Watanabe Kazuma Hashimoto Yoshimasa Tsuruoka OOD 137 6 0 01 Jul 2016
Bidirectional Long-Short Term Memory for Video Description Yi Bin Yang Yang Zi Huang Fumin Shen Xing Xu Heng Tao Shen 143 66 0 15 Jun 2016
Storytelling of Photo Stream with Bidirectional Multi-thread Recurrent Neural Network Yu Liu Jianlong Fu Tao Mei C. Chen 126 4 0 02 Jun 2016
Video Summarization with Long Short-term Memory Ke Zhang Wei-Lun Chao Fei Sha Kristen Grauman 252 743 0 26 May 2016
Movie Description Anna Rohrbach Atousa Torabi Marcus Rohrbach Niket Tandon C. Pal Hugo Larochelle Aaron Courville Bernt Schiele 3DV VGen 230 386 0 12 May 2016
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering Mateusz Malinowski Marcus Rohrbach Mario Fritz 216 104 0 09 May 2016
Dependency Parsing with LSTMs: An Empirical Evaluation A. Kuncoro Yu Sawai Kevin Duh Yuji Matsumoto 126 3 0 22 Apr 2016
Attributes as Semantic Units between Natural Language and Visual Recognition Marcus Rohrbach VLM 106 4 0 12 Apr 2016
TGIF: A New Dataset and Benchmark on Animated GIF Description Yuncheng Li Yale Song Liangliang Cao Joel R. Tetreault Larry Goldberg A. Jaimes Jiebo Luo 199 295 0 10 Apr 2016
Improving LSTM-based Video Description with Linguistic Knowledge Mined from Text Subhashini Venugopalan Lisa Anne Hendricks Raymond J. Mooney Kate Saenko VLM 164 121 0 06 Apr 2016
Character-Level Neural Translation for Multilingual Media Monitoring in the SUMMA Project Guntis Barzdins Steve Renals D. Gosko 63 6 0 05 Apr 2016
Character-Level Question Answering with Attention David Golub Xiaodong He 275 189 0 04 Apr 2016
Do You See What I Mean? Visual Resolution of Linguistic Ambiguities Yevgeni Berzak Andrei Barbu Daniel Harari Boris Katz S. Ullman 100 35 0 26 Mar 2016
Attentive Contexts for Object Detection Jianan Li Yunchao Wei Xiaodan Liang Jian Dong Tingfa Xu Jiashi Feng Shuicheng Yan ObjD 124 230 0 24 Mar 2016
Super Mario as a String: Platformer Level Generation Via LSTMs A. Summerville Michael Mateas 179 153 0 02 Mar 2016
A Taxonomy of Deep Convolutional Neural Nets for Computer Vision Suraj Srinivas Ravi Kiran Sarvadevabhatla Konda Reddy Mopuri N. Prabhu S. Kruthiventi R. Venkatesh Babu OOD 157 219 0 25 Jan 2016
MovieQA: Understanding Stories in Movies through Question-Answering Makarand Tapaswi Yukun Zhu Rainer Stiefelhagen Antonio Torralba R. Urtasun Sanja Fidler 240 784 0 09 Dec 2015
A Deep Structured Model with Radius-Margin Bound for 3D Human Activity Recognition Liang Lin Keze Wang W. Zuo Ming Wang Jiebo Luo Lei Zhang HAI BDL 154 104 0 05 Dec 2015
