Translating Videos to Natural Language Using Deep Recurrent Neural Networks

North American Chapter of the Association for Computational Linguistics (NAACL), 2015
15 December 2014
Subhashini Venugopalan
Huijuan Xu
Jeff Donahue
Marcus Rohrbach
Raymond J. Mooney
Kate Saenko
arXiv: 1412.4729

Papers citing "Translating Videos to Natural Language Using Deep Recurrent Neural Networks"

50 / 334 papers shown
Generating Descriptions from Structured Data Using a Bifocal Attention Mechanism and Gated Orthogonalization
North American Chapter of the Association for Computational Linguistics (NAACL), 2018
Preksha Nema
Shreyas Shetty
Parag Jain
Anirban Laha
Karthik Sankaranarayanan
Mitesh M. Khapra
145
40
0
20 Apr 2018
Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning
Xinze Wang
Yuan-fang Wang
William Yang Wang
118
79
0
15 Apr 2018
Multilevel Language and Vision Integration for Text-to-Clip Retrieval
Huijuan Xu
Kun He
Bryan A. Plummer
Leonid Sigal
Stan Sclaroff
Kate Saenko
CLIP
198
345
0
13 Apr 2018
Deep Differential Recurrent Neural Networks
Naifan Zhuang
T. Kieu
Guo-Jun Qi
K. Hua
80
3
0
11 Apr 2018
End-to-End Dense Video Captioning with Masked Transformer
Luowei Zhou
Yingbo Zhou
Jason J. Corso
R. Socher
Caiming Xiong
221
574
0
03 Apr 2018
Seeing Voices and Hearing Faces: Cross-modal biometric matching
Arsha Nagrani
Samuel Albanie
Andrew Zisserman
CVBM
357
241
0
01 Apr 2018
Reconstruction Network for Video Captioning
Bairui Wang
Lin Ma
Wei Zhang
Wen Liu
192
338
0
30 Mar 2018
End-to-End Video Captioning with Multitask Reinforcement Learning
Lijun Li
Boqing Gong
126
59
0
21 Mar 2018
Joint Event Detection and Description in Continuous Video Streams
Huijuan Xu
Boyang Albert Li
Vasili Ramanishka
Leonid Sigal
Kate Saenko
128
58
0
28 Feb 2018
A Neural Multi-sequence Alignment TeCHnique (NeuMATCH)
Pelin Dogan
Boyang Albert Li
Leonid Sigal
Markus Gross
AI4TS
206
24
0
19 Feb 2018
Probabilistic Recurrent State-Space Models
Andreas Doerr
Christian Daniel
Martin Schiegg
D. Nguyen-Tuong
S. Schaal
Marc Toussaint
Sebastian Trimpe
256
134
0
31 Jan 2018
Video-based Sign Language Recognition without Temporal Segmentation
Jie Huang
Wen-gang Zhou
Qilin Zhang
Houqiang Li
Weiping Li
SLR
248
446
0
30 Jan 2018
Deep Neural Networks In Fully Connected CRF For Image Labeling With Social Network Metadata
Chengjiang Long
Roddy Collins
E. Swears
A. Hoogs
149
24
0
27 Jan 2018
Foreground Segmentation Using a Triplet Convolutional Neural Network for Multiscale Feature Encoding
Long Ang Lim
H. Keles
141
206
0
07 Jan 2018
Recent Advances in Recurrent Neural Networks
Hojjat Salehinejad
Sharan Sankar
Joseph Barfett
E. Colak
S. Valaee
AI4TS
448
678
0
29 Dec 2017
Visual Features for Context-Aware Speech Recognition
Abhinav Gupta
Yajie Miao
Leonardo Neves
Florian Metze
137
43
0
01 Dec 2017
Video Captioning via Hierarchical Reinforcement Learning
Xin Eric Wang
Wenhu Chen
Jiawei Wu
Yuan-fang Wang
William Yang Wang
194
248
0
29 Nov 2017
Convolutional Image Captioning
J. Aneja
Aditya Deshpande
Alex Schwing
VLM
218
388
0
24 Nov 2017
Integrating both Visual and Audio Cues for Enhanced Video Caption
Wangli Hao
Zhaoxiang Zhang
He Guan
Guibo Zhu
164
37
0
22 Nov 2017
Excitation Backprop for RNNs
Sarah Adel Bargal
Andrea Zunino
Donghyun Kim
Jianming Zhang
Vittorio Murino
Stan Sclaroff
270
48
0
18 Nov 2017
Grounded Objects and Interactions for Video Captioning
Chih-Yao Ma
Asim Kadav
I. Melvin
Z. Kira
G. Al-Regib
H. Graf
109
6
0
16 Nov 2017
Attend and Interact: Higher-Order Object Interactions for Video Understanding
Chih-Yao Ma
Asim Kadav
I. Melvin
Z. Kira
G. Al-Regib
H. Graf
172
149
0
16 Nov 2017
Whodunnit? Crime Drama as a Case for Natural Language Understanding
Lea Frermann
Shay B. Cohen
Mirella Lapata
100
27
0
31 Oct 2017
Describing Natural Images Containing Novel Objects with Knowledge Guided Assitance
Aditya Mogadala
Umanga Bista
Lexing Xie
Achim Rettinger
152
7
0
17 Oct 2017
Translating Videos to Commands for Robotic Manipulation with Deep Recurrent Neural Networks
Anh Nguyen
Dimitrios Kanoulas
L. Muratore
D. Caldwell
Nikos G. Tsagarakis
145
73
0
01 Oct 2017
Predicting Visual Features from Text for Image and Video Caption Retrieval
Jianfeng Dong
Xirong Li
Cees G. M. Snoek
188
238
0
05 Sep 2017
A Comprehensive Survey of Deep Learning in Remote Sensing: Theories, Tools and Challenges for the Community
John E. Ball
Derek T. Anderson
Chee Seng Chan
177
521
0
01 Sep 2017
Video Captioning with Guidance of Multimodal Latent Topics
Shizhe Chen
Jia Chen
Qin Jin
Alexander G. Hauptmann
187
71
0
31 Aug 2017
Generating Video Descriptions with Topic Guidance
Shizhe Chen
Jia Chen
Qin Jin
135
21
0
31 Aug 2017
Going Deeper with Semantics: Video Activity Interpretation using Semantic Contextualization
Sathyanarayanan N. Aakur
F. Souza
Sudeep Sarkar
91
10
0
11 Aug 2017
From Deterministic to Generative: Multi-Modal Stochastic RNNs for Video Captioning
IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2017
Jingkuan Song
Yuyu Guo
Lianli Gao
Xuelong Li
Alan Hanjalic
Heng Tao Shen
170
228
0
08 Aug 2017
Reinforced Video Captioning with Entailment Rewards
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2017
Ramakanth Pasunuru
Mohit Bansal
145
118
0
07 Aug 2017
Localizing Moments in Video with Natural Language
Lisa Anne Hendricks
Oliver Wang
Eli Shechtman
Josef Sivic
Trevor Darrell
Bryan C. Russell
381
1,097
0
04 Aug 2017
Supervising Neural Attention Models for Video Captioning by Human Gaze Data
Youngjae Yu
Jongwook Choi
Yeonhwa Kim
Kyung Yoo
Sang-Hun Lee
Gunhee Kim
188
70
0
19 Jul 2017
Large-scale Video Classification guided by Batch Normalized LSTM Translator
Jae Hyeon Yoo
VLM
65
13
0
13 Jul 2017
Automated Audio Captioning with Recurrent Neural Networks
Konstantinos Drossos
Sharath Adavanne
Tuomas Virtanen
166
139
0
30 Jun 2017
Hierarchical Label Inference for Video Classification
Nelson Nauata
Jonathan Smith
Greg Mori
BDL
183
2
0
15 Jun 2017
Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning
International Joint Conference on Artificial Intelligence (IJCAI), 2017
Jingkuan Song
Zhao Guo
Lianli Gao
Wu Liu
Dongxiang Zhang
Heng Tao Shen
175
168
0
05 Jun 2017
Learning by Association - A versatile semi-supervised training method for neural networks
Computer Vision and Pattern Recognition (CVPR), 2017
Philip Häusser
A. Mordvintsev
Daniel Cremers
BDL
120
122
0
03 Jun 2017
Multimodal Machine Learning: A Survey and Taxonomy
T. Baltrušaitis
Chaitanya Ahuja
Louis-Philippe Morency
468
3,535
0
26 May 2017
Learning a bidirectional mapping between human whole-body motion and natural language using deep recurrent neural networks
Matthias Plappert
Christian Mandery
Tamim Asfour
3DH
307
138
0
18 May 2017
Dense-Captioning Events in Videos
Ranjay Krishna
Kenji Hata
F. Ren
Li Fei-Fei
Juan Carlos Niebles
384
1,429
0
02 May 2017
Multi-Task Video Captioning with Video and Entailment Generation
Ramakanth Pasunuru
Mohit Bansal
182
120
0
24 Apr 2017
Video Fill In the Blank using LR/RL LSTMs with Spatial-Temporal Attentions
Amir Mazaheri
Dong Zhang
M. Shah
91
12
0
15 Apr 2017
Learning Two-Branch Neural Networks for Image-Text Matching Tasks
Liwei Wang
Yin Li
Jing-ling Huang
Svetlana Lazebnik
VLM
205
530
0
11 Apr 2017
Weakly Supervised Dense Video Captioning
Zhiqiang Shen
Jianguo Li
Zhou Su
Minjun Li
Yurong Chen
Yu-Gang Jiang
Xiangyang Xue
183
140
0
05 Apr 2017
Survey of the State of the Art in Natural Language Generation: Core tasks, applications and evaluation
Albert Gatt
E. Krahmer
LM&MAELM
381
872
0
29 Mar 2017
Improving Classification by Improving Labelling: Introducing Probabilistic Multi-Label Object Interaction Recognition
Michael Wray
Davide Moltisanti
W. Mayol-Cuevas
Dima Damen
138
2
0
24 Mar 2017
Joint Intermodal and Intramodal Label Transfers for Extremely Rare or Unseen Classes
Guo-Jun Qi
Wen Liu
Charu C. Aggarwal
Thomas Huang
VLM
162
54
0
22 Mar 2017
Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning
Abhishek Das
Satwik Kottur
J. M. F. Moura
Stefan Lee
Dhruv Batra
OffRL
298
429
0
20 Mar 2017