Long-term Recurrent Convolutional Networks for Visual Recognition and Description

17 November 2014
Jeff Donahue
Lisa Anne Hendricks
Marcus Rohrbach
Subhashini Venugopalan
S. Guadarrama
Kate Saenko
Trevor Darrell
    VLM

Papers citing "Long-term Recurrent Convolutional Networks for Visual Recognition and Description"

50 / 468 papers shown
Going Deeper into Action Recognition: A Survey
Samitha Herath
Mehrtash Harandi
Fatih Porikli
13
610
0
16 May 2016
Movie Description
Anna Rohrbach
Atousa Torabi
Marcus Rohrbach
Niket Tandon
C. Pal
Hugo Larochelle
Aaron Courville
Bernt Schiele
3DV
VGen
27
353
0
12 May 2016
Convolutional Two-Stream Network Fusion for Video Action Recognition
Christoph Feichtenhofer
A. Pinz
Andrew Zisserman
31
2,603
0
22 Apr 2016
Online Action Detection
R. D. Geest
E. Gavves
Amir Ghodrati
Zhenyang Li
Cees G. M. Snoek
Tinne Tuytelaars
OffRL
13
151
0
21 Apr 2016
Online Human Action Detection using Joint Classification-Regression Recurrent Neural Networks
Yanghao Li
Cuiling Lan
Junliang Xing
Wenjun Zeng
Chunfen Yuan
Jiaying Liu
11
209
0
19 Apr 2016
Annotation Order Matters: Recurrent Image Annotator for Arbitrary Length Image Tagging
Jiren Jin
Hideki Nakayama
3DV
VLM
14
69
0
18 Apr 2016
Learning Temporal Regularity in Video Sequences
Mahmudul Hasan
Jonghyun Choi
J. Neumann
A. Roy-Chowdhury
L. Davis
27
1,087
0
15 Apr 2016
Learning Visual Storylines with Skipping Recurrent Neural Networks
Gunnar A. Sigurdsson
Xinlei Chen
Abhinav Gupta
13
38
0
14 Apr 2016
NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis
Amir Shahroudy
Jun Liu
T. Ng
G. Wang
61
2,457
0
11 Apr 2016
Learning to Track at 100 FPS with Deep Regression Networks
David Held
Sebastian Thrun
Silvio Savarese
OffRL
17
1,190
0
06 Apr 2016
Improving LSTM-based Video Description with Linguistic Knowledge Mined from Text
Subhashini Venugopalan
Lisa Anne Hendricks
Raymond J. Mooney
Kate Saenko
VLM
20
117
0
06 Apr 2016
Minimal Gated Unit for Recurrent Neural Networks
Guo-Bing Zhou
Jianxin Wu
Chen-Lin Zhang
Zhi-Hua Zhou
20
325
0
31 Mar 2016
Rich Image Captioning in the Wild
Kenneth Tran
Xiaodong He
Lei Zhang
Jian Sun
Cornelia Carapcea
Chris Thrasher
Chris Buehler
Chris Sienkiewicz
VLM
11
123
0
30 Mar 2016
BreakingNews: Article Annotation by Image and Text Processing
Arnau Ramisa
F. Yan
Francesc Moreno-Noguer
K. Mikolajczyk
21
105
0
23 Mar 2016
Accelerating Deep Neural Network Training with Inconsistent Stochastic Gradient Descent
Linnan Wang
Yi Yang
Martin Renqiang Min
S. Chakradhar
6
91
0
17 Mar 2016
From virtual demonstration to real-world manipulation using LSTM and MDN
Rouhollah Rahmatizadeh
P. Abolghasemi
Aman Behal
Ladislau Bölöni
10
14
0
12 Mar 2016
Learning a Deep Model for Human Action Recognition from Novel Viewpoints
Hossein Rahmani
Ajmal Saeed Mian
M. Shah
19
209
0
02 Feb 2016
Order-aware Convolutional Pooling for Video Based Action Recognition
Peng Wang
Lingqiao Liu
Chunhua Shen
Heng Tao Shen
24
25
0
31 Jan 2016
Brain4Cars: Car That Knows Before You Do via Sensory-Fusion Deep Learning Architecture
Ashesh Jain
H. Koppula
Shane Soh
Bharad Raghavan
Avi Singh
Ashutosh Saxena
26
126
0
05 Jan 2016
DenseCap: Fully Convolutional Localization Networks for Dense Captioning
Justin Johnson
A. Karpathy
Li Fei-Fei
VLM
19
1,159
0
24 Nov 2015
Delving Deeper into Convolutional Networks for Learning Video Representations
Nicolas Ballas
L. Yao
C. Pal
Aaron Courville
MDE
17
692
0
19 Nov 2015
Generating Sentences from a Continuous Space
Samuel R. Bowman
Luke Vilnis
Oriol Vinyals
Andrew M. Dai
Rafal Jozefowicz
Samy Bengio
DRL
15
2,340
0
19 Nov 2015
Structural-RNN: Deep Learning on Spatio-Temporal Graphs
Ashesh Jain
Amir Zamir
Silvio Savarese
Ashutosh Saxena
GNN
46
1,080
0
17 Nov 2015
Deep Compositional Captioning: Describing Novel Object Categories without Paired Training Data
Lisa Anne Hendricks
Subhashini Venugopalan
Marcus Rohrbach
Raymond J. Mooney
Kate Saenko
Trevor Darrell
CoGe
14
284
0
17 Nov 2015
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering
Huijuan Xu
Kate Saenko
22
760
0
17 Nov 2015
Visual7W: Grounded Question Answering in Images
Yuke Zhu
Oliver Groth
Michael S. Bernstein
Li Fei-Fei
22
871
0
11 Nov 2015
Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks
Haonan Yu
Jiang Wang
Zhiheng Huang
Yi Yang
W. Xu
42
560
0
26 Oct 2015
Guiding Long-Short Term Memory for Image Caption Generation
Xu Jia
E. Gavves
Basura Fernando
Tinne Tuytelaars
VLM
14
101
0
16 Sep 2015
Learning Contextual Dependencies with Convolutional Hierarchical Recurrent Neural Networks
Zhen Zuo
Bing Shuai
G. Wang
Xiao Liu
Xingxing Wang
B. Wang
11
93
0
13 Sep 2015
Recurrent Network Models for Human Dynamics
Katerina Fragkiadaki
Sergey Levine
Panna Felsen
Jitendra Malik
23
30
0
02 Aug 2015
Describing Multimedia Content using Attention-based Encoder-Decoder Networks
Kyunghyun Cho
Aaron Courville
Yoshua Bengio
32
410
0
04 Jul 2015
Aligning where to see and what to tell: image caption with region-based attention and scene factorization
Junqi Jin
Kun Fu
Runpeng Cui
Fei Sha
Changshui Zhang
26
117
0
20 Jun 2015
Reading Scene Text in Deep Convolutional Sequences
Pan He
Weilin Huang
Yu Qiao
Chen Change Loy
Xiaoou Tang
16
307
0
14 Jun 2015
Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting
Xingjian Shi
Zhourong Chen
Hao Wang
Dit-Yan Yeung
W. Wong
W. Woo
215
7,902
0
13 Jun 2015
Listen, Attend, and Walk: Neural Mapping of Navigational Instructions to Action Sequences
Hongyuan Mei
Mohit Bansal
Matthew R. Walter
LM&Ro
13
242
0
12 Jun 2015
Learning language through pictures
Grzegorz Chrupała
Ákos Kádár
A. Alishahi
VLM
SSL
27
65
0
11 Jun 2015
P-CNN: Pose-based CNN Features for Action Recognition
Guilhem Chéron
Ivan Laptev
Cordelia Schmid
18
606
0
11 Jun 2015
Pointer Networks
Oriol Vinyals
Meire Fortunato
Navdeep Jaitly
22
3,012
0
09 Jun 2015
Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks
Samy Bengio
Oriol Vinyals
Navdeep Jaitly
Noam M. Shazeer
21
2,017
0
09 Jun 2015
Visualizing and Understanding Recurrent Networks
A. Karpathy
Justin Johnson
Li Fei-Fei
HAI
14
1,096
0
05 Jun 2015
Learning to track for spatio-temporal action localization
Philippe Weinzaepfel
Zaïd Harchaoui
Cordelia Schmid
21
338
0
05 Jun 2015
Beyond Temporal Pooling: Recurrence and Temporal Convolutions for Gesture Recognition in Video
Lionel Pigou
Aaron van den Oord
Sander Dieleman
Mieke Van Herreweghe
J. Dambre
25
254
0
05 Jun 2015
Learning to Answer Questions From Image Using Convolutional Neural Network
Lin Ma
Zhengdong Lu
Hang Li
13
262
0
01 Jun 2015
Visual Madlibs: Fill in the blank Image Generation and Question Answering
Licheng Yu
Eunbyung Park
Alexander C. Berg
Tamara L. Berg
VLM
MLLM
24
98
0
31 May 2015
Weakly-Supervised Alignment of Video With Text
Piotr Bojanowski
Rémi Lajugie
Edouard Grave
Francis R. Bach
Ivan Laptev
Jean Ponce
Cordelia Schmid
19
134
0
22 May 2015
Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question Answering
Haoyuan Gao
Junhua Mao
Jie Zhou
Zhiheng Huang
Lei Wang
W. Xu
23
497
0
21 May 2015
Jointly Modeling Embedding and Translation to Bridge Video and Language
Yingwei Pan
Tao Mei
Ting Yao
Houqiang Li
Y. Rui
27
534
0
07 May 2015
Language Models for Image Captioning: The Quirks and What Works
Jacob Devlin
Hao Cheng
Hao Fang
Saurabh Gupta
Li Deng
Xiaodong He
Geoffrey Zweig
Margaret Mitchell
22
281
0
07 May 2015
Ask Your Neurons: A Neural-based Approach to Answering Questions about Images
Mateusz Malinowski
Marcus Rohrbach
Mario Fritz
19
596
0
05 May 2015
VQA: Visual Question Answering
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
28
5,361
0
03 May 2015