1411.4389
Long-term Recurrent Convolutional Networks for Visual Recognition and Description
Computer Vision and Pattern Recognition (CVPR), 2014
17 November 2014
Jeff Donahue
Lisa Anne Hendricks
Marcus Rohrbach
Subhashini Venugopalan
S. Guadarrama
Kate Saenko
Trevor Darrell
VLM
Papers citing
"Long-term Recurrent Convolutional Networks for Visual Recognition and Description"
50 / 1,728 papers shown
Image Captioning using Deep Stacked LSTMs, Contextual Word Embeddings and Data Augmentation
Sulabh Katiyar
S. Borgohain
VLM
132
15
0
22 Feb 2021
VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning
Computer Vision and Pattern Recognition (CVPR), 2021
Jun Chen
Han Guo
Kai Yi
Boyang Albert Li
Mohamed Elhoseiny
VLM
428
273
0
20 Feb 2021
One-shot action recognition in challenging therapy scenarios
Alberto Sabater
Laura Santos
J. Santos-Victor
Alexandre Bernardino
Luis Montesano
Ana C. Murillo
396
40
0
17 Feb 2021
Classification of multivariate weakly-labelled time-series with attention
S. Rahman
Chang Wei Tan
138
0
0
16 Feb 2021
Learning to Recognize Actions on Objects in Egocentric Video with Attention Dictionaries
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
EgoV
206
22
0
16 Feb 2021
Win-Fail Action Recognition
Paritosh Parmar
B. Morris
155
6
0
15 Feb 2021
Improved Bengali Image Captioning via deep convolutional neural network based encoder-decoder model
Mohammad Faiyaz Khan
S. M. S. Shifath
Md. Saiful Islam
VLM
122
23
0
14 Feb 2021
Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition
IEEE International Conference on Computer Vision (ICCV), 2021
Heeseung Kwon
Manjin Kim
Suha Kwak
Minsu Cho
TTA
185
47
0
14 Feb 2021
AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition
International Conference on Learning Representations (ICLR), 2021
Yue Meng
Yikang Shen
Chung-Ching Lin
P. Sattigeri
Leonid Karlinsky
Kate Saenko
A. Oliva
Rogerio Feris
291
70
0
10 Feb 2021
In Defense of Scene Graphs for Image Captioning
IEEE International Conference on Computer Vision (ICCV), 2021
Kien Nguyen
Subarna Tripathi
Bang Du
T. Guha
Truong Thao Nguyen
203
52
0
09 Feb 2021
Face Recognition using 3D CNNs
Transactions on Computer Systems and Networks (TCSN), 2021
N. Mishra
S. Singh
3DH
CVBM
85
5
0
02 Feb 2021
Automatic Detection of B-lines in Lung Ultrasound Videos From Severe Dengue Patients
IEEE International Symposium on Biomedical Imaging (ISBI), 2021
H. Kerdegari
P. T. Nhat
A. McBride
Vital Consortium
Reza Razavi
N. V. Hao
L. Thwaites
S. Yacoub
Alberto Gómez
181
10
0
01 Feb 2021
Video Transformer Network
Daniel Neimark
Omri Bar
Maya Zohar
Dotan Asselmann
ViT
774
474
0
01 Feb 2021
Open-domain Topic Identification of Out-of-domain Utterances using Wikipedia
A. Augustin
Alexandros Papangelis
M. Kotti
Pavlos Vougiouklis
James Z. Hare
N. Braunschweiler
100
0
0
26 Jan 2021
A Case Study of Deep Learning Based Multi-Modal Methods for Predicting the Age-Suitability Rating of Movie Trailers
Mahsa Shafaei
C. Smailis
I. Kakadiaris
Thamar Solorio
795
3
0
26 Jan 2021
Probability Trajectory: One New Movement Description for Trajectory Prediction
Pei Lv
Hui Wei
Tianxin Gu
Yuzhen Zhang
Xiaoheng Jiang
Bing Zhou
Mingliang Xu
138
0
0
26 Jan 2021
B-HAR: an open-source baseline framework for in depth study of human activity recognition datasets and workflows
IEEE Access, 2021
Florenc Demrozi
Cristian Turetta
G. Pravadelli
141
10
0
23 Jan 2021
Human Interaction Recognition Framework based on Interacting Body Part Attention
Pattern Recognition (Pattern Recognit.), 2021
Dong-Gyu Lee
Seong-Whan Lee
167
27
0
22 Jan 2021
Bridging the gap between Human Action Recognition and Online Action Detection
Alban Main De Boissiere
R. Noumeir
181
0
0
21 Jan 2021
Learning rich touch representations through cross-modal self-supervision
Conference on Robot Learning (CoRL), 2021
Martina Zambelli
Y. Aytar
Francesco Visin
Yuxiang Zhou
R. Hadsell
SSL
195
17
0
21 Jan 2021
Diagnostic Captioning: A Survey
Knowledge and Information Systems (KAIS), 2021
John Pavlopoulos
Vasiliki Kougia
Ion Androutsopoulos
D. Papamichail
3DV
MedIm
231
31
0
18 Jan 2021
Neural networks behave as hash encoders: An empirical study
Fengxiang He
Shiye Lei
Jianmin Ji
Dacheng Tao
165
3
0
14 Jan 2021
Multimodal Engagement Analysis from Facial Videos in the Classroom
IEEE Transactions on Affective Computing (TAC), 2021
Ömer Sümer
Patricia Goldberg
S. D’Mello
Peter Gerjets
Ulrich Trautwein
Enkelejda Kasneci
337
112
0
11 Jan 2021
Unifying Relational Sentence Generation and Retrieval for Medical Image Report Composition
IEEE Transactions on Cybernetics (IEEE Trans. Cybern.), 2020
Fuyu Wang
Xiaodan Liang
Lin Xu
Liang Lin
MedIm
182
36
0
09 Jan 2021
A Reinforcement Learning Based Encoder-Decoder Framework for Learning Stock Trading Rules
Mehran Taghian
A. Asadi
Reza Safabakhsh
AI4TS
182
2
0
08 Jan 2021
Dairy Cow rumination detection: A deep learning approach
International Workshop on Distributed Computing for Emerging Smart Networks (DCESN), 2021
Safa Ayadi
Ahmed Ben Said
Rateb Jabbar
Chafik Aloulou
Achraf Chabbouh
Ahmed Ben Achballah
68
26
0
07 Jan 2021
Reinforcement Learning with Latent Flow
Neural Information Processing Systems (NeurIPS), 2021
Wenling Shang
Xiaofei Wang
A. Srinivas
Aravind Rajeswaran
Yang Gao
Pieter Abbeel
Michael Laskin
OffRL
158
26
0
06 Jan 2021
Uncertainty-sensitive Activity Recognition: a Reliability Benchmark and the CARING Models
International Conference on Pattern Recognition (ICPR), 2021
Alina Roitberg
Monica Haurilet
Manuel Martínez
Rainer Stiefelhagen
UQCV
105
7
0
02 Jan 2021
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Wei-Ning Hsu
David Harwath
Christopher Song
James R. Glass
CLIP
167
74
0
31 Dec 2020
3D Human motion anticipation and classification
Emad Barsoum
J. Kender
Zicheng Liu
3DH
124
2
0
31 Dec 2020
2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition
Computer Vision and Pattern Recognition (CVPR), 2020
Hengduo Li
Zuxuan Wu
Abhinav Shrivastava
L. Davis
275
34
0
29 Dec 2020
Tensor Representations for Action Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Piotr Koniusz
Lei Wang
A. Cherian
388
80
0
28 Dec 2020
Learning to predict synchronization of coupled oscillators on randomly generated graphs
Scientific Reports (Sci Rep), 2020
Hardeep Bassi
Richard P. Yim
Rohith Koduluka
Joshua Vendrow
Cherlin Zhu
Hanbaek Lyu
230
5
0
28 Dec 2020
Human Action Recognition from Various Data Modalities: A Review
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Zehua Sun
Qiuhong Ke
Hossein Rahmani
Mohammed Bennamoun
Gang Wang
Jun Liu
MU
580
691
0
22 Dec 2020
Anchor-Based Spatio-Temporal Attention 3D Convolutional Networks for Dynamic 3D Point Cloud Sequences
IEEE Transactions on Instrumentation and Measurement (IEEE Trans. Instrum. Meas.), 2020
Guangming Wang
Muyao Chen
Hanwen Liu
Yehui Yang
Yanfeng Guo
Hesheng Wang
3DPC
138
29
0
20 Dec 2020
SMART Frame Selection for Action Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2020
Shreyank N. Gowda
Marcus Rohrbach
Laura Sevilla-Lara
234
163
0
19 Dec 2020
TDN: Temporal Difference Networks for Efficient Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2020
Limin Wang
Zhan Tong
Bin Ji
Gangshan Wu
430
461
0
18 Dec 2020
Smoothed Gaussian Mixture Models for Video Classification and Recommendation
Sirjan Kafle
Aman Gupta
Xue Xia
A. Sankar
Xi Chen
Di Wen
Liang Zhang
110
0
0
17 Dec 2020
GTA: Global Temporal Attention for Video Action Understanding
British Machine Vision Conference (BMVC), 2020
Bo He
Xitong Yang
Zuxuan Wu
Hao Chen
Ser-Nam Lim
Abhinav Shrivastava
ViT
176
27
0
15 Dec 2020
NUTA: Non-uniform Temporal Aggregation for Action Recognition
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020
Xinyu Li
Chunhui Liu
Bing Shuai
Yi Zhu
Hao Chen
Joseph Tighe
ViT
119
17
0
15 Dec 2020
Intrinsic Image Captioning Evaluation
Chao Zeng
Sam Kwong
99
1
0
14 Dec 2020
MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision and Language Research in Turkish
Machine Translation (MT), 2020
Begum Citamak
Ozan Caglayan
Menekse Kuyu
Erkut Erdem
Aykut Erdem
Pranava Madhyastha
Lucia Specia
205
9
0
13 Dec 2020
Convolutional LSTM Neural Networks for Modeling Wildland Fire Dynamics
J. Burge
M. Bonanni
M. Ihme
Lily Hu
164
23
0
11 Dec 2020
A Comprehensive Study of Deep Video Action Recognition
Yi Zhu
Xinyu Li
Chunhui Liu
Mohammadreza Zolfaghari
Yuanjun Xiong
Chongruo Wu
Zhi-Li Zhang
Joseph Tighe
R. Manmatha
Mu Li
VLM
AI4TS
280
209
0
11 Dec 2020
A Log-likelihood Regularized KL Divergence for Video Prediction with A 3D Convolutional Variational Recurrent Network
Haziq Razali
Basura Fernando
DRL
159
6
0
11 Dec 2020
Developing Motion Code Embedding for Action Recognition in Videos
International Conference on Pattern Recognition (ICPR), 2020
Maxat Alibayev
D. Paulius
Yu Sun
159
1
0
10 Dec 2020
Driving Behavior Explanation with Multi-level Fusion
Pattern Recognition (Pattern Recognit.), 2020
H. Ben-younes
Éloi Zablocki
Patrick Pérez
Matthieu Cord
141
37
0
09 Dec 2020
Understanding Action Sequences based on Video Captioning for Learning-from-Observation
Iori Yanokura
Naoki Wake
Kazuhiro Sasabuchi
Katsushi Ikeuchi
Masayuki Inaba
224
6
0
09 Dec 2020
Robust Image Captioning
Daniel Yarnell
Xian Wang
101
0
0
06 Dec 2020
Understanding Guided Image Captioning Performance across Domains
Conference on Computational Natural Language Learning (CoNLL), 2020
Edwin G. Ng
Bo Pang
P. Sharma
Radu Soricut
369
28
0
04 Dec 2020