Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1411.4389
Cited By
Long-term Recurrent Convolutional Networks for Visual Recognition and Description
17 November 2014
Jeff Donahue
Lisa Anne Hendricks
Marcus Rohrbach
Subhashini Venugopalan
S. Guadarrama
Kate Saenko
Trevor Darrell
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Long-term Recurrent Convolutional Networks for Visual Recognition and Description"
50 / 468 papers shown
Title
B-HAR: an open-source baseline framework for in depth study of human activity recognition datasets and workflows
Florenc Demrozi
Cristian Turetta
G. Pravadelli
15
8
0
23 Jan 2021
Human Interaction Recognition Framework based on Interacting Body Part Attention
Dong-Gyu Lee
Seong-Whan Lee
51
25
0
22 Jan 2021
Diagnostic Captioning: A Survey
John Pavlopoulos
Vasiliki Kougia
Ion Androutsopoulos
D. Papamichail
3DV
MedIm
89
26
0
18 Jan 2021
Uncertainty-sensitive Activity Recognition: a Reliability Benchmark and the CARING Models
Alina Roitberg
Monica Haurilet
Manuel Martínez
Rainer Stiefelhagen
UQCV
29
6
0
02 Jan 2021
Tensor Representations for Action Recognition
Piotr Koniusz
Lei Wang
A. Cherian
29
68
0
28 Dec 2020
Learning to predict synchronization of coupled oscillators on randomly generated graphs
Hardeep Bassi
Richard P. Yim
Rohith Koduluka
Joshua Vendrow
Cherlin Zhu
Hanbaek Lyu
11
5
0
28 Dec 2020
GTA: Global Temporal Attention for Video Action Understanding
Bo He
Xitong Yang
Zuxuan Wu
Hao Chen
Ser-Nam Lim
Abhinav Shrivastava
ViT
19
27
0
15 Dec 2020
Convolutional LSTM Neural Networks for Modeling Wildland Fire Dynamics
J. Burge
M. Bonanni
M. Ihme
Lily Hu
11
19
0
11 Dec 2020
Robust Image Captioning
Daniel Yarnell
Xian Wang
13
0
0
06 Dec 2020
BERT-hLSTMs: BERT and Hierarchical LSTMs for Visual Storytelling
Jing Su
Qingyun Dai
Frank Guerin
Mian Zhou
19
24
0
03 Dec 2020
Temporal Stochastic Softmax for 3D CNNs: An Application in Facial Expression Recognition
T. Ayral
M. Pedersoli
Simon L Bacon
Eric Granger
CVBM
3DH
13
11
0
10 Nov 2020
Pose-based Body Language Recognition for Emotion and Psychiatric Symptom Interpretation
Zhengyuan Yang
Amanda Kay
Yuncheng Li
Wendi F. Cross
Jiebo Luo
25
17
0
30 Oct 2020
Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition
Chun-Fu Chen
Rameswar Panda
K. Ramakrishnan
Rogerio Feris
J. M. Cohn
A. Oliva
Quanfu Fan
21
95
0
22 Oct 2020
HAA500: Human-Centric Atomic Action Dataset with Curated Videos
Jihoon Chung
Cheng-hsin Wuu
Hsuan-ru Yang
Yu-Wing Tai
Chi-Keung Tang
13
43
0
11 Sep 2020
Temporal Context Aggregation for Video Retrieval with Contrastive Learning
Jie Shao
Xin Wen
Bingchen Zhao
Xiangyang Xue
AI4TS
30
4
0
04 Aug 2020
Late Temporal Modeling in 3D CNN Architectures with BERT for Action Recognition
M. E. Kalfaoglu
Sinan Kalkan
Aydin Alatan
3DPC
17
140
0
03 Aug 2020
Adversarial Bipartite Graph Learning for Video Domain Adaptation
Yadan Luo
Zi Huang
Zijian Wang
Zheng-Wei Zhang
Mahsa Baktashmotlagh
8
51
0
31 Jul 2020
Enriching Video Captions With Contextual Text
Philipp Rimle
Pelin Dogan
Markus Gross
17
3
0
29 Jul 2020
Approximated Bilinear Modules for Temporal Modeling
Xinqi Zhu
Chang Xu
Langwen Hui
Cewu Lu
Dacheng Tao
9
23
0
25 Jul 2020
A Comprehensive Study on Deep Learning-based Methods for Sign Language Recognition
Nikolas Adaloglou
Theocharis Chatzis
Ilias Papastratis
Andreas Stergioulas
Georgios Th. Papadopoulos
Vassia Zacharopoulou
George J. Xydopoulos
Klimnis Atzakas
D. Papazachariou
P. Daras
24
37
0
24 Jul 2020
Learning to Discretely Compose Reasoning Module Networks for Video Captioning
Ganchao Tan
Daqing Liu
Meng Wang
Zhengjun Zha
LRM
21
73
0
17 Jul 2020
RATT: Recurrent Attention to Transient Tasks for Continual Image Captioning
Riccardo Del Chiaro
Bartlomiej Twardowski
Andrew D. Bagdanov
Joost van de Weijer
CLL
VLM
22
40
0
13 Jul 2020
Learning for Video Compression with Recurrent Auto-Encoder and Recurrent Probability Model
Ren Yang
Fabian Mentzer
Luc Van Gool
Radu Timofte
12
138
0
24 Jun 2020
Improving Image Captioning with Better Use of Captions
Zhan Shi
Xu Zhou
Xipeng Qiu
Xiao-Dan Zhu
22
121
0
21 Jun 2020
Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization
Junting Pan
Siyu Chen
Zheng Shou
Yu Liu
Jing Shao
Hongsheng Li
3DPC
15
150
0
14 Jun 2020
VirTex: Learning Visual Representations from Textual Annotations
Karan Desai
Justin Johnson
SSL
VLM
19
432
0
11 Jun 2020
Visually Guided Sound Source Separation using Cascaded Opponent Filter Network
Lingyu Zhu
Esa Rahtu
14
23
0
04 Jun 2020
Compressing Recurrent Neural Networks Using Hierarchical Tucker Tensor Decomposition
Miao Yin
Siyu Liao
Xiao-Yang Liu
Xiaodong Wang
Bo Yuan
26
24
0
09 May 2020
Low-latency hand gesture recognition with a low resolution thermal imager
Maarten Vandersteegen
Wouter Reusen
Kristof Van Beeck
14
15
0
24 Apr 2020
Recursive Social Behavior Graph for Trajectory Prediction
Jianhua Sun
Qinhong Jiang
Cewu Lu
GNN
16
158
0
22 Apr 2020
Would Mega-scale Datasets Further Enhance Spatiotemporal 3D CNNs?
Hirokatsu Kataoka
Tenga Wakamiya
Kensho Hara
Y. Satoh
3DPC
20
87
0
10 Apr 2020
X3D: Expanding Architectures for Efficient Video Recognition
Christoph Feichtenhofer
66
998
0
09 Apr 2020
Explaining Motion Relevance for Activity Recognition in Video Deep Learning Models
Liam Hiley
Alun D. Preece
Y. Hicks
Supriyo Chakraborty
Prudhvi K. Gurram
Richard J. Tomsett
FAtt
9
14
0
31 Mar 2020
Fashion Meets Computer Vision: A Survey
Wen-Huang Cheng
Sijie Song
Chieh-Yun Chen
S. Hidayati
Jiaying Liu
AI4TS
28
96
0
31 Mar 2020
Actor-Transformers for Group Activity Recognition
Kirill Gavrilyuk
Ryan Sanford
Mehrsan Javan
Cees G. M. Snoek
ViT
19
178
0
28 Mar 2020
A Driver Fatigue Recognition Algorithm Based on Spatio-Temporal Feature Sequence
Chen Zhang
Xiaobo Lu
Zhiliang Huang
3DH
CVBM
6
7
0
18 Mar 2020
Identity Recognition in Intelligent Cars with Behavioral Data and LSTM-ResNet Classifier
Michael Hammann
Maximilian Kraus
S. Shafaei
Alois C. Knoll
11
3
0
02 Mar 2020
Deep Multimodal Image-Text Embeddings for Automatic Cross-Media Retrieval
Hadi Abdi Khojasteh
Ebrahim Ansari
Parvin Razzaghi
Akbar Karimi
VLM
6
4
0
23 Feb 2020
Fine-Grained Instance-Level Sketch-Based Video Retrieval
Peng-Tao Xu
Kun Liu
Tao Xiang
Timothy M. Hospedales
Zhanyu Ma
Jun Guo
Yi-Zhe Song
16
32
0
21 Feb 2020
MRRC: Multiple Role Representation Crossover Interpretation for Image Captioning With R-CNN Feature Distribution Composition (FDC)
C. Sur
23
16
0
15 Feb 2020
Temporal Interlacing Network
Hao Shao
Shengju Qian
Yu Liu
8
91
0
17 Jan 2020
Understanding and mitigating gradient pathologies in physics-informed neural networks
Sifan Wang
Yujun Teng
P. Perdikaris
AI4CE
PINN
16
289
0
13 Jan 2020
Video Coding for Machines: A Paradigm of Collaborative Compression and Intelligent Analytics
Ling-yu Duan
Jiaying Liu
Wenhan Yang
Tiejun Huang
Wen Gao
22
186
0
10 Jan 2020
Personalizing Fast-Forward Videos Based on Visual and Textual Features from Social Network
W. Ramos
M. Silva
Edson Roteia Araujo Junior
Alan C. Neves
Erickson R. Nascimento
14
6
0
29 Dec 2019
Emotion Recognition from Speech
Kannan Venkataramanan
H. Rajamohan
11
15
0
22 Dec 2019
CNN-LSTM models for Multi-Speaker Source Separation using Bayesian Hyper Parameter Optimization
Jeroen Zegers
Hugo Van hamme
BDL
26
7
0
19 Dec 2019
Going Beneath the Surface: Evaluating Image Captioning for Grammaticality, Truthfulness and Diversity
Huiyuan Xie
Tom Sherborne
A. Kuhnle
Ann A. Copestake
DiffM
17
9
0
19 Dec 2019
Meshed-Memory Transformer for Image Captioning
Marcella Cornia
Matteo Stefanini
Lorenzo Baraldi
Rita Cucchiara
14
868
0
17 Dec 2019
Listen to Look: Action Recognition by Previewing Audio
Ruohan Gao
Tae-Hyun Oh
Kristen Grauman
Lorenzo Torresani
VLM
27
251
0
10 Dec 2019
More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal Aggregation
Quanfu Fan
Chun-Fu Chen
Hilde Kuehne
Marco Pistoia
David D. Cox
16
126
0
02 Dec 2019
Previous
1
2
3
4
5
6
...
8
9
10
Next