Long-term Recurrent Convolutional Networks for Visual Recognition and Description

17 November 2014
Jeff Donahue
Lisa Anne Hendricks
Marcus Rohrbach
Subhashini Venugopalan
S. Guadarrama
Kate Saenko
Trevor Darrell
    VLM

Papers citing "Long-term Recurrent Convolutional Networks for Visual Recognition and Description"

50 / 468 papers shown
B-HAR: an open-source baseline framework for in depth study of human activity recognition datasets and workflows
Florenc Demrozi
Cristian Turetta
G. Pravadelli
15
8
0
23 Jan 2021
Human Interaction Recognition Framework based on Interacting Body Part Attention
Dong-Gyu Lee
Seong-Whan Lee
51
25
0
22 Jan 2021
Diagnostic Captioning: A Survey
John Pavlopoulos
Vasiliki Kougia
Ion Androutsopoulos
D. Papamichail
3DV
MedIm
89
26
0
18 Jan 2021
Uncertainty-sensitive Activity Recognition: a Reliability Benchmark and the CARING Models
Alina Roitberg
Monica Haurilet
Manuel Martínez
Rainer Stiefelhagen
UQCV
29
6
0
02 Jan 2021
Tensor Representations for Action Recognition
Piotr Koniusz
Lei Wang
A. Cherian
29
68
0
28 Dec 2020
Learning to predict synchronization of coupled oscillators on randomly generated graphs
Hardeep Bassi
Richard P. Yim
Rohith Koduluka
Joshua Vendrow
Cherlin Zhu
Hanbaek Lyu
11
5
0
28 Dec 2020
GTA: Global Temporal Attention for Video Action Understanding
Bo He
Xitong Yang
Zuxuan Wu
Hao Chen
Ser-Nam Lim
Abhinav Shrivastava
ViT
19
27
0
15 Dec 2020
Convolutional LSTM Neural Networks for Modeling Wildland Fire Dynamics
J. Burge
M. Bonanni
M. Ihme
Lily Hu
11
19
0
11 Dec 2020
Robust Image Captioning
Daniel Yarnell
Xian Wang
13
0
0
06 Dec 2020
BERT-hLSTMs: BERT and Hierarchical LSTMs for Visual Storytelling
Jing Su
Qingyun Dai
Frank Guerin
Mian Zhou
19
24
0
03 Dec 2020
Temporal Stochastic Softmax for 3D CNNs: An Application in Facial Expression Recognition
T. Ayral
M. Pedersoli
Simon L Bacon
Eric Granger
CVBM
3DH
13
11
0
10 Nov 2020
Pose-based Body Language Recognition for Emotion and Psychiatric Symptom Interpretation
Zhengyuan Yang
Amanda Kay
Yuncheng Li
Wendi F. Cross
Jiebo Luo
25
17
0
30 Oct 2020
Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition
Chun-Fu Chen
Rameswar Panda
K. Ramakrishnan
Rogerio Feris
J. M. Cohn
A. Oliva
Quanfu Fan
21
95
0
22 Oct 2020
HAA500: Human-Centric Atomic Action Dataset with Curated Videos
Jihoon Chung
Cheng-hsin Wuu
Hsuan-ru Yang
Yu-Wing Tai
Chi-Keung Tang
13
43
0
11 Sep 2020
Temporal Context Aggregation for Video Retrieval with Contrastive Learning
Jie Shao
Xin Wen
Bingchen Zhao
Xiangyang Xue
AI4TS
30
4
0
04 Aug 2020
Late Temporal Modeling in 3D CNN Architectures with BERT for Action Recognition
M. E. Kalfaoglu
Sinan Kalkan
Aydin Alatan
3DPC
17
140
0
03 Aug 2020
Adversarial Bipartite Graph Learning for Video Domain Adaptation
Yadan Luo
Zi Huang
Zijian Wang
Zheng-Wei Zhang
Mahsa Baktashmotlagh
8
51
0
31 Jul 2020
Enriching Video Captions With Contextual Text
Philipp Rimle
Pelin Dogan
Markus Gross
17
3
0
29 Jul 2020
Approximated Bilinear Modules for Temporal Modeling
Xinqi Zhu
Chang Xu
Langwen Hui
Cewu Lu
Dacheng Tao
9
23
0
25 Jul 2020
A Comprehensive Study on Deep Learning-based Methods for Sign Language Recognition
Nikolas Adaloglou
Theocharis Chatzis
Ilias Papastratis
Andreas Stergioulas
Georgios Th. Papadopoulos
Vassia Zacharopoulou
George J. Xydopoulos
Klimnis Atzakas
D. Papazachariou
P. Daras
24
37
0
24 Jul 2020
Learning to Discretely Compose Reasoning Module Networks for Video Captioning
Ganchao Tan
Daqing Liu
Meng Wang
Zhengjun Zha
LRM
21
73
0
17 Jul 2020
RATT: Recurrent Attention to Transient Tasks for Continual Image Captioning
Riccardo Del Chiaro
Bartlomiej Twardowski
Andrew D. Bagdanov
Joost van de Weijer
CLL
VLM
22
40
0
13 Jul 2020
Learning for Video Compression with Recurrent Auto-Encoder and Recurrent Probability Model
Ren Yang
Fabian Mentzer
Luc Van Gool
Radu Timofte
12
138
0
24 Jun 2020
Improving Image Captioning with Better Use of Captions
Zhan Shi
Xu Zhou
Xipeng Qiu
Xiao-Dan Zhu
22
121
0
21 Jun 2020
Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization
Junting Pan
Siyu Chen
Zheng Shou
Yu Liu
Jing Shao
Hongsheng Li
3DPC
15
150
0
14 Jun 2020
VirTex: Learning Visual Representations from Textual Annotations
Karan Desai
Justin Johnson
SSL
VLM
19
432
0
11 Jun 2020
Visually Guided Sound Source Separation using Cascaded Opponent Filter Network
Lingyu Zhu
Esa Rahtu
14
23
0
04 Jun 2020
Compressing Recurrent Neural Networks Using Hierarchical Tucker Tensor Decomposition
Miao Yin
Siyu Liao
Xiao-Yang Liu
Xiaodong Wang
Bo Yuan
26
24
0
09 May 2020
Low-latency hand gesture recognition with a low resolution thermal imager
Maarten Vandersteegen
Wouter Reusen
Kristof Van Beeck
14
15
0
24 Apr 2020
Recursive Social Behavior Graph for Trajectory Prediction
Jianhua Sun
Qinhong Jiang
Cewu Lu
GNN
16
158
0
22 Apr 2020
Would Mega-scale Datasets Further Enhance Spatiotemporal 3D CNNs?
Hirokatsu Kataoka
Tenga Wakamiya
Kensho Hara
Y. Satoh
3DPC
20
87
0
10 Apr 2020
X3D: Expanding Architectures for Efficient Video Recognition
Christoph Feichtenhofer
66
998
0
09 Apr 2020
Explaining Motion Relevance for Activity Recognition in Video Deep Learning Models
Liam Hiley
Alun D. Preece
Y. Hicks
Supriyo Chakraborty
Prudhvi K. Gurram
Richard J. Tomsett
FAtt
9
14
0
31 Mar 2020
Fashion Meets Computer Vision: A Survey
Wen-Huang Cheng
Sijie Song
Chieh-Yun Chen
S. Hidayati
Jiaying Liu
AI4TS
28
96
0
31 Mar 2020
Actor-Transformers for Group Activity Recognition
Kirill Gavrilyuk
Ryan Sanford
Mehrsan Javan
Cees G. M. Snoek
ViT
19
178
0
28 Mar 2020
A Driver Fatigue Recognition Algorithm Based on Spatio-Temporal Feature Sequence
Chen Zhang
Xiaobo Lu
Zhiliang Huang
3DH
CVBM
6
7
0
18 Mar 2020
Identity Recognition in Intelligent Cars with Behavioral Data and LSTM-ResNet Classifier
Michael Hammann
Maximilian Kraus
S. Shafaei
Alois C. Knoll
11
3
0
02 Mar 2020
Deep Multimodal Image-Text Embeddings for Automatic Cross-Media Retrieval
Hadi Abdi Khojasteh
Ebrahim Ansari
Parvin Razzaghi
Akbar Karimi
VLM
6
4
0
23 Feb 2020
Fine-Grained Instance-Level Sketch-Based Video Retrieval
Peng-Tao Xu
Kun Liu
Tao Xiang
Timothy M. Hospedales
Zhanyu Ma
Jun Guo
Yi-Zhe Song
16
32
0
21 Feb 2020
MRRC: Multiple Role Representation Crossover Interpretation for Image Captioning With R-CNN Feature Distribution Composition (FDC)
C. Sur
23
16
0
15 Feb 2020
Temporal Interlacing Network
Hao Shao
Shengju Qian
Yu Liu
8
91
0
17 Jan 2020
Understanding and mitigating gradient pathologies in physics-informed neural networks
Sifan Wang
Yujun Teng
P. Perdikaris
AI4CE
PINN
16
289
0
13 Jan 2020
Video Coding for Machines: A Paradigm of Collaborative Compression and Intelligent Analytics
Ling-yu Duan
Jiaying Liu
Wenhan Yang
Tiejun Huang
Wen Gao
22
186
0
10 Jan 2020
Personalizing Fast-Forward Videos Based on Visual and Textual Features from Social Network
W. Ramos
M. Silva
Edson Roteia Araujo Junior
Alan C. Neves
Erickson R. Nascimento
14
6
0
29 Dec 2019
Emotion Recognition from Speech
Kannan Venkataramanan
H. Rajamohan
11
15
0
22 Dec 2019
CNN-LSTM models for Multi-Speaker Source Separation using Bayesian Hyper Parameter Optimization
Jeroen Zegers
Hugo Van hamme
BDL
26
7
0
19 Dec 2019
Going Beneath the Surface: Evaluating Image Captioning for Grammaticality, Truthfulness and Diversity
Huiyuan Xie
Tom Sherborne
A. Kuhnle
Ann A. Copestake
DiffM
17
9
0
19 Dec 2019
Meshed-Memory Transformer for Image Captioning
Marcella Cornia
Matteo Stefanini
Lorenzo Baraldi
Rita Cucchiara
14
868
0
17 Dec 2019
Listen to Look: Action Recognition by Previewing Audio
Ruohan Gao
Tae-Hyun Oh
Kristen Grauman
Lorenzo Torresani
VLM
27
251
0
10 Dec 2019
More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal Aggregation
Quanfu Fan
Chun-Fu Chen
Hilde Kuehne
Marco Pistoia
David D. Cox
16
126
0
02 Dec 2019