v1v2v3v4 (latest)

Long-term Recurrent Convolutional Networks for Visual Recognition and Description

Computer Vision and Pattern Recognition (CVPR), 2014

17 November 2014

Jeff Donahue

Lisa Anne Hendricks

Marcus Rohrbach

Subhashini Venugopalan

Papers citing "Long-term Recurrent Convolutional Networks for Visual Recognition and Description"

50 / 1,728 papers shown

Image Captioning using Deep Stacked LSTMs, Contextual Word Embeddings and Data Augmentation

Sulabh Katiyar

S. Borgohain

VLM

132

22 Feb 2021

VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image CaptioningComputer Vision and Pattern Recognition (CVPR), 2021

428

273

20 Feb 2021

One-shot action recognition in challenging therapy scenarios

396

17 Feb 2021

Classification of multivariate weakly-labelled time-series with attention

S. Rahman

Chang Wei Tan

138

16 Feb 2021

Learning to Recognize Actions on Objects in Egocentric Video with Attention DictionariesIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021

Swathikiran Sudhakaran

Sergio Escalera

Oswald Lanz

EgoV

206

16 Feb 2021

Win-Fail Action Recognition

Paritosh Parmar

B. Morris

155

15 Feb 2021

Improved Bengali Image Captioning via deep convolutional neural network based encoder-decoder model

122

14 Feb 2021

Learning Self-Similarity in Space and Time as Generalized Motion for Video Action RecognitionIEEE International Conference on Computer Vision (ICCV), 2021

185

14 Feb 2021

AdaFuse: Adaptive Temporal Fusion Network for Efficient Action RecognitionInternational Conference on Learning Representations (ICLR), 2021

291

10 Feb 2021

In Defense of Scene Graphs for Image CaptioningIEEE International Conference on Computer Vision (ICCV), 2021

203

09 Feb 2021

Face Recognition using 3D CNNsTransactions on Computer Systems and Networks (TCSN), 2021

N. Mishra

S. Singh

3DH CVBM

02 Feb 2021

Automatic Detection of B-lines in Lung Ultrasound Videos From Severe Dengue PatientsIEEE International Symposium on Biomedical Imaging (ISBI), 2021

181

01 Feb 2021

Video Transformer Network

774

474

01 Feb 2021

Open-domain Topic Identification of Out-of-domain Utterances using Wikipedia

A. Augustin

Alexandros Papangelis

100

26 Jan 2021

A Case Study of Deep Learning Based Multi-Modal Methods for Predicting the Age-Suitability Rating of Movie Trailers

795

26 Jan 2021

Probability Trajectory: One New Movement Description for Trajectory Prediction

138

26 Jan 2021

B-HAR: an open-source baseline framework for in depth study of human activity recognition datasets and workflowsIEEE Access (IEEE Access), 2021

Florenc Demrozi

Cristian Turetta

G. Pravadelli

141

23 Jan 2021

Human Interaction Recognition Framework based on Interacting Body Part AttentionPattern Recognition (Pattern Recogn.), 2021

Dong-Gyu Lee

Seong-Whan Lee

167

22 Jan 2021

Bridging the gap between Human Action Recognition and Online Action Detection

Alban Main De Boissiere

R. Noumeir

181

21 Jan 2021

Learning rich touch representations through cross-modal self-supervisionConference on Robot Learning (CoRL), 2021

195

21 Jan 2021

Diagnostic Captioning: A SurveyKnowledge and Information Systems (KAIS), 2021

231

18 Jan 2021

Neural networks behave as hash encoders: An empirical study

165

14 Jan 2021

Multimodal Engagement Analysis from Facial Videos in the ClassroomIEEE Transactions on Affective Computing (TAC), 2021

337

112

11 Jan 2021

Unifying Relational Sentence Generation and Retrieval for Medical Image Report CompositionIEEE Transactions on Cybernetics (IEEE Trans. Cybern.), 2020

Xiaodan Liang

182

09 Jan 2021

A Reinforcement Learning Based Encoder-Decoder Framework for Learning Stock Trading Rules

182

08 Jan 2021

Dairy Cow rumination detection: A deep learning approachInternational Workshop on Distributed Computing for Emerging Smart Networks (DCESN), 2021

07 Jan 2021

Reinforcement Learning with Latent FlowNeural Information Processing Systems (NeurIPS), 2021

Aravind Rajeswaran

Pieter Abbeel

158

06 Jan 2021

Uncertainty-sensitive Activity Recognition: a Reliability Benchmark and the CARING ModelsInternational Conference on Pattern Recognition (ICPR), 2021

105

02 Jan 2021

Text-Free Image-to-Speech Synthesis Using Learned Segmental UnitsAnnual Meeting of the Association for Computational Linguistics (ACL), 2020

167

31 Dec 2020

3D Human motion anticipation and classification

Emad Barsoum

J. Kender

Zicheng Liu

3DH

124

31 Dec 2020

2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video RecognitionComputer Vision and Pattern Recognition (CVPR), 2020

Hengduo Li

Zuxuan Wu

Abhinav Shrivastava

L. Davis

275

29 Dec 2020

Tensor Representations for Action RecognitionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020

Piotr Koniusz

Lei Wang

A. Cherian

388

28 Dec 2020

Learning to predict synchronization of coupled oscillators on randomly generated graphsScientific Reports (Sci Rep), 2020

230

28 Dec 2020

Human Action Recognition from Various Data Modalities: A ReviewIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020

Zehua Sun

Jun Liu

580

691

22 Dec 2020

Anchor-Based Spatio-Temporal Attention 3D Convolutional Networks for Dynamic 3D Point Cloud SequencesIEEE Transactions on Instrumentation and Measurement (IEEE Trans. Instrum. Meas.), 2020

Guangming Wang

Hesheng Wang

138

20 Dec 2020

SMART Frame Selection for Action RecognitionAAAI Conference on Artificial Intelligence (AAAI), 2020

Shreyank N. Gowda

Marcus Rohrbach

Laura Sevilla-Lara

234

163

19 Dec 2020

TDN: Temporal Difference Networks for Efficient Action RecognitionComputer Vision and Pattern Recognition (CVPR), 2020

Limin Wang

Zhan Tong

Bin Ji

Gangshan Wu

430

461

18 Dec 2020

Smoothed Gaussian Mixture Models for Video Classification and Recommendation

110

17 Dec 2020

GTA: Global Temporal Attention for Video Action UnderstandingBritish Machine Vision Conference (BMVC), 2020

Bo He

Xitong Yang

Zuxuan Wu

Hao Chen

Ser-Nam Lim

Abhinav Shrivastava

ViT

176

15 Dec 2020

NUTA: Non-uniform Temporal Aggregation for Action RecognitionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020

Hao Chen

119

15 Dec 2020

Intrinsic Image Captioning Evaluation

Chao Zeng

Sam Kwong

14 Dec 2020

MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision and Language Research in TurkishMachine Translation (MT), 2020

Pranava Madhyastha

205

13 Dec 2020

Convolutional LSTM Neural Networks for Modeling Wildland Fire Dynamics

164

11 Dec 2020

A Comprehensive Study of Deep Video Action Recognition

Yi Zhu

Xinyu Li

Chunhui Liu

Mohammadreza Zolfaghari

280

209

11 Dec 2020

A Log-likelihood Regularized KL Divergence for Video Prediction with A 3D Convolutional Variational Recurrent Network

Haziq Razali

Basura Fernando

DRL

159

11 Dec 2020

Developing Motion Code Embedding for Action Recognition in VideosInternational Conference on Pattern Recognition (ICPR), 2020

Maxat Alibayev

D. Paulius

Yu Sun

159

10 Dec 2020

Driving Behavior Explanation with Multi-level FusionPattern Recognition (Pattern Recognit.), 2020

141

09 Dec 2020

Understanding Action Sequences based on Video Captioning for Learning-from-Observation

224

09 Dec 2020

Robust Image Captioning

Daniel Yarnell

Xian Wang

101

06 Dec 2020

Understanding Guided Image Captioning Performance across DomainsConference on Computational Natural Language Learning (CoNLL), 2020

369

04 Dec 2020