Long-term Recurrent Convolutional Networks for Visual Recognition and Description

17 November 2014

Jeff Donahue

Lisa Anne Hendricks

Marcus Rohrbach

Subhashini Venugopalan

Papers citing "Long-term Recurrent Convolutional Networks for Visual Recognition and Description"

50 / 468 papers shown

Title
B-HAR: an open-source baseline framework for in depth study of human activity recognition datasets and workflows Florenc Demrozi Cristian Turetta G. Pravadelli 15 8 0 23 Jan 2021
Human Interaction Recognition Framework based on Interacting Body Part Attention Dong-Gyu Lee Seong-Whan Lee 51 25 0 22 Jan 2021
Diagnostic Captioning: A Survey John Pavlopoulos Vasiliki Kougia Ion Androutsopoulos D. Papamichail 3DV MedIm 89 26 0 18 Jan 2021
Uncertainty-sensitive Activity Recognition: a Reliability Benchmark and the CARING Models Alina Roitberg Monica Haurilet Manuel Martínez Rainer Stiefelhagen UQCV 29 6 0 02 Jan 2021
Tensor Representations for Action Recognition Piotr Koniusz Lei Wang A. Cherian 29 68 0 28 Dec 2020
Learning to predict synchronization of coupled oscillators on randomly generated graphs Hardeep Bassi Richard P. Yim Rohith Koduluka Joshua Vendrow Cherlin Zhu Hanbaek Lyu 11 5 0 28 Dec 2020
GTA: Global Temporal Attention for Video Action Understanding Bo He Xitong Yang Zuxuan Wu Hao Chen Ser-Nam Lim Abhinav Shrivastava ViT 19 27 0 15 Dec 2020
Convolutional LSTM Neural Networks for Modeling Wildland Fire Dynamics J. Burge M. Bonanni M. Ihme Lily Hu 11 19 0 11 Dec 2020
Robust Image Captioning Daniel Yarnell Xian Wang 13 0 0 06 Dec 2020
BERT-hLSTMs: BERT and Hierarchical LSTMs for Visual Storytelling Jing Su Qingyun Dai Frank Guerin Mian Zhou 19 24 0 03 Dec 2020
Temporal Stochastic Softmax for 3D CNNs: An Application in Facial Expression Recognition T. Ayral M. Pedersoli Simon L Bacon Eric Granger CVBM 3DH 13 11 0 10 Nov 2020
Pose-based Body Language Recognition for Emotion and Psychiatric Symptom Interpretation Zhengyuan Yang Amanda Kay Yuncheng Li Wendi F. Cross Jiebo Luo 25 17 0 30 Oct 2020
Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition Chun-Fu Chen Rameswar Panda K. Ramakrishnan Rogerio Feris J. M. Cohn A. Oliva Quanfu Fan 21 95 0 22 Oct 2020
HAA500: Human-Centric Atomic Action Dataset with Curated Videos Jihoon Chung Cheng-hsin Wuu Hsuan-ru Yang Yu-Wing Tai Chi-Keung Tang 13 43 0 11 Sep 2020
Temporal Context Aggregation for Video Retrieval with Contrastive Learning Jie Shao Xin Wen Bingchen Zhao Xiangyang Xue AI4TS 30 4 0 04 Aug 2020
Late Temporal Modeling in 3D CNN Architectures with BERT for Action Recognition M. E. Kalfaoglu Sinan Kalkan Aydin Alatan 3DPC 17 140 0 03 Aug 2020
Adversarial Bipartite Graph Learning for Video Domain Adaptation Yadan Luo Zi Huang Zijian Wang Zheng-Wei Zhang Mahsa Baktashmotlagh 8 51 0 31 Jul 2020
Enriching Video Captions With Contextual Text Philipp Rimle Pelin Dogan Markus Gross 17 3 0 29 Jul 2020
Approximated Bilinear Modules for Temporal Modeling Xinqi Zhu Chang Xu Langwen Hui Cewu Lu Dacheng Tao 9 23 0 25 Jul 2020
A Comprehensive Study on Deep Learning-based Methods for Sign Language Recognition Nikolas Adaloglou Theocharis Chatzis Ilias Papastratis Andreas Stergioulas Georgios Th. Papadopoulos Vassia Zacharopoulou George J. Xydopoulos Klimnis Atzakas D. Papazachariou P. Daras 24 37 0 24 Jul 2020
Learning to Discretely Compose Reasoning Module Networks for Video Captioning Ganchao Tan Daqing Liu Meng Wang Zhengjun Zha LRM 21 73 0 17 Jul 2020
RATT: Recurrent Attention to Transient Tasks for Continual Image Captioning Riccardo Del Chiaro Bartlomiej Twardowski Andrew D. Bagdanov Joost van de Weijer CLL VLM 22 40 0 13 Jul 2020
Learning for Video Compression with Recurrent Auto-Encoder and Recurrent Probability Model Ren Yang Fabian Mentzer Luc Van Gool Radu Timofte 12 138 0 24 Jun 2020
Improving Image Captioning with Better Use of Captions Zhan Shi Xu Zhou Xipeng Qiu Xiao-Dan Zhu 22 121 0 21 Jun 2020
Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization Junting Pan Siyu Chen Zheng Shou Yu Liu Jing Shao Hongsheng Li 3DPC 15 150 0 14 Jun 2020
VirTex: Learning Visual Representations from Textual Annotations Karan Desai Justin Johnson SSL VLM 19 432 0 11 Jun 2020
Visually Guided Sound Source Separation using Cascaded Opponent Filter Network Lingyu Zhu Esa Rahtu 14 23 0 04 Jun 2020
Compressing Recurrent Neural Networks Using Hierarchical Tucker Tensor Decomposition Miao Yin Siyu Liao Xiao-Yang Liu Xiaodong Wang Bo Yuan 26 24 0 09 May 2020
Low-latency hand gesture recognition with a low resolution thermal imager Maarten Vandersteegen Wouter Reusen Kristof Van Beeck 14 15 0 24 Apr 2020
Recursive Social Behavior Graph for Trajectory Prediction Jianhua Sun Qinhong Jiang Cewu Lu GNN 16 158 0 22 Apr 2020
Would Mega-scale Datasets Further Enhance Spatiotemporal 3D CNNs? Hirokatsu Kataoka Tenga Wakamiya Kensho Hara Y. Satoh 3DPC 20 87 0 10 Apr 2020
X3D: Expanding Architectures for Efficient Video Recognition Christoph Feichtenhofer 66 998 0 09 Apr 2020
Explaining Motion Relevance for Activity Recognition in Video Deep Learning Models Liam Hiley Alun D. Preece Y. Hicks Supriyo Chakraborty Prudhvi K. Gurram Richard J. Tomsett FAtt 9 14 0 31 Mar 2020
Fashion Meets Computer Vision: A Survey Wen-Huang Cheng Sijie Song Chieh-Yun Chen S. Hidayati Jiaying Liu AI4TS 28 96 0 31 Mar 2020
Actor-Transformers for Group Activity Recognition Kirill Gavrilyuk Ryan Sanford Mehrsan Javan Cees G. M. Snoek ViT 19 178 0 28 Mar 2020
A Driver Fatigue Recognition Algorithm Based on Spatio-Temporal Feature Sequence Chen Zhang Xiaobo Lu Zhiliang Huang 3DH CVBM 6 7 0 18 Mar 2020
Identity Recognition in Intelligent Cars with Behavioral Data and LSTM-ResNet Classifier Michael Hammann Maximilian Kraus S. Shafaei Alois C. Knoll 11 3 0 02 Mar 2020
Deep Multimodal Image-Text Embeddings for Automatic Cross-Media Retrieval Hadi Abdi Khojasteh Ebrahim Ansari Parvin Razzaghi Akbar Karimi VLM 6 4 0 23 Feb 2020
Fine-Grained Instance-Level Sketch-Based Video Retrieval Peng-Tao Xu Kun Liu Tao Xiang Timothy M. Hospedales Zhanyu Ma Jun Guo Yi-Zhe Song 16 32 0 21 Feb 2020
MRRC: Multiple Role Representation Crossover Interpretation for Image Captioning With R-CNN Feature Distribution Composition (FDC) C. Sur 23 16 0 15 Feb 2020
Temporal Interlacing Network Hao Shao Shengju Qian Yu Liu 8 91 0 17 Jan 2020
Understanding and mitigating gradient pathologies in physics-informed neural networks Sifan Wang Yujun Teng P. Perdikaris AI4CE PINN 16 289 0 13 Jan 2020
Video Coding for Machines: A Paradigm of Collaborative Compression and Intelligent Analytics Ling-yu Duan Jiaying Liu Wenhan Yang Tiejun Huang Wen Gao 22 186 0 10 Jan 2020
Personalizing Fast-Forward Videos Based on Visual and Textual Features from Social Network W. Ramos M. Silva Edson Roteia Araujo Junior Alan C. Neves Erickson R. Nascimento 14 6 0 29 Dec 2019
Emotion Recognition from Speech Kannan Venkataramanan H. Rajamohan 11 15 0 22 Dec 2019
CNN-LSTM models for Multi-Speaker Source Separation using Bayesian Hyper Parameter Optimization Jeroen Zegers Hugo Van hamme BDL 26 7 0 19 Dec 2019
Going Beneath the Surface: Evaluating Image Captioning for Grammaticality, Truthfulness and Diversity Huiyuan Xie Tom Sherborne A. Kuhnle Ann A. Copestake DiffM 17 9 0 19 Dec 2019
Meshed-Memory Transformer for Image Captioning Marcella Cornia Matteo Stefanini Lorenzo Baraldi Rita Cucchiara 14 868 0 17 Dec 2019
Listen to Look: Action Recognition by Previewing Audio Ruohan Gao Tae-Hyun Oh Kristen Grauman Lorenzo Torresani VLM 27 251 0 10 Dec 2019
More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal Aggregation Quanfu Fan Chun-Fu Chen Hilde Kuehne Marco Pistoia David D. Cox 16 126 0 02 Dec 2019