A Closer Look at Spatiotemporal Convolutions for Action Recognition

30 November 2017

Heng Wang

Papers citing "A Closer Look at Spatiotemporal Convolutions for Action Recognition"

50 / 1,270 papers shown

Title
Semi-Supervised Learning for Sparsely-Labeled Sequential Data: Application to Healthcare Video Processing Florian Dubost Erin Hong Nandita Bhaskhar Siyi Tang D. Rubin Christopher Lee-Messer NoLa 16 0 0 28 Nov 2020
Recent Progress in Appearance-based Action Recognition J. Humphreys Zhe Chen Dacheng Tao 24 0 0 25 Nov 2020
A3D: Adaptive 3D Networks for Video Action Recognition Sijie Zhu Taojiannan Yang Matías Mendieta Chong Chen 3DH 32 12 0 24 Nov 2020
Play Fair: Frame Attributions in Video Models Will Price Dima Damen FAtt 31 5 0 24 Nov 2020
KShapeNet: Riemannian network on Kendall shape space for Skeleton based Action Recognition Racha Friji Hassen Drira F. Chaieb S. Kurtek Hamza Kchok 3DPC 22 2 0 24 Nov 2020
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks Humam Alwassel Silvio Giancola Guohao Li 33 123 0 23 Nov 2020
Hierarchically Decoupled Spatial-Temporal Contrast for Self-supervised Video Representation Learning Zehua Zhang David J. Crandall AI4TS SSL 28 23 0 23 Nov 2020
The complementarity of a diverse range of deep learning features extracted from video content for video recommendation A. Almeida J. D. Villiers A. Freitas Mergandran Velayudan 19 16 0 21 Nov 2020
DoDNet: Learning to segment multi-organ and tumors from multiple partially labeled datasets Jianpeng Zhang Yutong Xie Yong-quan Xia Chunhua Shen 22 155 0 20 Nov 2020
Master Thesis: Neural Sign Language Translation by Learning Tokenization Alptekin Orbay SLR 12 0 0 18 Nov 2020
3D CNNs with Adaptive Temporal Feature Resolutions Mohsen Fayyaz Emad Bahrami Rad Ali Diba M. Noroozi Ehsan Adeli Luc Van Gool Juergen Gall 3DPC 24 30 0 17 Nov 2020
Audio-Visual Event Recognition through the lens of Adversary Juncheng Li Kaixin Ma Shuhui Qu Po-Yao (Bernie) Huang Florian Metze AAML 8 9 0 15 Nov 2020
ActBERT: Learning Global-Local Video-Text Representations Linchao Zhu Yi Yang ViT 49 417 0 14 Nov 2020
Adding Knowledge to Unsupervised Algorithms for the Recognition of Intent Stuart Synakowski Qianli Feng Aleix M. Martinez OCL 14 6 0 12 Nov 2020
Ontology-driven Event Type Classification in Images Eric Müller-Budack Matthias Springstein Sherzod Hakimov Kevin Mrutzek Ralph Ewerth 19 9 0 09 Nov 2020
Multi-Temporal Convolutions for Human Action Recognition in Videos Alexandros Stergiou R. Poppe 29 1 0 08 Nov 2020
Predictive Process Model Monitoring using Recurrent Neural Networks Johannes De Smedt Jochen De Weerdt 25 0 0 05 Nov 2020
Mutual Modality Learning for Video Action Classification Stepan Alekseevich Komkov Maksim Dzabraev Aleksandr Petiushko 27 9 0 04 Nov 2020
Learning Representations from Audio-Visual Spatial Alignment Pedro Morgado Yi Li Nuno Vasconcelos SSL 27 121 0 03 Nov 2020
PV-NAS: Practical Neural Architecture Search for Video Recognition Zihao Wang Chen Lin Lu Sheng Junjie Yan Jing Shao ViT 17 7 0 02 Nov 2020
Pretext-Contrastive Learning: Toward Good Practices in Self-supervised Video Representation Leaning L. Tao Xueting Wang T. Yamasaki VLM SSL 23 14 0 29 Oct 2020
SAR-NAS: Skeleton-based Action Recognition via Neural Architecture Searching Haoyuan Zhang Yonghong Hou Pichao Wang Zihui Guo Wanqing Li 32 15 0 29 Oct 2020
Spatio-temporal Features for Generalized Detection of Deepfake Videos Ipek Ganiyusufoglu L. Ngô N. Savov Sezer Karaoglu Theo Gevers 32 41 0 22 Oct 2020
Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition Chun-Fu Chen Yikang Shen K. Ramakrishnan Rogerio Feris J. M. Cohn A. Oliva Quanfu Fan 23 95 0 22 Oct 2020
Extraction of Discrete Spectra Modes from Video Data Using a Deep Convolutional Koopman Network S. Leask V. McDonell 11 1 0 19 Oct 2020
Hierarchical Conditional Relation Networks for Multimodal Video Question Answering T. Le Vuong Le Svetha Venkatesh T. Tran BDL 24 22 0 18 Oct 2020
VolumeNet: A Lightweight Parallel Network for Super-Resolution of Medical Volumetric Data Yinhao Li Yutaro Iwamoto Lanfen Lin R. Xu Yenwei Chen SupR 29 38 0 16 Oct 2020
Pose And Joint-Aware Action Recognition Anshul B. Shah Shlok Kumar Mishra Ankan Bansal Jun-Cheng Chen Ramalingam Chellappa Abhinav Shrivastava 44 33 0 16 Oct 2020
Back to the Future: Cycle Encoding Prediction for Self-supervised Contrastive Video Representation Learning Xinyu Yang Majid Mirmehdi T. Burghardt 27 4 0 14 Oct 2020
Video Action Understanding Matthew Hutchinson V. Gadepally 43 20 0 13 Oct 2020
The MECCANO Dataset: Understanding Human-Object Interactions from Egocentric Videos in an Industrial-like Domain Francesco Ragusa Antonino Furnari S. Livatino G. Farinella EgoV 24 99 0 12 Oct 2020
Reconfigurable Cyber-Physical System for Lifestyle Video-Monitoring via Deep Learning Daniel Deniz Francisco Barranco J. Isern Eduardo Ros 9 7 0 07 Oct 2020
Support-set bottlenecks for video-text representation learning Mandela Patrick Po-Yao (Bernie) Huang Yuki M. Asano Florian Metze Alexander G. Hauptmann João Henriques Andrea Vedaldi 22 244 0 06 Oct 2020
Dissected 3D CNNs: Temporal Skip Connections for Efficient Online Video Processing Okan Kopuklu Stefan Hormann Fabian Herzog Hakan Çevikalp Gerhard Rigoll 3DPC 23 15 0 30 Sep 2020
Score-level Multi Cue Fusion for Sign Language Recognition Çagri Gökçe Ogulcan Özdemir A. Kındıroglu L. Akarun SLR 19 23 0 29 Sep 2020
PERF-Net: Pose Empowered RGB-Flow Net Yinxiao Li Zhichao Lu Xuehan Xiong Jonathan Huang 3DH 40 17 0 28 Sep 2020
Online Learnable Keyframe Extraction in Videos and its Application with Semantic Word Vector in Action Recognition G. Elahi Herbert Yang 25 25 0 25 Sep 2020
On the spatiotemporal behavior in biology-mimicking computing systems J. Végh Ádám-József Berki 22 6 0 18 Sep 2020
Discovering Dynamic Salient Regions for Spatio-Temporal Graph Neural Networks Iulia Duta Andrei Liviu Nicolicioiu Marius Leordeanu 26 6 0 17 Sep 2020
Multi-Label Activity Recognition using Activity-specific Features and Activity Correlations Yanyi Zhang Xinyu Li I. Marsic HAI 28 23 0 16 Sep 2020
Removing the Background by Adding the Background: Towards Background Robust Self-supervised Video Representation Learning Jinpeng Wang Yuting Gao Ke Li Yiqi Lin A. J. Ma Hao Cheng Pai Peng Feiyue Huang Rongrong Ji Xing Sun SSL 54 96 0 12 Sep 2020
Online Spatiotemporal Action Detection and Prediction via Causal Representations Gurkirt Singh 3DPC CML 24 0 0 31 Aug 2020
Self-supervised Video Representation Learning by Uncovering Spatio-temporal Statistics Jiangliu Wang Jianbo Jiao Linchao Bao Shengfeng He Wei Liu Yunhui Liu SSL AI4TS 21 55 0 31 Aug 2020
All About Knowledge Graphs for Actions P. Ghosh Nirat Saini L. Davis Abhinav Shrivastava 24 31 0 28 Aug 2020
DMD: A Large-Scale Multi-Modal Driver Monitoring Dataset for Attention and Alertness Analysis J. Ortega Neslihan Köse P. Cañas Min-An Chao A. Unnervik Marcos Nieto Oihana Otaegui L. Salgado 27 91 0 27 Aug 2020
Self-Supervised Human Activity Recognition by Augmenting Generative Adversarial Networks Mohammad Zaki Zadeh Ashwin Ramesh Babu Ashish Jaiswal F. Makedon 14 16 0 26 Aug 2020
Making a Case for 3D Convolutions for Object Segmentation in Videos Sabarinath Mahadevan A. Athar Aljosa Osep Sebastian Hennen Laura Leal-Taixé Bastian Leibe VOS 21 87 0 26 Aug 2020
Effective Action Recognition with Embedded Key Point Shifts Haozhi Cao Yuecong Xu Jianfei Yang K. Mao Jianxiong Yin Simon See 15 7 0 26 Aug 2020
Discriminability Distillation in Group Representation Learning Manyuan Zhang Guanglu Song Hang Zhou Yu Liu FedML 17 18 0 25 Aug 2020
Quantitative Survey of the State of the Art in Sign Language Recognition Oscar Koller SLR 27 94 0 22 Aug 2020