Spatiotemporal Residual Networks for Video Action Recognition

7 November 2016

Christoph Feichtenhofer

A. Pinz

Richard P. Wildes

Papers citing "Spatiotemporal Residual Networks for Video Action Recognition"

50 / 273 papers shown

Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications

Zhijian Liu

Song Han

254

133

25 Apr 2022

DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition

Computer Vision and Pattern Recognition (CVPR), 2022

213

19 Mar 2022

Gate-Shift-Fuse for Video Action Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Swathikiran Sudhakaran

Sergio Escalera

Oswald Lanz

267

16 Mar 2022

Enriched CNN-Transformer Feature Aggregation Networks for Super-Resolution

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

255

15 Mar 2022

PAMI-AD: An Activity Detector Exploiting Part-attention and Motion Information in Surveillance Videos

221

08 Mar 2022

RadioTransformer: A Cascaded Global-Focal Transformer for Visual Attention-guided Disease Classification

European Conference on Computer Vision (ECCV), 2022

197

23 Feb 2022

Multiview Transformers for Video Recognition

Computer Vision and Pattern Recognition (CVPR), 2022

446

269

12 Jan 2022

3D Skeleton-based Few-shot Action Recognition with JEANIE is not so Naïve

Lei Wang

Jun Liu

Piotr Koniusz

164

23 Dec 2021

Vision Transformer Based Video Hashing Retrieval for Tracing the Source of Fake Videos

287

15 Dec 2021

SVIP: Sequence VerIfication for Procedures in Videos

Xu Tang

327

13 Dec 2021

MViTv2: Improved Multiscale Vision Transformers for Classification and Detection

Christoph Feichtenhofer

ViT

492

842

02 Dec 2021

A Critical Study on the Recent Deep Learning Based Semi-Supervised Video Anomaly Detection Methods

M. Baradaran

R. Bergevin

268

02 Nov 2021

Skeleton-Based Mutually Assisted Interacted Object Localization and Human Action Recognition

IEEE Transactions on Multimedia (IEEE Trans. Multimedia), 2021

282

28 Oct 2021

Deep Two-Stream Video Inference for Human Body Pose and Shape Estimation

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021

Bo Xu

134

22 Oct 2021

High-order Tensor Pooling with Attention for Action Recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Lei Wang

Ke Sun

Piotr Koniusz

286

11 Oct 2021

TSM: Temporal Shift Module for Efficient and Scalable Video Understanding on Edge Device

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020

Ji Lin

Chuang Gan

Kuan-Chieh Wang

Song Han

171

27 Sep 2021

Searching for Two-Stream Models in Multivariate Space for Video Recognition

IEEE International Conference on Computer Vision (ICCV), 2021

Heng Wang

190

30 Aug 2021

Shifted Chunk Transformer for Spatio-Temporal Representational Learning

Neural Information Processing Systems (NeurIPS), 2021

299

26 Aug 2021

When Video Classification Meets Incremental Classes

ACM Multimedia (ACM MM), 2021

Xi Li

183

30 Jun 2021

TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?

591

154

21 Jun 2021

MaCLR: Motion-aware Contrastive Learning of Representations for Videos

European Conference on Computer Vision (ECCV), 2021

186

17 Jun 2021

SSAN: Separable Self-Attention Network for Video Representation Learning

Computer Vision and Pattern Recognition (CVPR), 2021

161

27 May 2021

Anabranch Network for Camouflaged Object Segmentation

Computer Vision and Image Understanding (CVIU), 2019

Trung-Nghia Le

263

628

20 May 2021

What can human minimal videos tell us about dynamic recognition models?

Cognition, 2020

Guy Ben-Yosef

Gabriel Kreiman

S. Ullman

19 Apr 2021

Adaptive Intermediate Representations for Video Understanding

156

14 Apr 2021

ViViT: A Video Vision Transformer

IEEE International Conference on Computer Vision (ICCV), 2021

545

2,702

29 Mar 2021

Unified Graph Structured Models for Video Understanding

IEEE International Conference on Computer Vision (ICCV), 2021

Anurag Arnab

Chen Sun

Cordelia Schmid

230

29 Mar 2021

Busy-Quiet Video Disentangling for Video Classification

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021

Guoxi Huang

A. Bors

270

29 Mar 2021

Learning to Recognize Actions on Objects in Egocentric Video with Attention Dictionaries

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021

Swathikiran Sudhakaran

Sergio Escalera

Oswald Lanz

EgoV

206

16 Feb 2021

RMS-Net: Regression and Masking for Soccer Event Spotting

International Conference on Pattern Recognition (ICPR), 2021

Lorenzo Baraldi

214

15 Feb 2021

Video Transformer Network

781

475

01 Feb 2021

ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning

IEEE International Conference on Computer Vision (ICCV), 2021

337

26 Jan 2021

RGB-D Salient Object Detection via 3D Convolutional Neural Networks

AAAI Conference on Artificial Intelligence (AAAI), 2021

177

169

25 Jan 2021

A Layer-Wise Information Reinforcement Approach to Improve Learning in Deep Belief Networks

International Conference on Artificial Intelligence and Soft Computing (ICAISC), 2020

123

17 Jan 2021

Human Action Recognition from Various Data Modalities: A Review

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020

Zehua Sun

Jun Liu

582

699

22 Dec 2020

Multi-shot Temporal Event Localization: a Benchmark

Computer Vision and Pattern Recognition (CVPR), 2020

Yao Hu

201

17 Dec 2020

A Comprehensive Study of Deep Video Action Recognition

Yi Zhu

Xinyu Li

Chunhui Liu

Mohammadreza Zolfaghari

283

210

11 Dec 2020

Spatial-Temporal Alignment Network for Action Recognition and Detection

Alexander G. Hauptmann

3DPC

154

04 Dec 2020

Recent Progress in Appearance-based Action Recognition

J. Humphreys

Zhe Chen

Dacheng Tao

170

25 Nov 2020

Play Fair: Frame Attributions in Video Models

Asian Conference on Computer Vision (ACCV), 2020

Will Price

Dima Damen

FAtt

119

24 Nov 2020

Improved Soccer Action Spotting using both Audio and Video Streams

Bastien Vanderplaetse

Stéphane Dupont

198

09 Nov 2020

Multi-Temporal Convolutions for Human Action Recognition in Videos

Alexandros Stergiou

R. Poppe

210

08 Nov 2020

Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition

291

116

22 Oct 2020

Unsupervised Video Anomaly Detection via Normalizing Flows with Implicit Latent Features

392

103

15 Oct 2020

The MECCANO Dataset: Understanding Human-Object Interactions from Egocentric Videos in an Industrial-like Domain

174

125

12 Oct 2020

Adversarial Semi-Supervised Multi-Domain Tracking

Kourosh Meshgi

Maryam Sadat Mirzaei

156

30 Sep 2020

AssembleNet++: Assembling Modality Representations via Attention Connections

169

18 Aug 2020

A Unified Framework for Shot Type Classification Based on Subject Centric Lens

European Conference on Computer Vision (ECCV), 2020

Linning Xu

220

08 Aug 2020

Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework

ACM Multimedia (ACM MM), 2020

336

114

06 Aug 2020

HAMLET: A Hierarchical Multimodal Attention-based Human Activity Recognition Algorithm

Md. Mofijul Islam

Tariq Iqbal

158

03 Aug 2020