Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1611.02155
Cited By
Spatiotemporal Residual Networks for Video Action Recognition
7 November 2016
Christoph Feichtenhofer
A. Pinz
Richard P. Wildes
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Spatiotemporal Residual Networks for Video Action Recognition"
50 / 273 papers shown
Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Han Cai
Ji Lin
Chengyue Wu
Zhijian Liu
Haotian Tang
Hanrui Wang
Ligeng Zhu
Song Han
254
133
0
25 Apr 2022
DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2022
Thanh-Dat Truong
Quoc-Huy Bui
C. Duong
Han-Seok Seo
Son Lam Phung
Xin Li
Khoa Luu
ViT
213
69
0
19 Mar 2022
Gate-Shift-Fuse for Video Action Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
267
33
0
16 Mar 2022
Enriched CNN-Transformer Feature Aggregation Networks for Super-Resolution
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Jinsu Yoo
Taehoon Kim
Sihaeng Lee
Seunghyeon Kim
Hankook Lee
Tae Hyun Kim
SupR
ViT
255
86
0
15 Mar 2022
PAMI-AD: An Activity Detector Exploiting Part-attention and Motion Information in Surveillance Videos
Yunhao Du
Zhihang Tong
Jun-Jun Wan
Binyu Zhang
Yanyun Zhao
221
3
0
08 Mar 2022
RadioTransformer: A Cascaded Global-Focal Transformer for Visual Attention-guided Disease Classification
European Conference on Computer Vision (ECCV), 2022
Moinak Bhattacharya
Shubham Jain
Prateek Prasanna
ViT
MedIm
197
40
0
23 Feb 2022
Multiview Transformers for Video Recognition
Computer Vision and Pattern Recognition (CVPR), 2022
Shen Yan
Xuehan Xiong
Anurag Arnab
Zhichao Lu
Mi Zhang
Chen Sun
Cordelia Schmid
ViT
446
269
0
12 Jan 2022
3D Skeleton-based Few-shot Action Recognition with JEANIE is not so Naïve
Lei Wang
Jun Liu
Piotr Koniusz
164
24
0
23 Dec 2021
Vision Transformer Based Video Hashing Retrieval for Tracing the Source of Fake Videos
Pengfei Pei
Xianfeng Zhao
Yun Cao
Jinchuan Li
Xiaowei Yi
ViT
287
9
0
15 Dec 2021
SVIP: Sequence VerIfication for Procedures in Videos
Yichen Qian
Weixin Luo
Dongze Lian
Xu Tang
P. Zhao
Shenghua Gao
ViT
327
23
0
13 Dec 2021
MViTv2: Improved Multiscale Vision Transformers for Classification and Detection
Yanghao Li
Chaoxia Wu
Haoqi Fan
K. Mangalam
Bo Xiong
Jitendra Malik
Christoph Feichtenhofer
ViT
492
842
0
02 Dec 2021
A Critical Study on the Recent Deep Learning Based Semi-Supervised Video Anomaly Detection Methods
M. Baradaran
R. Bergevin
268
24
0
02 Nov 2021
Skeleton-Based Mutually Assisted Interacted Object Localization and Human Action Recognition
IEEE transactions on multimedia (IEEE Trans. Multimedia), 2021
Liang Xu
Cuiling Lan
Wenjun Zeng
Cewu Lu
282
35
0
28 Oct 2021
Deep Two-Stream Video Inference for Human Body Pose and Shape Estimation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021
Zi-Jun Li
Bo Xu
Han Huang
Cheng Lu
Yandong Guo
3DH
134
14
0
22 Oct 2021
High-order Tensor Pooling with Attention for Action Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Lei Wang
Ke Sun
Piotr Koniusz
286
20
0
11 Oct 2021
TSM: Temporal Shift Module for Efficient and Scalable Video Understanding on Edge Device
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Ji Lin
Chuang Gan
Kuan-Chieh Wang
Song Han
171
80
0
27 Sep 2021
Searching for Two-Stream Models in Multivariate Space for Video Recognition
IEEE International Conference on Computer Vision (ICCV), 2021
Xinyu Gong
Heng Wang
Zheng Shou
Matt Feiszli
Zinan Lin
Zhicheng Yan
190
9
0
30 Aug 2021
Shifted Chunk Transformer for Spatio-Temporal Representational Learning
Neural Information Processing Systems (NeurIPS), 2021
Xuefan Zha
Wentao Zhu
Tingxun Lv
Sen Yang
Ji Liu
AI4TS
ViT
299
30
0
26 Aug 2021
When Video Classification Meets Incremental Classes
ACM Multimedia (ACM MM), 2021
Hanbin Zhao
Xin Qin
Shihao Su
Yongjian Fu
Zibo Lin
Xi Li
CLL
183
32
0
30 Jun 2021
TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?
Michael S. Ryoo
A. Piergiovanni
Anurag Arnab
Mostafa Dehghani
A. Angelova
ViT
591
154
0
21 Jun 2021
MaCLR: Motion-aware Contrastive Learning of Representations for Videos
European Conference on Computer Vision (ECCV), 2021
Fanyi Xiao
Joseph Tighe
Davide Modolo
SSL
186
18
0
17 Jun 2021
SSAN: Separable Self-Attention Network for Video Representation Learning
Computer Vision and Pattern Recognition (CVPR), 2021
Xudong Guo
Xun Guo
Yan Lu
ViT
AI4TS
161
29
0
27 May 2021
Anabranch Network for Camouflaged Object Segmentation
Computer Vision and Image Understanding (CVIU), 2019
Trung-Nghia Le
Tam V. Nguyen
Zhongliang Nie
M. Tran
Akihiro Sugimoto
263
628
0
20 May 2021
What can human minimal videos tell us about dynamic recognition models?
Cognition (Cognition), 2020
Guy Ben-Yosef
Gabriel Kreiman
S. Ullman
79
5
0
19 Apr 2021
Adaptive Intermediate Representations for Video Understanding
Juhana Kangaspunta
A. Piergiovanni
Rico Jonschkowski
Michael S. Ryoo
A. Angelova
156
4
0
14 Apr 2021
ViViT: A Video Vision Transformer
IEEE International Conference on Computer Vision (ICCV), 2021
Anurag Arnab
Mostafa Dehghani
G. Heigold
Chen Sun
Mario Lucic
Cordelia Schmid
ViT
545
2,702
0
29 Mar 2021
Unified Graph Structured Models for Video Understanding
IEEE International Conference on Computer Vision (ICCV), 2021
Anurag Arnab
Chen Sun
Cordelia Schmid
230
52
0
29 Mar 2021
Busy-Quiet Video Disentangling for Video Classification
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021
Guoxi Huang
A. Bors
270
10
0
29 Mar 2021
Learning to Recognize Actions on Objects in Egocentric Video with Attention Dictionaries
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
EgoV
206
22
0
16 Feb 2021
RMS-Net: Regression and Masking for Soccer Event Spotting
International Conference on Pattern Recognition (ICPR), 2021
Matteo Tomei
Lorenzo Baraldi
Simone Calderara
Simone Bronzin
Rita Cucchiara
214
33
0
15 Feb 2021
Video Transformer Network
Daniel Neimark
Omri Bar
Maya Zohar
Dotan Asselmann
ViT
781
475
0
01 Feb 2021
ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning
IEEE International Conference on Computer Vision (ICCV), 2021
Sangho Lee
Jiwan Chung
Youngjae Yu
Gunhee Kim
Thomas Breuel
Gal Chechik
Yale Song
337
66
0
26 Jan 2021
RGB-D Salient Object Detection via 3D Convolutional Neural Networks
AAAI Conference on Artificial Intelligence (AAAI), 2021
Qian Chen
Ze Liu
Y. Zhang
Keren Fu
Qijun Zhao
H. Du
3DPC
177
169
0
25 Jan 2021
A Layer-Wise Information Reinforcement Approach to Improve Learning in Deep Belief Networks
International Conference on Artificial Intelligence and Soft Computing (ICAISC), 2020
Mateus Roder
L. A. Passos
L. C. Ribeiro
C. R. Pereira
João Paulo Papa
123
1
0
17 Jan 2021
Human Action Recognition from Various Data Modalities: A Review
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Zehua Sun
Qiuhong Ke
Hossein Rahmani
Mohammed Bennamoun
Gang Wang
Jun Liu
MU
582
699
0
22 Dec 2020
Multi-shot Temporal Event Localization: a Benchmark
Computer Vision and Pattern Recognition (CVPR), 2020
Xiaolong Liu
Yao Hu
S. Bai
Fei Ding
X. Bai
Juil Sock
201
97
0
17 Dec 2020
A Comprehensive Study of Deep Video Action Recognition
Yi Zhu
Xinyu Li
Chunhui Liu
Mohammadreza Zolfaghari
Yuanjun Xiong
Chongruo Wu
Zhi-Li Zhang
Joseph Tighe
R. Manmatha
Mu Li
VLM
AI4TS
283
210
0
11 Dec 2020
Spatial-Temporal Alignment Network for Action Recognition and Detection
Junwei Liang
Liangliang Cao
Xuehan Xiong
Ting Yu
Alexander G. Hauptmann
3DPC
154
9
0
04 Dec 2020
Recent Progress in Appearance-based Action Recognition
J. Humphreys
Zhe Chen
Dacheng Tao
170
0
0
25 Nov 2020
Play Fair: Frame Attributions in Video Models
Asian Conference on Computer Vision (ACCV), 2020
Will Price
Dima Damen
FAtt
119
6
0
24 Nov 2020
Improved Soccer Action Spotting using both Audio and Video Streams
Bastien Vanderplaetse
Stéphane Dupont
198
48
0
09 Nov 2020
Multi-Temporal Convolutions for Human Action Recognition in Videos
Alexandros Stergiou
R. Poppe
210
1
0
08 Nov 2020
Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition
Chun-Fu Chen
Yikang Shen
K. Ramakrishnan
Rogerio Feris
J. M. Cohn
A. Oliva
Quanfu Fan
291
116
0
22 Oct 2020
Unsupervised Video Anomaly Detection via Normalizing Flows with Implicit Latent Features
Myeongah Cho
Taeoh Kim
Woojin Kim
Suhwan Cho
Sangyoun Lee
392
103
0
15 Oct 2020
The MECCANO Dataset: Understanding Human-Object Interactions from Egocentric Videos in an Industrial-like Domain
Francesco Ragusa
Antonino Furnari
S. Livatino
G. Farinella
EgoV
174
125
0
12 Oct 2020
Adversarial Semi-Supervised Multi-Domain Tracking
Kourosh Meshgi
Maryam Sadat Mirzaei
156
1
0
30 Sep 2020
AssembleNet++: Assembling Modality Representations via Attention Connections
Michael S. Ryoo
A. Piergiovanni
Juhana Kangaspunta
A. Angelova
169
50
0
18 Aug 2020
A Unified Framework for Shot Type Classification Based on Subject Centric Lens
European Conference on Computer Vision (ECCV), 2020
Anyi Rao
Jiaze Wang
Linning Xu
Xuekun Jiang
Qingqiu Huang
Bolei Zhou
Dahua Lin
220
78
0
08 Aug 2020
Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework
ACM Multimedia (ACM MM), 2020
Li Tao
Xueting Wang
T. Yamasaki
SSL
336
114
0
06 Aug 2020
HAMLET: A Hierarchical Multimodal Attention-based Human Activity Recognition Algorithm
Md. Mofijul Islam
Tariq Iqbal
158
94
0
03 Aug 2020
Previous
1
2
3
4
5
6
Next