ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.02155
  4. Cited By
Spatiotemporal Residual Networks for Video Action Recognition

Spatiotemporal Residual Networks for Video Action Recognition

7 November 2016
Christoph Feichtenhofer
A. Pinz
Richard P. Wildes
ArXiv (abs)PDFHTML

Papers citing "Spatiotemporal Residual Networks for Video Action Recognition"

50 / 273 papers shown
AR-Net: Adaptive Frame Resolution for Efficient Action Recognition
AR-Net: Adaptive Frame Resolution for Efficient Action RecognitionEuropean Conference on Computer Vision (ECCV), 2020
Yue Meng
Chung-Ching Lin
Yikang Shen
P. Sattigeri
Leonid Karlinsky
A. Oliva
Kate Saenko
Rogerio Feris
220
167
0
31 Jul 2020
Approximated Bilinear Modules for Temporal Modeling
Approximated Bilinear Modules for Temporal ModelingIEEE International Conference on Computer Vision (ICCV), 2019
Xinqi Zhu
Chang Xu
Langwen Hui
Cewu Lu
Dacheng Tao
124
27
0
25 Jul 2020
Depthwise Spatio-Temporal STFT Convolutional Neural Networks for Human
  Action Recognition
Depthwise Spatio-Temporal STFT Convolutional Neural Networks for Human Action Recognition
Sudhakar Kumawat
Manisha Verma
Yuta Nakashima
Shanmuganathan Raman
334
50
0
22 Jul 2020
MotionSqueeze: Neural Motion Feature Learning for Video Understanding
MotionSqueeze: Neural Motion Feature Learning for Video Understanding
Heeseung Kwon
Manjin Kim
Suha Kwak
Minsu Cho
FAtt
173
146
0
20 Jul 2020
Generalized Few-Shot Video Classification with Video Retrieval and
  Feature Generation
Generalized Few-Shot Video Classification with Video Retrieval and Feature GenerationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Yongqin Xian
Bruno Korbar
Matthijs Douze
Lorenzo Torresani
Bernt Schiele
Zeynep Akata
VGen
182
21
0
09 Jul 2020
Joint Learning of Social Groups, Individuals Action and Sub-group
  Activities in Videos
Joint Learning of Social Groups, Individuals Action and Sub-group Activities in Videos
Mahsa Ehsanpour
Alireza Abedin
F. Saleh
Javen Qinfeng Shi
Ian Reid
Hamid Rezatofighi
317
84
0
06 Jul 2020
SmallBigNet: Integrating Core and Contextual Views for Video
  Classification
SmallBigNet: Integrating Core and Contextual Views for Video Classification
Xianhang Li
Yali Wang
Zhipeng Zhou
Yu Qiao
ViT
205
103
0
25 Jun 2020
Comprehensive Information Integration Modeling Framework for Video
  Titling
Comprehensive Information Integration Modeling Framework for Video TitlingKnowledge Discovery and Data Mining (KDD), 2020
Shengyu Zhang
Ziqi Tan
Jin Yu
Zhou Zhao
Kun Kuang
Tan Jiang
Jingren Zhou
Hongxia Yang
Leilei Gan
174
41
0
24 Jun 2020
Motion Representation Using Residual Frames with 3D CNN
Motion Representation Using Residual Frames with 3D CNN
Li Tao
Xueting Wang
T. Yamasaki
3DPC
135
2
0
21 Jun 2020
MS-TCN++: Multi-Stage Temporal Convolutional Network for Action
  Segmentation
MS-TCN++: Multi-Stage Temporal Convolutional Network for Action Segmentation
Shijie Li
Yazan Abu Farha
Yun-Hai Liu
Mingg-Ming Cheng
Juergen Gall
222
54
0
16 Jun 2020
DTG-Net: Differentiated Teachers Guided Self-Supervised Video Action
  Recognition
DTG-Net: Differentiated Teachers Guided Self-Supervised Video Action Recognition
Ziming Liu
Guangyu Gao
•. A. K. Qin
Jinyang Li
ViT
176
1
0
13 Jun 2020
Action Recognition with Deep Multiple Aggregation Networks
Action Recognition with Deep Multiple Aggregation Networks
A. Mazari
H. Sahbi
182
0
0
08 Jun 2020
Deep hierarchical pooling design for cross-granularity action
  recognition
Deep hierarchical pooling design for cross-granularity action recognition
A. Mazari
H. Sahbi
134
0
0
08 Jun 2020
Exploiting Inter-Frame Regional Correlation for Efficient Action
  Recognition
Exploiting Inter-Frame Regional Correlation for Efficient Action RecognitionExpert systems with applications (ESWA), 2020
Yuecong Xu
Jianfei Yang
K. Mao
Jianxiong Yin
Simon See
117
11
0
06 May 2020
Rolling-Unrolling LSTMs for Action Anticipation from First-Person Video
Rolling-Unrolling LSTMs for Action Anticipation from First-Person VideoIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Antonino Furnari
G. Farinella
EgoV
249
166
0
04 May 2020
Asynchronous Interaction Aggregation for Action Detection
Asynchronous Interaction Aggregation for Action DetectionEuropean Conference on Computer Vision (ECCV), 2020
Jiajun Tang
Jinchao Xia
Xinzhi Mu
Bo Pang
Cewu Lu
224
130
0
16 Apr 2020
Spatiotemporal Fusion in 3D CNNs: A Probabilistic View
Spatiotemporal Fusion in 3D CNNs: A Probabilistic ViewComputer Vision and Pattern Recognition (CVPR), 2020
Yizhou Zhou
Xiaoyan Sun
Chong Luo
Zhengjun Zha
Wenjun Zeng
3DPC
166
23
0
10 Apr 2020
Spatial Priming for Detecting Human-Object Interactions
Spatial Priming for Detecting Human-Object Interactions
Ankan Bansal
Sai Saketh Rambhatla
Abhinav Shrivastava
Rama Chellappa
108
0
0
09 Apr 2020
X3D: Expanding Architectures for Efficient Video Recognition
X3D: Expanding Architectures for Efficient Video RecognitionComputer Vision and Pattern Recognition (CVPR), 2020
Christoph Feichtenhofer
421
1,226
0
09 Apr 2020
Temporal Accumulative Features for Sign Language Recognition
Temporal Accumulative Features for Sign Language Recognition
A. Kındıroglu
Ogulcan Özdemir
L. Akarun
SLR
87
18
0
02 Apr 2020
Spatio-temporal Tubelet Feature Aggregation and Object Linking in Videos
Spatio-temporal Tubelet Feature Aggregation and Object Linking in Videos
Daniel Cores
V. Brea
M. Mucientes
ViT
124
1
0
01 Apr 2020
Combining detection and tracking for human pose estimation in videos
Combining detection and tracking for human pose estimation in videosComputer Vision and Pattern Recognition (CVPR), 2020
Manchen Wang
Joseph Tighe
Davide Modolo
VOT
166
123
0
30 Mar 2020
Learning Object Permanence from Video
Learning Object Permanence from VideoEuropean Conference on Computer Vision (ECCV), 2020
Aviv Shamsian
Ofri Kleinfeld
Amir Globerson
Gal Chechik
SSL
371
35
0
23 Mar 2020
Generative Multi-Stream Architecture For American Sign Language
  Recognition
Generative Multi-Stream Architecture For American Sign Language Recognition
Dom Huh
Sai Gurrapu
F. Olson
Huzefa Rangwala
Parth H. Pathak
Jana Kosecka
SLR
66
3
0
09 Mar 2020
Motion-Attentive Transition for Zero-Shot Video Object Segmentation
Motion-Attentive Transition for Zero-Shot Video Object SegmentationIEEE Transactions on Image Processing (TIP), 2020
Tianfei Zhou
Shunzhou Wang
Yi Zhou
Yazhou Yao
Jianwu Li
Ling Shao
VOS
429
211
0
09 Mar 2020
MoVi: A Large Multipurpose Motion and Video Dataset
MoVi: A Large Multipurpose Motion and Video DatasetPLoS ONE (PLOS ONE), 2020
Saeed Ghorbani
Kimia Mahdaviani
A. Thaler
Konrad Paul Kording
D. Cook
Gunnar Blohm
N. Troje
225
91
0
04 Mar 2020
Three-Stream Fusion Network for First-Person Interaction Recognition
Three-Stream Fusion Network for First-Person Interaction RecognitionPattern Recognition (Pattern Recognit.), 2020
Ye-ji Kim
Dong-Gyu Lee
Seong-Whan Lee
124
8
0
19 Feb 2020
Dynamic Inference: A New Approach Toward Efficient Video Action
  Recognition
Dynamic Inference: A New Approach Toward Efficient Video Action Recognition
Wenhao Wu
Dongliang He
Xiao Tan
Shifeng Chen
Yi Yang
Shilei Wen
158
37
0
09 Feb 2020
Weakly-Supervised Multi-Person Action Recognition in 360$^{\circ}$
  Videos
Weakly-Supervised Multi-Person Action Recognition in 360∘^{\circ}∘ VideosIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020
Junnan Li
Jianquan Liu
Yongkang Wong
Shoji Nishimura
Mohan S. Kankanhalli
229
14
0
09 Feb 2020
Learning Class Regularized Features for Action Recognition
Learning Class Regularized Features for Action Recognition
Alexandros Stergiou
R. Poppe
R. Veltkamp
84
4
0
07 Feb 2020
Modality Compensation Network: Cross-Modal Adaptation for Action
  Recognition
Modality Compensation Network: Cross-Modal Adaptation for Action RecognitionIEEE Transactions on Image Processing (TIP), 2020
Sijie Song
Jiaying Liu
Yanghao Li
Zongming Guo
139
56
0
31 Jan 2020
Audiovisual SlowFast Networks for Video Recognition
Audiovisual SlowFast Networks for Video Recognition
Fanyi Xiao
Yong Jae Lee
Kristen Grauman
Jitendra Malik
Christoph Feichtenhofer
598
230
0
23 Jan 2020
Spatio-Temporal Ranked-Attention Networks for Video Captioning
Spatio-Temporal Ranked-Attention Networks for Video CaptioningIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020
A. Cherian
Jue Wang
Chiori Hori
Tim K. Marks
AI4TS
125
22
0
17 Jan 2020
Rethinking Motion Representation: Residual Frames with 3D ConvNets for
  Better Action Recognition
Rethinking Motion Representation: Residual Frames with 3D ConvNets for Better Action RecognitionIEEE Transactions on Image Processing (TIP), 2020
Li Tao
Xueting Wang
T. Yamasaki
3DPC
142
26
0
16 Jan 2020
Self-supervising Action Recognition by Statistical Moment and Subspace
  Descriptors
Self-supervising Action Recognition by Statistical Moment and Subspace DescriptorsACM Multimedia (ACM MM), 2020
Lei Wang
Piotr Koniusz
287
57
0
14 Jan 2020
Something-Else: Compositional Action Recognition with Spatial-Temporal
  Interaction Networks
Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction NetworksComputer Vision and Pattern Recognition (CVPR), 2019
Joanna Materzynska
Tete Xiao
Roei Herzig
Huijuan Xu
Xiaolong Wang
Trevor Darrell
CoGe
269
192
0
20 Dec 2019
Lower Dimensional Kernels for Video Discriminators
Lower Dimensional Kernels for Video DiscriminatorsNeural Networks (NN), 2019
Emmanuel Kahembwe
S. Ramamoorthy
199
52
0
18 Dec 2019
Video action detection by learning graph-based spatio-temporal
  interactions
Video action detection by learning graph-based spatio-temporal interactions
Matteo Tomei
Lorenzo Baraldi
Simone Calderara
Simone Bronzin
Rita Cucchiara
222
10
0
09 Dec 2019
VideoDG: Generalizing Temporal Relations in Videos to Novel Domains
VideoDG: Generalizing Temporal Relations in Videos to Novel DomainsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019
Zhiyu Yao
Yunbo Wang
Jianmin Wang
Philip S. Yu
Mingsheng Long
OODViT
158
31
0
08 Dec 2019
A Multigrid Method for Efficiently Training Video Models
A Multigrid Method for Efficiently Training Video ModelsComputer Vision and Pattern Recognition (CVPR), 2019
Chaoxia Wu
Ross B. Girshick
Kaiming He
Christoph Feichtenhofer
Philipp Krahenbuhl
301
99
0
02 Dec 2019
Gate-Shift Networks for Video Action Recognition
Gate-Shift Networks for Video Action RecognitionComputer Vision and Pattern Recognition (CVPR), 2019
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
3DPC
316
172
0
01 Dec 2019
TEINet: Towards an Efficient Architecture for Video Recognition
TEINet: Towards an Efficient Architecture for Video RecognitionAAAI Conference on Artificial Intelligence (AAAI), 2019
Zhaoyang Liu
Donghao Luo
Yabiao Wang
Limin Wang
Ying Tai
Chengjie Wang
Jilin Li
Feiyue Huang
Tong Lu
ViT
170
262
0
21 Nov 2019
STEP: Spatial Temporal Graph Convolutional Networks for Emotion
  Perception from Gaits
STEP: Spatial Temporal Graph Convolutional Networks for Emotion Perception from GaitsAAAI Conference on Artificial Intelligence (AAAI), 2019
Uttaran Bhattacharya
Trisha Mittal
Rohan Chandra
Tanmay Randhavane
Aniket Bera
Tianyi Zhou
CVBM
367
119
0
28 Oct 2019
Human Action Recognition with Multi-Laplacian Graph Convolutional
  Networks
Human Action Recognition with Multi-Laplacian Graph Convolutional Networks
A. Mazari
H. Sahbi
GNN
119
5
0
15 Oct 2019
CATER: A diagnostic dataset for Compositional Actions and TEmporal
  Reasoning
CATER: A diagnostic dataset for Compositional Actions and TEmporal ReasoningInternational Conference on Learning Representations (ICLR), 2019
Rohit Girdhar
Deva Ramanan
385
193
0
10 Oct 2019
Learning Energy-based Spatial-Temporal Generative ConvNets for Dynamic
  Patterns
Learning Energy-based Spatial-Temporal Generative ConvNets for Dynamic PatternsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019
Jianwen Xie
Song-Chun Zhu
Ying Nian Wu
GAN
178
52
0
26 Sep 2019
Discriminative Video Representation Learning Using Support Vector
  Classifiers
Discriminative Video Representation Learning Using Support Vector ClassifiersIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019
Jue Wang
A. Cherian
98
6
0
05 Sep 2019
Cooperative Cross-Stream Network for Discriminative Action
  Representation
Cooperative Cross-Stream Network for Discriminative Action Representation
Jingran Zhang
Fumin Shen
Xing Xu
Heng Tao Shen
153
5
0
27 Aug 2019
Deep Concept-wise Temporal Convolutional Networks for Action
  Localization
Deep Concept-wise Temporal Convolutional Networks for Action LocalizationACM Multimedia (ACM MM), 2019
Xin Li
Tianwei Lin
Xiao-Chang Liu
Chuang Gan
W. Zuo
Chong Li
Xiang Long
Dongliang He
Fu Li
Shilei Wen
177
32
0
26 Aug 2019
STM: SpatioTemporal and Motion Encoding for Action Recognition
STM: SpatioTemporal and Motion Encoding for Action RecognitionIEEE International Conference on Computer Vision (ICCV), 2019
Boyuan Jiang
Mengmeng Wang
Weihao Gan
Wei Wu
Junjie Yan
417
434
0
07 Aug 2019
Previous
123456
Next
Page 3 of 6