ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00562
  4. Cited By
Human Action Recognition using Factorized Spatio-Temporal Convolutional
  Networks

Human Action Recognition using Factorized Spatio-Temporal Convolutional Networks

2 October 2015
Lin Sun
Kui Jia
Dit-Yan Yeung
Bertram E. Shi
ArXivPDFHTML

Papers citing "Human Action Recognition using Factorized Spatio-Temporal Convolutional Networks"

50 / 70 papers shown
Title
Embodiment-Agnostic Action Planning via Object-Part Scene Flow
Embodiment-Agnostic Action Planning via Object-Part Scene Flow
Weiliang Tang
Jia-Hui Pan
Wei Zhan
Jianshu Zhou
Huaxiu Yao
Yun-Hui Liu
Masayoshi Tomizuka
Mingyu Ding
Chi-Wing Fu
60
0
0
16 Sep 2024
Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action
  Recognition
Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition
Syed Talal Wasim
Muhammad Uzair Khattak
Muzammal Naseer
Salman Khan
M. Shah
Fahad Shahbaz Khan
ViT
54
19
0
13 Jul 2023
DOAD: Decoupled One Stage Action Detection Network
DOAD: Decoupled One Stage Action Detection Network
Shuning Chang
Pichao Wang
Fan Wang
Jiashi Feng
Mike Zheng Show
26
4
0
01 Apr 2023
Extending Temporal Data Augmentation for Video Action Recognition
Extending Temporal Data Augmentation for Video Action Recognition
Artjoms Gorpincenko
Michal Mackiewicz
ViT
29
4
0
09 Nov 2022
Holistic Interaction Transformer Network for Action Detection
Holistic Interaction Transformer Network for Action Detection
Gueter Josmy Faure
Min-Hung Chen
S. Lai
33
37
0
23 Oct 2022
Video-based Human Action Recognition using Deep Learning: A Review
Video-based Human Action Recognition using Deep Learning: A Review
Hieu H. Pham
L. Khoudour
Alain Crouzil
Pablo Zegers
S. Velastín
35
34
0
07 Aug 2022
Gate-Shift-Fuse for Video Action Recognition
Gate-Shift-Fuse for Video Action Recognition
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
22
22
0
16 Mar 2022
HAKE: A Knowledge Engine Foundation for Human Activity Understanding
HAKE: A Knowledge Engine Foundation for Human Activity Understanding
Yong-Lu Li
Xinpeng Liu
Xiaoqian Wu
Yizhuo Li
Zuoyu Qiu
Liang Xu
Yue Xu
Haoshu Fang
Cewu Lu
32
38
0
14 Feb 2022
Multiview Transformers for Video Recognition
Multiview Transformers for Video Recognition
Shen Yan
Xuehan Xiong
Anurag Arnab
Zhichao Lu
Mi Zhang
Chen Sun
Cordelia Schmid
ViT
26
212
0
12 Jan 2022
Early Melanoma Diagnosis with Sequential Dermoscopic Images
Early Melanoma Diagnosis with Sequential Dermoscopic Images
Zhen Yu
Jennifer Nguyen
Toàn D. Nguyên
J. Kelly
C. Mclean
Paul Bonnington
Lei Zhang
Victoria Mar
Z. Ge
27
41
0
12 Oct 2021
Joint Learning On The Hierarchy Representation for Fine-Grained Human
  Action Recognition
Joint Learning On The Hierarchy Representation for Fine-Grained Human Action Recognition
M. C. Leong
Hui Li Tan
Haosong Zhang
Liyuan Li
Feng Lin
J. Lim
40
10
0
12 Oct 2021
TSM: Temporal Shift Module for Efficient and Scalable Video
  Understanding on Edge Device
TSM: Temporal Shift Module for Efficient and Scalable Video Understanding on Edge Device
Ji Lin
Chuang Gan
Kuan-Chieh Jackson Wang
Song Han
40
64
0
27 Sep 2021
Adaptive Recursive Circle Framework for Fine-grained Action Recognition
Adaptive Recursive Circle Framework for Fine-grained Action Recognition
Hanxi Lin
Xinxiao Wu
Jiebo Luo
25
1
0
25 Jul 2021
ViViT: A Video Vision Transformer
ViViT: A Video Vision Transformer
Anurag Arnab
Mostafa Dehghani
G. Heigold
Chen Sun
Mario Lucic
Cordelia Schmid
ViT
30
2,093
0
29 Mar 2021
CGAP2: Context and gap aware predictive pose framework for early
  detection of gestures
CGAP2: Context and gap aware predictive pose framework for early detection of gestures
Nishant Bhattacharya
Suresh Sundaram
16
0
0
18 Nov 2020
Approximated Bilinear Modules for Temporal Modeling
Approximated Bilinear Modules for Temporal Modeling
Xinqi Zhu
Chang Xu
Langwen Hui
Cewu Lu
Dacheng Tao
25
23
0
25 Jul 2020
X3D: Expanding Architectures for Efficient Video Recognition
X3D: Expanding Architectures for Efficient Video Recognition
Christoph Feichtenhofer
73
1,001
0
09 Apr 2020
TEA: Temporal Excitation and Aggregation for Action Recognition
TEA: Temporal Excitation and Aggregation for Action Recognition
Yan-Ran Li
Bin Ji
Xintian Shi
Jianguo Zhang
Bin Kang
Limin Wang
ViT
25
439
0
03 Apr 2020
Don't Forget The Past: Recurrent Depth Estimation from Monocular Video
Don't Forget The Past: Recurrent Depth Estimation from Monocular Video
Vaishakh Patil
Wouter Van Gansbeke
Dengxin Dai
Luc Van Gool
MDE
33
129
0
08 Jan 2020
Lower Dimensional Kernels for Video Discriminators
Lower Dimensional Kernels for Video Discriminators
Emmanuel Kahembwe
S. Ramamoorthy
29
50
0
18 Dec 2019
Temporal Factorization of 3D Convolutional Kernels
Temporal Factorization of 3D Convolutional Kernels
Gabrielle Ras
L. Ambrogioni
Umut Güçlü
Marcel van Gerven
16
1
0
09 Dec 2019
A Multigrid Method for Efficiently Training Video Models
A Multigrid Method for Efficiently Training Video Models
Chaoxia Wu
Ross B. Girshick
Kaiming He
Christoph Feichtenhofer
Philipp Krahenbuhl
21
94
0
02 Dec 2019
STConvS2S: Spatiotemporal Convolutional Sequence to Sequence Network for
  Weather Forecasting
STConvS2S: Spatiotemporal Convolutional Sequence to Sequence Network for Weather Forecasting
Rafaela C. Nascimento
Y. M. Souto
Eduardo S. Ogasawara
Fábio Porto
Eduardo Bezerra
AI4TS
17
83
0
30 Nov 2019
Grouped Spatial-Temporal Aggregation for Efficient Action Recognition
Grouped Spatial-Temporal Aggregation for Efficient Action Recognition
Chenxu Luo
Alan Yuille
130
150
0
28 Sep 2019
Comparative Analysis of CNN-based Spatiotemporal Reasoning in Videos
Comparative Analysis of CNN-based Spatiotemporal Reasoning in Videos
Okan Kopuklu
Fabian Herzog
Gerhard Rigoll
22
6
0
11 Sep 2019
Exploring Temporal Differences in 3D Convolutional Neural Networks
Exploring Temporal Differences in 3D Convolutional Neural Networks
Gagan Kanojia
Sudhakar Kumawat
Shanmuganathan Raman
3DPC
AI4TS
21
3
0
07 Sep 2019
A Novel Approach for Robust Multi Human Action Recognition and
  Summarization based on 3D Convolutional Neural Networks
A Novel Approach for Robust Multi Human Action Recognition and Summarization based on 3D Convolutional Neural Networks
Noor Almaadeed
O. Elharrouss
S. Al-Maadeed
Ahmed Bouridane
Azeddine Beghdadi
3DH
16
14
0
25 Jul 2019
Video Modeling with Correlation Networks
Video Modeling with Correlation Networks
Heng Wang
Du Tran
Lorenzo Torresani
Matt Feiszli
24
127
0
07 Jun 2019
Exploring Temporal Information for Improved Video Understanding
Exploring Temporal Information for Improved Video Understanding
Yi Zhu
23
0
0
25 May 2019
Large Scale Holistic Video Understanding
Large Scale Holistic Video Understanding
Ali Diba
Mohsen Fayyaz
Vivek Sharma
Manohar Paluri
Jurgen Gall
Rainer Stiefelhagen
Luc Van Gool
29
35
0
25 Apr 2019
DynamoNet: Dynamic Action and Motion Network
DynamoNet: Dynamic Action and Motion Network
Ali Diba
Vivek Sharma
Luc Van Gool
Rainer Stiefelhagen
30
110
0
25 Apr 2019
Spatiotemporal Pyramid Network for Video Action Recognition
Spatiotemporal Pyramid Network for Video Action Recognition
Yunbo Wang
Mingsheng Long
Jianmin Wang
Philip S. Yu
32
227
0
04 Mar 2019
DistInit: Learning Video Representations Without a Single Labeled Video
DistInit: Learning Video Representations Without a Single Labeled Video
Rohit Girdhar
Du Tran
Lorenzo Torresani
Deva Ramanan
27
54
0
26 Jan 2019
Semantic Image Networks for Human Action Recognition
Semantic Image Networks for Human Action Recognition
Sunder Ali Khowaja
Seok-Lyong Lee
21
32
0
21 Jan 2019
A Survey of the Recent Architectures of Deep Convolutional Neural
  Networks
A Survey of the Recent Architectures of Deep Convolutional Neural Networks
Asifullah Khan
A. Sohail
Umme Zahoora
Aqsa Saeed Qureshi
OOD
65
2,271
0
17 Jan 2019
3D PersonVLAD: Learning Deep Global Representations for Video-based
  Person Re-identification
3D PersonVLAD: Learning Deep Global Representations for Video-based Person Re-identification
Lin Wu
Yang Wang
Ling Shao
Ming Wang
3DPC
19
94
0
26 Dec 2018
Learning with privileged information via adversarial discriminative
  modality distillation
Learning with privileged information via adversarial discriminative modality distillation
Nuno C. Garcia
Pietro Morerio
Vittorio Murino
27
67
0
19 Oct 2018
Temporal-Spatial Mapping for Action Recognition
Temporal-Spatial Mapping for Action Recognition
Xiaolin Song
Cuiling Lan
Wenjun Zeng
Junliang Xing
Jingyu Yang
Xiaoyan Sun
33
48
0
11 Sep 2018
DFTerNet: Towards 2-bit Dynamic Fusion Networks for Accurate Human
  Activity Recognition
DFTerNet: Towards 2-bit Dynamic Fusion Networks for Accurate Human Activity Recognition
Zhan Yang
Osolo Ian Raymond
Chengyuan Zhang
Ying Wan
J. Long
CVBM
42
36
0
31 Jul 2018
Spatio-Temporal Channel Correlation Networks for Action Classification
Spatio-Temporal Channel Correlation Networks for Action Classification
Ali Diba
Mohsen Fayyaz
Vivek Sharma
M. M. Arzani
Rahman Yousefzadeh
Juergen Gall
Luc Van Gool
3DPC
26
181
0
19 Jun 2018
Modality Distillation with Multiple Stream Networks for Action
  Recognition
Modality Distillation with Multiple Stream Networks for Action Recognition
Nuno C. Garcia
Pietro Morerio
Vittorio Murino
30
181
0
19 Jun 2018
Deep Spatiotemporal Representation of the Face for Automatic Pain
  Intensity Estimation
Deep Spatiotemporal Representation of the Face for Automatic Pain Intensity Estimation
M. Tavakolian
Abdenour Hadid
CVBM
MedIm
3DH
18
19
0
18 Jun 2018
Needle Tip Force Estimation using an OCT Fiber and a Fused convGRU-CNN
  Architecture
Needle Tip Force Estimation using an OCT Fiber and a Fused convGRU-CNN Architecture
N. Gessert
Torben Priegnitz
T. Saathoff
Sven-Thomas Antoni
David Meyer
M. Hamann
K. Jünemann
Christoph Otte
Alexander Schlaefer
24
10
0
30 May 2018
Impression Network for Video Object Detection
Impression Network for Video Object Detection
Congrui Hetang
Hongwei Qin
Shaohui Liu
Junjie Yan
ObjD
14
31
0
16 Dec 2017
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in
  Video Classification
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
Saining Xie
Chen Sun
Jonathan Huang
Zhuowen Tu
Kevin Patrick Murphy
3DH
52
1,309
0
13 Dec 2017
A Closer Look at Spatiotemporal Convolutions for Action Recognition
A Closer Look at Spatiotemporal Convolutions for Action Recognition
Du Tran
Heng Wang
Lorenzo Torresani
Jamie Ray
Yann LeCun
Manohar Paluri
93
2,990
0
30 Nov 2017
Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks
Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks
Zhaofan Qiu
Ting Yao
Tao Mei
13
1,650
0
28 Nov 2017
Hierarchical Video Generation from Orthogonal Information: Optical Flow
  and Texture
Hierarchical Video Generation from Orthogonal Information: Optical Flow and Texture
Katsunori Ohnishi
Shohei Yamamoto
Yoshitaka Ushiku
Tatsuya Harada
VGen
GAN
40
59
0
27 Nov 2017
Attention Clusters: Purely Attention Based Local Feature Integration for
  Video Classification
Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification
Xiang Long
Chuang Gan
Gerard de Melo
Jiajun Wu
Xiao-Chang Liu
Shilei Wen
34
208
0
27 Nov 2017
End-to-end Video-level Representation Learning for Action Recognition
End-to-end Video-level Representation Learning for Action Recognition
Jiagang Zhu
Wei Zou
Zheng Zhu
25
89
0
11 Nov 2017
12
Next