ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1712.04851
  4. Cited By
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in
  Video Classification
v1v2 (latest)

Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification

13 December 2017
Saining Xie
Chen Sun
Jonathan Huang
Zhuowen Tu
Kevin Patrick Murphy
    3DH
ArXiv (abs)PDFHTML

Papers citing "Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification"

50 / 675 papers shown
Actions as Moving Points
Actions as Moving PointsEuropean Conference on Computer Vision (ECCV), 2020
Shouqing Yang
Zixu Wang
Limin Wang
Gangshan Wu
355
119
0
14 Jan 2020
Lower Dimensional Kernels for Video Discriminators
Lower Dimensional Kernels for Video DiscriminatorsNeural Networks (NN), 2019
Emmanuel Kahembwe
S. Ramamoorthy
199
52
0
18 Dec 2019
Mimetics: Towards Understanding Human Actions Out of Context
Mimetics: Towards Understanding Human Actions Out of ContextInternational Journal of Computer Vision (IJCV), 2019
Philippe Weinzaepfel
Grégory Rogez
309
81
0
16 Dec 2019
End-to-End Learning of Visual Representations from Uncurated
  Instructional Videos
End-to-End Learning of Visual Representations from Uncurated Instructional VideosComputer Vision and Pattern Recognition (CVPR), 2019
Antoine Miech
Jean-Baptiste Alayrac
Lucas Smaira
Ivan Laptev
Josef Sivic
Andrew Zisserman
VGenSSL
626
756
0
13 Dec 2019
Identity Preserve Transform: Understand What Activity Classification
  Models Have Learnt
Identity Preserve Transform: Understand What Activity Classification Models Have Learnt
Jialing Lyu
Weichao Qiu
Xinyue Wei
Yi Zhang
Alan Yuille
Zhengjun Zha
VLM
96
3
0
13 Dec 2019
Listen to Look: Action Recognition by Previewing Audio
Listen to Look: Action Recognition by Previewing AudioComputer Vision and Pattern Recognition (CVPR), 2019
Ruohan Gao
Tae-Hyun Oh
Kristen Grauman
Lorenzo Torresani
VLM
348
284
0
10 Dec 2019
HalluciNet-ing Spatiotemporal Representations Using a 2D-CNN
HalluciNet-ing Spatiotemporal Representations Using a 2D-CNN
Paritosh Parmar
B. Morris
3DPC
204
10
0
10 Dec 2019
Video action detection by learning graph-based spatio-temporal
  interactions
Video action detection by learning graph-based spatio-temporal interactions
Matteo Tomei
Lorenzo Baraldi
Simone Calderara
Simone Bronzin
Rita Cucchiara
258
10
0
09 Dec 2019
Synthetic Humans for Action Recognition from Unseen Viewpoints
Synthetic Humans for Action Recognition from Unseen ViewpointsInternational Journal of Computer Vision (IJCV), 2019
Gül Varol
Ivan Laptev
Cordelia Schmid
Andrew Zisserman
355
104
0
09 Dec 2019
Context R-CNN: Long Term Temporal Context for Per-Camera Object
  Detection
Context R-CNN: Long Term Temporal Context for Per-Camera Object DetectionComputer Vision and Pattern Recognition (CVPR), 2019
Sara Beery
Guanhang Wu
V. Rathod
Ronny Votel
Jonathan Huang
ObjD
337
126
0
07 Dec 2019
RSA: Randomized Simulation as Augmentation for Robust Human Action
  Recognition
RSA: Randomized Simulation as Augmentation for Robust Human Action Recognition
Yi Zhang
Xinyue Wei
Weichao Qiu
Zihao Xiao
Gregory Hager
Alan Yuille
142
7
0
03 Dec 2019
A Multigrid Method for Efficiently Training Video Models
A Multigrid Method for Efficiently Training Video ModelsComputer Vision and Pattern Recognition (CVPR), 2019
Chaoxia Wu
Ross B. Girshick
Kaiming He
Christoph Feichtenhofer
Philipp Krahenbuhl
306
99
0
02 Dec 2019
More Is Less: Learning Efficient Video Representations by Big-Little
  Network and Depthwise Temporal Aggregation
More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal AggregationNeural Information Processing Systems (NeurIPS), 2019
Quanfu Fan
Chun-Fu Chen
Hilde Kuehne
Marco Pistoia
David D. Cox
240
133
0
02 Dec 2019
Gate-Shift Networks for Video Action Recognition
Gate-Shift Networks for Video Action RecognitionComputer Vision and Pattern Recognition (CVPR), 2019
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
3DPC
329
172
0
01 Dec 2019
Action Recognition via Pose-Based Graph Convolutional Networks with
  Intermediate Dense Supervision
Action Recognition via Pose-Based Graph Convolutional Networks with Intermediate Dense SupervisionPattern Recognition (Pattern Recognit.), 2019
Lei Shi
Yifan Zhang
Jian Cheng
Hanqing Lu
183
29
0
28 Nov 2019
Learning Efficient Video Representation with Video Shuffle Networks
Learning Efficient Video Representation with Video Shuffle Networks
Pingchuan Ma
Yao Zhou
Yu Lu
Wayne Zhang
171
7
0
26 Nov 2019
TEINet: Towards an Efficient Architecture for Video Recognition
TEINet: Towards an Efficient Architecture for Video RecognitionAAAI Conference on Artificial Intelligence (AAAI), 2019
Zhaoyang Liu
Donghao Luo
Yabiao Wang
Limin Wang
Ying Tai
Chengjie Wang
Jilin Li
Feiyue Huang
Tong Lu
ViT
170
264
0
21 Nov 2019
Mimic The Raw Domain: Accelerating Action Recognition in the Compressed
  Domain
Mimic The Raw Domain: Accelerating Action Recognition in the Compressed Domain
Barak Battash
H. Barad
Hanlin Tang
Amit Bleiweiss
234
36
0
19 Nov 2019
Tiny Video Networks
Tiny Video NetworksApplied AI Letters (AA), 2019
A. Piergiovanni
A. Angelova
Michael S. Ryoo
345
50
0
15 Oct 2019
CATER: A diagnostic dataset for Compositional Actions and TEmporal
  Reasoning
CATER: A diagnostic dataset for Compositional Actions and TEmporal ReasoningInternational Conference on Learning Representations (ICLR), 2019
Rohit Girdhar
Deva Ramanan
385
193
0
10 Oct 2019
Grouped Spatial-Temporal Aggregation for Efficient Action Recognition
Grouped Spatial-Temporal Aggregation for Efficient Action RecognitionIEEE International Conference on Computer Vision (ICCV), 2019
Chenxu Luo
Alan Yuille
288
169
0
28 Sep 2019
Scheduled Differentiable Architecture Search for Visual Recognition
Scheduled Differentiable Architecture Search for Visual Recognition
Zhaofan Qiu
Ting Yao
Yiheng Zhang
Yongdong Zhang
Tao Mei
OOD
144
3
0
23 Sep 2019
Retro-Actions: Learning 'Close' by Time-Reversing Ópen' Videos
Retro-Actions: Learning 'Close' by Time-Reversing Ópen' Videos
Will Price
Dima Damen
103
10
0
20 Sep 2019
Exploring Temporal Differences in 3D Convolutional Neural Networks
Exploring Temporal Differences in 3D Convolutional Neural NetworksCommunications in Computer and Information Science (CCIS), 2019
Gagan Kanojia
Sudhakar Kumawat
Shanmuganathan Raman
3DPCAI4TS
113
3
0
07 Sep 2019
PISEP^2: Pseudo Image Sequence Evolution based 3D Pose Prediction
PISEP^2: Pseudo Image Sequence Evolution based 3D Pose PredictionThe Visual Computer (Vis. Comput.), 2019
Xiaoli Liu
Jianqin Yin
Huaping Liu
Yilong Yin
3DH
166
7
0
04 Sep 2019
Deep Concept-wise Temporal Convolutional Networks for Action
  Localization
Deep Concept-wise Temporal Convolutional Networks for Action LocalizationACM Multimedia (ACM MM), 2019
Xin Li
Tianwei Lin
Xiao-Chang Liu
Chuang Gan
W. Zuo
Chong Li
Xiang Long
Dongliang He
Fu Li
Shilei Wen
189
32
0
26 Aug 2019
Action recognition with spatial-temporal discriminative filter banks
Action recognition with spatial-temporal discriminative filter banksIEEE International Conference on Computer Vision (ICCV), 2019
Brais Martínez
Davide Modolo
Yuanjun Xiong
Joseph Tighe
160
68
0
20 Aug 2019
TASED-Net: Temporally-Aggregating Spatial Encoder-Decoder Network for
  Video Saliency Detection
TASED-Net: Temporally-Aggregating Spatial Encoder-Decoder Network for Video Saliency DetectionIEEE International Conference on Computer Vision (ICCV), 2019
Kyle Min
Jason J. Corso
246
176
0
15 Aug 2019
Pseudo-Labeling and Confirmation Bias in Deep Semi-Supervised Learning
Pseudo-Labeling and Confirmation Bias in Deep Semi-Supervised LearningIEEE International Joint Conference on Neural Network (IJCNN), 2019
Eric Arazo
Diego Ortego
Paul Albert
Noel E. O'Connor
Kevin McGuinness
667
996
0
08 Aug 2019
STM: SpatioTemporal and Motion Encoding for Action Recognition
STM: SpatioTemporal and Motion Encoding for Action RecognitionIEEE International Conference on Computer Vision (ICCV), 2019
Boyuan Jiang
Mengmeng Wang
Weihao Gan
Wei Wu
Junjie Yan
417
436
0
07 Aug 2019
Multi-Agent Reinforcement Learning Based Frame Sampling for Effective
  Untrimmed Video Recognition
Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video RecognitionIEEE International Conference on Computer Vision (ICCV), 2019
Wenhao Wu
Dongliang He
Xiao Tan
Shifeng Chen
Shilei Wen
185
132
0
31 Jul 2019
Motion-Aware Feature for Improved Video Anomaly Detection
Motion-Aware Feature for Improved Video Anomaly DetectionBritish Machine Vision Conference (BMVC), 2019
Yi Zhu
Shawn D. Newsam
145
188
0
24 Jul 2019
An Efficient 3D CNN for Action/Object Segmentation in Video
An Efficient 3D CNN for Action/Object Segmentation in VideoBritish Machine Vision Conference (BMVC), 2019
Rui Hou
Chong Chen
Rahul Sukthankar
M. Shah
110
30
0
21 Jul 2019
Only Time Can Tell: Discovering Temporal Data for Temporal Modeling
Only Time Can Tell: Discovering Temporal Data for Temporal ModelingIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2019
Laura Sevilla-Lara
Shengxin Cindy Zha
Zhicheng Yan
Vedanuj Goswami
Matt Feiszli
Lorenzo Torresani
250
83
0
19 Jul 2019
AVD: Adversarial Video Distillation
AVD: Adversarial Video Distillation
M. Tavakolian
Mohammad Sabokrou
Abdenour Hadid
VGen
129
6
0
12 Jul 2019
Few-Shot Video Classification via Temporal Alignment
Few-Shot Video Classification via Temporal AlignmentComputer Vision and Pattern Recognition (CVPR), 2019
Kaidi Cao
Jingwei Ji
Zhangjie Cao
C. Chang
Juan Carlos Niebles
AI4TS
251
270
0
27 Jun 2019
Towards Real-Time Action Recognition on Mobile Devices Using Deep Models
Towards Real-Time Action Recognition on Mobile Devices Using Deep Models
Chen-Da Liu-Zhang
Xin-Xin Liu
Jianxin Wu
HAI
162
9
0
17 Jun 2019
Learning Spatio-Temporal Representation with Local and Global Diffusion
Learning Spatio-Temporal Representation with Local and Global DiffusionComputer Vision and Pattern Recognition (CVPR), 2019
Zhaofan Qiu
Ting Yao
Chong-Wah Ngo
Xinmei Tian
Tao Mei
160
178
0
13 Jun 2019
Recognizing Manipulation Actions from State-Transformations
Recognizing Manipulation Actions from State-TransformationsComputer Vision and Pattern Recognition (CVPR), 2019
N. A. Bakr
James L. Crowley
Rémi Ronfard
97
14
0
12 Jun 2019
FASTER Recurrent Networks for Efficient Video Classification
FASTER Recurrent Networks for Efficient Video Classification
Linchao Zhu
Laura Sevilla-Lara
Du Tran
Matt Feiszli
Yi Yang
Heng Wang
169
7
0
10 Jun 2019
Video Modeling with Correlation Networks
Video Modeling with Correlation NetworksComputer Vision and Pattern Recognition (CVPR), 2019
Heng Wang
Du Tran
Lorenzo Torresani
Matt Feiszli
438
144
0
07 Jun 2019
Scaling Autoregressive Video Models
Scaling Autoregressive Video ModelsInternational Conference on Learning Representations (ICLR), 2019
Dirk Weissenborn
Oscar Täckström
Jakob Uszkoreit
DiffMVGen
400
236
0
06 Jun 2019
Design Light-weight 3D Convolutional Networks for Video Recognition
  Temporal Residual, Fully Separable Block, and Fast Algorithm
Design Light-weight 3D Convolutional Networks for Video Recognition Temporal Residual, Fully Separable Block, and Fast Algorithm
Haonan Wang
Jun Lin
Zhongfeng Wang
CVBM
124
4
0
31 May 2019
AssembleNet: Searching for Multi-Stream Neural Connectivity in Video
  Architectures
AssembleNet: Searching for Multi-Stream Neural Connectivity in Video ArchitecturesInternational Conference on Learning Representations (ICLR), 2019
Michael S. Ryoo
A. Piergiovanni
Mingxing Tan
A. Angelova
371
107
0
30 May 2019
Hierarchical Feature Aggregation Networks for Video Action Recognition
Hierarchical Feature Aggregation Networks for Video Action Recognition
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
FAtt
139
8
0
29 May 2019
Exploring Temporal Information for Improved Video Understanding
Exploring Temporal Information for Improved Video Understanding
Yi Zhu
185
0
0
25 May 2019
EnsembleNet: End-to-End Optimization of Multi-headed Models
EnsembleNet: End-to-End Optimization of Multi-headed Models
Hanhan Li
Joe Yue-Hei Ng
Apostol Natsev
152
17
0
24 May 2019
Lightweight Network Architecture for Real-Time Action Recognition
Lightweight Network Architecture for Real-Time Action RecognitionACM Symposium on Applied Computing (SAC), 2019
Alexander Kozlov
Vadim Andronov
Y. Gritsenko
ViT
110
39
0
21 May 2019
STAR: A Concise Deep Learning Framework for Citywide Human Mobility
  Prediction
STAR: A Concise Deep Learning Framework for Citywide Human Mobility PredictionInternational Conference on Mobile Data Management (MDM), 2019
Hongnian Wang
Han Su
HAI
122
17
0
16 May 2019
Follow the Attention: Combining Partial Pose and Object Motion for
  Fine-Grained Action Detection
Follow the Attention: Combining Partial Pose and Object Motion for Fine-Grained Action Detection
M. M. K. Moghaddam
Ehsan Abbasnejad
Javen Qinfeng Shi
191
2
0
11 May 2019
Previous
123...121314
Next
Page 13 of 14
Pageof 14