1712.04851
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
13 December 2017
Saining Xie
Chen Sun
Jonathan Huang
Zhuowen Tu
Kevin Patrick Murphy
3DH
Papers citing
"Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification"
50 / 675 papers shown
Actions as Moving Points
European Conference on Computer Vision (ECCV), 2020
Shouqing Yang
Zixu Wang
Limin Wang
Gangshan Wu
355
119
0
14 Jan 2020
Lower Dimensional Kernels for Video Discriminators
Neural Networks (NN), 2019
Emmanuel Kahembwe
S. Ramamoorthy
199
52
0
18 Dec 2019
Mimetics: Towards Understanding Human Actions Out of Context
International Journal of Computer Vision (IJCV), 2019
Philippe Weinzaepfel
Grégory Rogez
309
81
0
16 Dec 2019
End-to-End Learning of Visual Representations from Uncurated Instructional Videos
Computer Vision and Pattern Recognition (CVPR), 2019
Antoine Miech
Jean-Baptiste Alayrac
Lucas Smaira
Ivan Laptev
Josef Sivic
Andrew Zisserman
VGen
SSL
626
756
0
13 Dec 2019
Identity Preserve Transform: Understand What Activity Classification Models Have Learnt
Jialing Lyu
Weichao Qiu
Xinyue Wei
Yi Zhang
Alan Yuille
Zhengjun Zha
VLM
96
3
0
13 Dec 2019
Listen to Look: Action Recognition by Previewing Audio
Computer Vision and Pattern Recognition (CVPR), 2019
Ruohan Gao
Tae-Hyun Oh
Kristen Grauman
Lorenzo Torresani
VLM
348
284
0
10 Dec 2019
HalluciNet-ing Spatiotemporal Representations Using a 2D-CNN
Paritosh Parmar
B. Morris
3DPC
204
10
0
10 Dec 2019
Video action detection by learning graph-based spatio-temporal interactions
Matteo Tomei
Lorenzo Baraldi
Simone Calderara
Simone Bronzin
Rita Cucchiara
258
10
0
09 Dec 2019
Synthetic Humans for Action Recognition from Unseen Viewpoints
International Journal of Computer Vision (IJCV), 2019
Gül Varol
Ivan Laptev
Cordelia Schmid
Andrew Zisserman
355
104
0
09 Dec 2019
Context R-CNN: Long Term Temporal Context for Per-Camera Object Detection
Computer Vision and Pattern Recognition (CVPR), 2019
Sara Beery
Guanhang Wu
V. Rathod
Ronny Votel
Jonathan Huang
ObjD
337
126
0
07 Dec 2019
RSA: Randomized Simulation as Augmentation for Robust Human Action Recognition
Yi Zhang
Xinyue Wei
Weichao Qiu
Zihao Xiao
Gregory Hager
Alan Yuille
142
7
0
03 Dec 2019
A Multigrid Method for Efficiently Training Video Models
Computer Vision and Pattern Recognition (CVPR), 2019
Chao-Yuan Wu
Ross B. Girshick
Kaiming He
Christoph Feichtenhofer
Philipp Krahenbuhl
306
99
0
02 Dec 2019
More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal Aggregation
Neural Information Processing Systems (NeurIPS), 2019
Quanfu Fan
Chun-Fu Chen
Hilde Kuehne
Marco Pistoia
David D. Cox
240
133
0
02 Dec 2019
Gate-Shift Networks for Video Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2019
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
3DPC
329
172
0
01 Dec 2019
Action Recognition via Pose-Based Graph Convolutional Networks with Intermediate Dense Supervision
Pattern Recognition (Pattern Recognit.), 2019
Lei Shi
Yifan Zhang
Jian Cheng
Hanqing Lu
183
29
0
28 Nov 2019
Learning Efficient Video Representation with Video Shuffle Networks
Pingchuan Ma
Yao Zhou
Yu Lu
Wayne Zhang
171
7
0
26 Nov 2019
TEINet: Towards an Efficient Architecture for Video Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2019
Zhaoyang Liu
Donghao Luo
Yabiao Wang
Limin Wang
Ying Tai
Chengjie Wang
Jilin Li
Feiyue Huang
Tong Lu
ViT
170
264
0
21 Nov 2019
Mimic The Raw Domain: Accelerating Action Recognition in the Compressed Domain
Barak Battash
H. Barad
Hanlin Tang
Amit Bleiweiss
234
36
0
19 Nov 2019
Tiny Video Networks
Applied AI Letters (AA), 2019
A. Piergiovanni
A. Angelova
Michael S. Ryoo
345
50
0
15 Oct 2019
CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
International Conference on Learning Representations (ICLR), 2019
Rohit Girdhar
Deva Ramanan
385
193
0
10 Oct 2019
Grouped Spatial-Temporal Aggregation for Efficient Action Recognition
IEEE International Conference on Computer Vision (ICCV), 2019
Chenxu Luo
Alan Yuille
288
169
0
28 Sep 2019
Scheduled Differentiable Architecture Search for Visual Recognition
Zhaofan Qiu
Ting Yao
Yiheng Zhang
Yongdong Zhang
Tao Mei
OOD
144
3
0
23 Sep 2019
Retro-Actions: Learning 'Close' by Time-Reversing 'Open' Videos
Will Price
Dima Damen
103
10
0
20 Sep 2019
Exploring Temporal Differences in 3D Convolutional Neural Networks
Communications in Computer and Information Science (CCIS), 2019
Gagan Kanojia
Sudhakar Kumawat
Shanmuganathan Raman
3DPC
AI4TS
113
3
0
07 Sep 2019
PISEP^2: Pseudo Image Sequence Evolution based 3D Pose Prediction
The Visual Computer (Vis. Comput.), 2019
Xiaoli Liu
Jianqin Yin
Huaping Liu
Yilong Yin
3DH
166
7
0
04 Sep 2019
Deep Concept-wise Temporal Convolutional Networks for Action Localization
ACM Multimedia (ACM MM), 2019
Xin Li
Tianwei Lin
Xiao-Chang Liu
Chuang Gan
W. Zuo
Chong Li
Xiang Long
Dongliang He
Fu Li
Shilei Wen
189
32
0
26 Aug 2019
Action recognition with spatial-temporal discriminative filter banks
IEEE International Conference on Computer Vision (ICCV), 2019
Brais Martínez
Davide Modolo
Yuanjun Xiong
Joseph Tighe
160
68
0
20 Aug 2019
TASED-Net: Temporally-Aggregating Spatial Encoder-Decoder Network for Video Saliency Detection
IEEE International Conference on Computer Vision (ICCV), 2019
Kyle Min
Jason J. Corso
246
176
0
15 Aug 2019
Pseudo-Labeling and Confirmation Bias in Deep Semi-Supervised Learning
IEEE International Joint Conference on Neural Network (IJCNN), 2019
Eric Arazo
Diego Ortego
Paul Albert
Noel E. O'Connor
Kevin McGuinness
667
996
0
08 Aug 2019
STM: SpatioTemporal and Motion Encoding for Action Recognition
IEEE International Conference on Computer Vision (ICCV), 2019
Boyuan Jiang
Mengmeng Wang
Weihao Gan
Wei Wu
Junjie Yan
417
436
0
07 Aug 2019
Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition
IEEE International Conference on Computer Vision (ICCV), 2019
Wenhao Wu
Dongliang He
Xiao Tan
Shifeng Chen
Shilei Wen
185
132
0
31 Jul 2019
Motion-Aware Feature for Improved Video Anomaly Detection
British Machine Vision Conference (BMVC), 2019
Yi Zhu
Shawn D. Newsam
145
188
0
24 Jul 2019
An Efficient 3D CNN for Action/Object Segmentation in Video
British Machine Vision Conference (BMVC), 2019
Rui Hou
Chong Chen
Rahul Sukthankar
M. Shah
110
30
0
21 Jul 2019
Only Time Can Tell: Discovering Temporal Data for Temporal Modeling
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2019
Laura Sevilla-Lara
Shengxin Cindy Zha
Zhicheng Yan
Vedanuj Goswami
Matt Feiszli
Lorenzo Torresani
250
83
0
19 Jul 2019
AVD: Adversarial Video Distillation
M. Tavakolian
Mohammad Sabokrou
Abdenour Hadid
VGen
129
6
0
12 Jul 2019
Few-Shot Video Classification via Temporal Alignment
Computer Vision and Pattern Recognition (CVPR), 2019
Kaidi Cao
Jingwei Ji
Zhangjie Cao
C. Chang
Juan Carlos Niebles
AI4TS
251
270
0
27 Jun 2019
Towards Real-Time Action Recognition on Mobile Devices Using Deep Models
Chen-Da Liu-Zhang
Xin-Xin Liu
Jianxin Wu
HAI
162
9
0
17 Jun 2019
Learning Spatio-Temporal Representation with Local and Global Diffusion
Computer Vision and Pattern Recognition (CVPR), 2019
Zhaofan Qiu
Ting Yao
Chong-Wah Ngo
Xinmei Tian
Tao Mei
160
178
0
13 Jun 2019
Recognizing Manipulation Actions from State-Transformations
Computer Vision and Pattern Recognition (CVPR), 2019
N. A. Bakr
James L. Crowley
Rémi Ronfard
97
14
0
12 Jun 2019
FASTER Recurrent Networks for Efficient Video Classification
Linchao Zhu
Laura Sevilla-Lara
Du Tran
Matt Feiszli
Yi Yang
Heng Wang
169
7
0
10 Jun 2019
Video Modeling with Correlation Networks
Computer Vision and Pattern Recognition (CVPR), 2019
Heng Wang
Du Tran
Lorenzo Torresani
Matt Feiszli
438
144
0
07 Jun 2019
Scaling Autoregressive Video Models
International Conference on Learning Representations (ICLR), 2019
Dirk Weissenborn
Oscar Täckström
Jakob Uszkoreit
DiffM
VGen
400
236
0
06 Jun 2019
Design Light-weight 3D Convolutional Networks for Video Recognition Temporal Residual, Fully Separable Block, and Fast Algorithm
Haonan Wang
Jun Lin
Zhongfeng Wang
CVBM
124
4
0
31 May 2019
AssembleNet: Searching for Multi-Stream Neural Connectivity in Video Architectures
International Conference on Learning Representations (ICLR), 2019
Michael S. Ryoo
A. Piergiovanni
Mingxing Tan
A. Angelova
371
107
0
30 May 2019
Hierarchical Feature Aggregation Networks for Video Action Recognition
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
FAtt
139
8
0
29 May 2019
Exploring Temporal Information for Improved Video Understanding
Yi Zhu
185
0
0
25 May 2019
EnsembleNet: End-to-End Optimization of Multi-headed Models
Hanhan Li
Joe Yue-Hei Ng
Apostol Natsev
152
17
0
24 May 2019
Lightweight Network Architecture for Real-Time Action Recognition
ACM Symposium on Applied Computing (SAC), 2019
Alexander Kozlov
Vadim Andronov
Y. Gritsenko
ViT
110
39
0
21 May 2019
STAR: A Concise Deep Learning Framework for Citywide Human Mobility Prediction
International Conference on Mobile Data Management (MDM), 2019
Hongnian Wang
Han Su
HAI
122
17
0
16 May 2019
Follow the Attention: Combining Partial Pose and Object Motion for Fine-Grained Action Detection
M. M. K. Moghaddam
Ehsan Abbasnejad
Javen Qinfeng Shi
191
2
0
11 May 2019