1406.2199
Cited By
Two-Stream Convolutional Networks for Action Recognition in Videos
9 June 2014
Karen Simonyan
Andrew Zisserman
Papers citing
"Two-Stream Convolutional Networks for Action Recognition in Videos"
50 / 2,275 papers shown
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
Saining Xie
Chen Sun
Jonathan Huang
Zhuowen Tu
Kevin Patrick Murphy
3DH
52
1,309
0
13 Dec 2017
Im2Flow: Motion Hallucination from Static Images for Action Recognition
Ruohan Gao
Bo Xiong
Kristen Grauman
33
92
0
12 Dec 2017
Learning Latent Super-Events to Detect Multiple Activities in Videos
A. Piergiovanni
Michael S. Ryoo
16
90
0
05 Dec 2017
Cooperative Training of Deep Aggregation Networks for RGB-D Action Recognition
Pichao Wang
Wanqing Li
Jun Wan
P. Ogunbona
Xinwang Liu
22
72
0
05 Dec 2017
Visual to Sound: Generating Natural Sound for Videos in the Wild
Yipin Zhou
Zhaowen Wang
Chen Fang
Trung Bui
Tamara L. Berg
VGen
28
206
0
04 Dec 2017
Object Classification using Ensemble of Local and Deep Features
Siddharth Srivastava
Prerana Mukherjee
Brejesh Lall
Kamlesh Jaiswal
34
7
0
04 Dec 2017
Compressed Video Action Recognition
Chao-Yuan Wu
Manzil Zaheer
Hexiang Hu
R. Manmatha
Alex Smola
Philipp Krahenbuhl
37
325
0
02 Dec 2017
Learning to Segment Moving Objects
P. Tokmakov
Cordelia Schmid
Karteek Alahari
VOS
36
97
0
01 Dec 2017
Label Efficient Learning of Transferable Representations across Domains and Tasks
Zelun Luo
Yuliang Zou
Judy Hoffman
Li Fei-Fei
39
275
0
30 Nov 2017
Graph Distillation for Action Detection with Privileged Modalities
Zelun Luo
Jun-Ting Hsieh
Lu Jiang
Juan Carlos Niebles
Li Fei-Fei
51
104
0
30 Nov 2017
Budget-Aware Activity Detection with A Recurrent Policy Network
Behrooz Mahasseni
Xiaodong Yang
Pavlo Molchanov
Jan Kautz
36
6
0
30 Nov 2017
An End-to-end 3D Convolutional Neural Network for Action Detection and Segmentation in Videos
Rui Hou
Chen Chen
M. Shah
MedIm
36
60
0
30 Nov 2017
Relation Networks for Object Detection
Han Hu
Jiayuan Gu
Zheng-Wei Zhang
Jifeng Dai
Yichen Wei
ObjD
42
1,222
0
30 Nov 2017
A Closer Look at Spatiotemporal Convolutions for Action Recognition
Du Tran
Heng Wang
Lorenzo Torresani
Jamie Ray
Yann LeCun
Manohar Paluri
153
2,990
0
30 Nov 2017
Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition
Shuyang Sun
Zhanghui Kuang
Wanli Ouyang
Lu Sheng
Wayne Zhang
40
293
0
29 Nov 2017
Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks
Zhaofan Qiu
Ting Yao
Tao Mei
39
1,651
0
28 Nov 2017
Revisiting hand-crafted feature for action recognition: a set of improved dense trajectories
K. Matsui
Toru Tamaki
Gwladys Auffret
B. Raytchev
K. Kaneda
26
0
0
28 Nov 2017
Hierarchical Video Generation from Orthogonal Information: Optical Flow and Texture
Katsunori Ohnishi
Shohei Yamamoto
Yoshitaka Ushiku
Tatsuya Harada
VGen
GAN
45
59
0
27 Nov 2017
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?
Kensho Hara
Hirokatsu Kataoka
Y. Satoh
3DPC
72
1,914
0
27 Nov 2017
Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification
Xiang Long
Chuang Gan
Gerard de Melo
Jiajun Wu
Xiao-Chang Liu
Shilei Wen
34
208
0
27 Nov 2017
Predictive Learning: Using Future Representation Learning Variantial Autoencoder for Human Action Prediction
Runsheng Yu
Zhenyu Shi
Qiongxiong Ma
Laiyun Qing
3DH
DRL
31
4
0
25 Nov 2017
Appearance-and-Relation Networks for Video Classification
Limin Wang
Wei Li
Wen Li
Luc Van Gool
39
350
0
24 Nov 2017
Summarizing First-Person Videos from Third Persons' Points of Views
Hsuan-I Ho
Wei-Chen Chiu
Y. Wang
EgoV
3DH
37
28
0
24 Nov 2017
Deep Video Generation, Prediction and Completion of Human Action Sequences
Haoye Cai
Chunyan Bai
Yu-Wing Tai
Chi-Keung Tang
VGen
29
143
0
23 Nov 2017
Temporal Relational Reasoning in Videos
Bolei Zhou
A. Andonian
Aude Oliva
Antonio Torralba
NAI
41
1,033
0
22 Nov 2017
Three-Stream Convolutional Networks for Video-based Person Re-Identification
Zeng Yu
Tianrui Li
Ning Yu
Xun Gong
Ke Chen
Yi Pan
16
6
0
22 Nov 2017
Multi-Level Recurrent Residual Networks for Action Recognition
Zhenxing Zheng
Gaoyun An
Q. Ruan
13
12
0
22 Nov 2017
Temporal 3D ConvNets: New Architecture and Transfer Learning for Video Classification
Ali Diba
Mohsen Fayyaz
Vivek Sharma
A. Karami
M. M. Arzani
Rahman Yousefzadeh
Luc Van Gool
26
241
0
22 Nov 2017
Non-local Neural Networks
Xiaolong Wang
Ross B. Girshick
Abhinav Gupta
Kaiming He
OffRL
115
8,829
0
21 Nov 2017
Action Recognition with Coarse-to-Fine Deep Feature Integration and Asynchronous Fusion
Weiyao Lin
Yang Mi
Jianxin Wu
K. Lu
H. Xiong
27
37
0
20 Nov 2017
Attend and Interact: Higher-Order Object Interactions for Video Understanding
Chih-Yao Ma
Asim Kadav
I. Melvin
Z. Kira
G. Al-Regib
H. Graf
33
145
0
16 Nov 2017
Occlusion Aware Unsupervised Learning of Optical Flow
Yang Wang
Yezhou Yang
Zhenheng Yang
Liang Zhao
Peng Wang
Wei Xu
SSL
30
308
0
16 Nov 2017
A Correlation Based Feature Representation for First-Person Activity Recognition
R. Kahani
Alireza Talebpour
Ahmad Mahmoudi-Aznaveh
30
12
0
15 Nov 2017
End-to-end Video-level Representation Learning for Action Recognition
Jiagang Zhu
Wei Zou
Zheng Zhu
25
89
0
11 Nov 2017
Two-stream Collaborative Learning with Spatial-Temporal Attention for Video Classification
Yuxin Peng
Yunzhen Zhao
Junchao Zhang
32
115
0
09 Nov 2017
Attentional Pooling for Action Recognition
Rohit Girdhar
Deva Ramanan
24
319
0
04 Nov 2017
Dual Skipping Networks
Changmao Cheng
Yanwei Fu
Yu-Gang Jiang
Wei Liu
Wenlian Lu
Jianfeng Feng
Xiangyang Xue
35
1
0
28 Oct 2017
Class Correlation affects Single Object Localization using Pre-trained ConvNets
P. H. Vardhan
Kunal Sekhri
Dipan K. Pal
Marios Savvides
13
0
0
26 Oct 2017
ContextVP: Fully Context-Aware Video Prediction
Wonmin Byeon
Qin Wang
R. Srivastava
Petros Koumoutsakos
28
8
0
23 Oct 2017
ActivityNet Challenge 2017 Summary
Bernard Ghanem
Juan Carlos Niebles
Cees G. M. Snoek
Fabian Caba Heilbron
Humam Alwassel
Ranjay Krishna
Victor Escorcia
Kenji Hata
S. Buch
50
48
0
22 Oct 2017
Generalized Zero-Shot Learning for Action Recognition with Web-Scale Video Data
Kun Liu
Wu Liu
Huadong Ma
Wenbing Huang
Xiongxiong Dong
36
40
0
20 Oct 2017
Learning to Recognize Actions from Limited Training Examples Using a Recurrent Spiking Neural Model
Priyadarshini Panda
N. Srinivasa
12
27
0
19 Oct 2017
Pose-based Deep Gait Recognition
Anna Sokolova
Anton Konushin
CVBM
28
43
0
17 Oct 2017
Single Shot Temporal Action Detection
Tianwei Lin
Xu Zhao
Zheng Shou
36
451
0
17 Oct 2017
Video Classification With CNNs: Using The Codec As A Spatio-Temporal Activity Sensor
Aaron Chadha
Alhabib Abbas
Y. Andreopoulos
29
37
0
14 Oct 2017
Detect to Track and Track to Detect
Christoph Feichtenhofer
A. Pinz
Andrew Zisserman
VOT
36
561
0
11 Oct 2017
Real-Time Action Detection in Video Surveillance using Sub-Action Descriptor with Multi-CNN
Cheng-Bin Jin
Shengzhe Li
Hakil Kim
38
26
0
10 Oct 2017
Detecting the Moment of Completion: Temporal Models for Localising Action Completion
Farnoosh Heidarivincheh
Majid Mirmehdi
Dima Damen
39
5
0
06 Oct 2017
Monitoring tool usage in surgery videos using boosted convolutional and recurrent neural networks
Hassan Al Hajj
M. Lamard
Pierre-Henri Conze
B. Cochener
G. Quellec
41
3
0
04 Oct 2017
Translating Videos to Commands for Robotic Manipulation with Deep Recurrent Neural Networks
Anh Nguyen
Dimitrios Kanoulas
L. Muratore
D. Caldwell
Nikos G. Tsagarakis
29
71
0
01 Oct 2017