Two-Stream Convolutional Networks for Action Recognition in Videos

Neural Information Processing Systems (NeurIPS), 2014
9 June 2014
Karen Simonyan
Andrew Zisserman

Papers citing "Two-Stream Convolutional Networks for Action Recognition in Videos"

50 / 2,340 papers shown
Bidirectional Multirate Reconstruction for Temporal Modeling in Videos
Linchao Zhu
Zhongwen Xu
Yi Yang
158
78
0
28 Nov 2016
Online Real-time Multiple Spatiotemporal Action Localisation and Prediction
Gurkirt Singh
Suman Saha
Michael Sapienza
Juil Sock
Fabio Cuzzolin
429
299
0
25 Nov 2016
AdaScan: Adaptive Scan Pooling in Deep Convolutional Neural Networks for Human Action Recognition in Videos
Amlan Kar
Nishant Rai
Karan Sikka
Gaurav Sharma
298
157
0
24 Nov 2016
Multi-Modality Fusion based on Consensus-Voting and 3D Convolution for Isolated Gesture Recognition
Jiali Duan
Shuai Zhou
Jun Wan
Xiaoyuan Guo
Stan Z. Li
150
38
0
21 Nov 2016
Deep Temporal Linear Encoding Networks
Ali Diba
Vivek Sharma
Luc Van Gool
144
238
0
21 Nov 2016
Temporal Generative Adversarial Nets with Singular Value Clipping
Masaki Saito
Eiichi Matsumoto
Shunta Saito
GAN
245
485
0
21 Nov 2016
Deep Tensor Convolution on Multicores
David Budden
A. Matveev
Shibani Santurkar
S. Chaudhuri
Nir Shavit
185
39
0
20 Nov 2016
An End-to-End Spatio-Temporal Attention Model for Human Action Recognition from Skeleton Data
Sijie Song
Cuiling Lan
Junliang Xing
Wenjun Zeng
Jiaying Liu
363
1,031
0
18 Nov 2016
Deep Action- and Context-Aware Sequence Learning for Activity Recognition and Anticipation
Mohammad Sadegh Ali Akbarian
F. Saleh
Basura Fernando
Mathieu Salzmann
L. Petersson
Lars Andersson
137
11
0
17 Nov 2016
Temporal Convolutional Networks for Action Segmentation and Detection
Colin S. Lea
Michael D. Flynn
René Vidal
A. Reiter
Gregory Hager
307
1,809
0
16 Nov 2016
Learning long-term dependencies for action recognition with a biologically-inspired deep network
Yemin Shi
Yonghong Tian
Yaowei Wang
Tiejun Huang
194
65
0
16 Nov 2016
Joint Network based Attention for Action Recognition
Yemin Shi
Yonghong Tian
Yaowei Wang
Tiejun Huang
104
8
0
16 Nov 2016
Multispectral Deep Neural Networks for Pedestrian Detection
Jingjing Liu
Shaoting Zhang
Shu Wang
Dimitris N. Metaxas
3DH
169
412
0
08 Nov 2016
Action Recognition Based on Joint Trajectory Maps Using Convolutional Neural Networks
Pichao Wang
Zhihao Li
Yonghong Hou
W. Li
296
373
0
08 Nov 2016
Spatiotemporal Residual Networks for Video Action Recognition
Christoph Feichtenhofer
A. Pinz
Richard P. Wildes
353
740
0
07 Nov 2016
Exploiting Spatio-Temporal Structure with Recurrent Winner-Take-All Networks
Eder Santana
Matthew S. Emigh
Pablo Zegers
José C. Príncipe
BDL
403
16
0
31 Oct 2016
Real-time Online Action Detection Forests using Spatio-temporal Contexts
Seungryul Baek
K. Kim
Tae-Kyun Kim
149
24
0
28 Oct 2016
Review of Action Recognition and Detection Methods
Soo-Min Kang
Richard P. Wildes
183
59
0
21 Oct 2016
ARTiS: Appearance-based Action Recognition in Task Space for Real-Time Human-Robot Collaboration
M. Eich
S. Shirazi
G. Wyeth
137
2
0
18 Oct 2016
Semi-Coupled Two-Stream Fusion ConvNets for Action Recognition at Extremely Low Resolutions
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2016
Jiawei Chen
Jonathan Wu
Janusz Konrad
Prakash Ishwar
203
48
0
12 Oct 2016
Egocentric Height Estimation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2016
Jessie Finocchiaro
Aisha Urooj Khan
Ali Borji
MDE
EgoV
80
8
0
09 Oct 2016
Weakly supervised learning of actions from transcripts
Computer Vision and Image Understanding (CVIU), 2016
Hilde Kuehne
Alexander Richard
Juergen Gall
261
125
0
07 Oct 2016
Visual Question Answering: Datasets, Algorithms, and Future Challenges
Computer Vision and Image Understanding (CVIU), 2016
Kushal Kafle
Christopher Kanan
OOD
259
257
0
05 Oct 2016
Learning Language-Visual Embedding for Movie Understanding with Natural-Language
Atousa Torabi
Niket Tandon
Leonid Sigal
154
106
0
26 Sep 2016
Deep Learning for Video Classification and Captioning
Zuxuan Wu
Ting Yao
Yanwei Fu
Yu-Gang Jiang
3DV
VLM
185
139
0
22 Sep 2016
Deep CTR Prediction in Display Advertising
Junxuan Chen
Baigui Sun
Hao Li
Hongtao Lu
Xiansheng Hua
3DV
234
142
0
20 Sep 2016
Pose from Action: Unsupervised Learning of Pose Features based on Motion
Senthil Purushwalkam
Abhinav Gupta
SSL
140
23
0
18 Sep 2016
GeThR-Net: A Generalized Temporally Hybrid Recurrent Neural Network for Multimodal Information Fusion
Ankit Gandhi
Arjun Sharma
Arijit Biswas
Om Deshmukh
AI4TS
91
13
0
17 Sep 2016
Combining Texture and Shape Cues for Object Recognition With Minimal Supervision
Xingchao Peng
Kate Saenko
3DPC
62
4
0
14 Sep 2016
Using Spatial Pooler of Hierarchical Temporal Memory to classify noisy videos with predefined complexity
Maciej Wielgosz
Marcin Pietroń
82
10
0
10 Sep 2016
Sequential Deep Trajectory Descriptor for Action Recognition with Three-stream CNN
Yemin Shi
Yonghong Tian
Yaowei Wang
Tiejun Huang
143
199
0
10 Sep 2016
Generating Videos with Scene Dynamics
Carl Vondrick
Hamed Pirsiavash
Antonio Torralba
GAN
VGen
482
1,550
0
08 Sep 2016
Making a Case for Learning Motion Representations with Phase
S. Pintea
Jan van Gemert
117
11
0
06 Sep 2016
Deep-Anomaly: Fully Convolutional Neural Network for Fast Anomaly Detection in Crowded Scenes
Mohammad Sabokrou
Mohsen Fayyaz
Mahmood Fathy
Zahra Moayed
Reinhard Klette
217
455
0
03 Sep 2016
Transferring Object-Scene Convolutional Neural Networks for Event Recognition in Still Images
Limin Wang
Zhe Wang
Yu Qiao
Luc Van Gool
159
6
0
01 Sep 2016
Efficient Two-Stream Motion and Appearance 3D CNNs for Video Classification
Ali Diba
A. Pazandeh
Luc Van Gool
176
77
0
31 Aug 2016
What makes ImageNet good for transfer learning?
Minyoung Huh
Pulkit Agrawal
Alexei A. Efros
OOD
SSeg
VLM
SSL
364
699
0
30 Aug 2016
Human Action Recognition without Human
Yun He
Soma Shirakabe
Y. Satoh
Hirokatsu Kataoka
182
47
0
29 Aug 2016
Sympathy for the Details: Dense Trajectories and Hybrid Classification Architectures for Action Recognition
European Conference on Computer Vision (ECCV), 2016
César Roberto de Souza
Adrien Gaidon
E. Vig
A. Peña
119
43
0
25 Aug 2016
Searching Action Proposals via Spatial Actionness Estimation and Temporal Path Inference and Tracking
Asian Conference on Computer Vision (ACCV), 2016
Nannan Li
Dan Xu
Zhenqiang Ying
Zhihao Li
Ge Li
114
14
0
23 Aug 2016
Large-scale Continuous Gesture Recognition Using Convolutional Neural Networks
International Conference on Pattern Recognition (ICPR), 2016
Pichao Wang
W. Li
Song Liu
Yuyao Zhang
Zhimin Gao
P. Ogunbona
SLR
243
54
0
22 Aug 2016
STFCN: Spatio-Temporal FCN for Semantic Video Segmentation
Mohsen Fayyaz
M. H. Saffar
Mohammad Sabokrou
M. Fathy
Reinhard Klette
Fay Huang
225
57
0
21 Aug 2016
Leveraging Structural Context Models and Ranking Score Fusion for Human Interaction Prediction
IEEE Transactions on Multimedia (TMM), 2016
Qiuhong Ke
Bennamoun
Senjian An
F. Boussaïd
Ferdous Sohel
226
40
0
18 Aug 2016
Depth2Action: Exploring Embedded Depth for Large-Scale Action Recognition
Yi Zhu
Shawn D. Newsam
136
43
0
15 Aug 2016
About Pyramid Structure in Convolutional Neural Networks
I. Ullah
A. Petrosino
3DV
210
32
0
14 Aug 2016
Discriminatively Trained Latent Ordinal Model for Video Classification
Karan Sikka
Gaurav Sharma
179
11
0
08 Aug 2016
Signs in time: Encoding human motion as a temporal image
Joon Son Chung
Andrew Zisserman
SLR
120
14
0
06 Aug 2016
Fusing Deep Convolutional Networks for Large Scale Visual Concept Classification
H. Ergun
M. Sert
99
9
0
05 Aug 2016
Deep Learning for Detecting Multiple Space-Time Action Tubes in Videos
Suman Saha
Gurkirt Singh
Michael Sapienza
Juil Sock
Fabio Cuzzolin
ViT
230
213
0
04 Aug 2016
Modeling Spatial and Temporal Cues for Multi-label Facial Action Unit Detection
Wen-Sheng Chu
Fernando de la Torre
J. Cohn
138
19
0
02 Aug 2016