Two-Stream Convolutional Networks for Action Recognition in Videos

9 June 2014

Papers citing "Two-Stream Convolutional Networks for Action Recognition in Videos"

50 / 2,275 papers shown

Title
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification Saining Xie Chen Sun Jonathan Huang Zhuowen Tu Kevin Patrick Murphy 3DH 52 1,309 0 13 Dec 2017
Im2Flow: Motion Hallucination from Static Images for Action Recognition Ruohan Gao Bo Xiong Kristen Grauman 33 92 0 12 Dec 2017
Learning Latent Super-Events to Detect Multiple Activities in Videos A. Piergiovanni Michael S. Ryoo 16 90 0 05 Dec 2017
Cooperative Training of Deep Aggregation Networks for RGB-D Action Recognition Pichao Wang Wanqing Li Jun Wan P. Ogunbona Xinwang Liu 22 72 0 05 Dec 2017
Visual to Sound: Generating Natural Sound for Videos in the Wild Yipin Zhou Zhaowen Wang Chen Fang Trung Bui Tamara L. Berg VGen 28 206 0 04 Dec 2017
Object Classification using Ensemble of Local and Deep Features Siddharth Srivastava Prerana Mukherjee Brejesh Lall Kamlesh Jaiswal 34 7 0 04 Dec 2017
Compressed Video Action Recognition Chao-Yuan Wu Manzil Zaheer Hexiang Hu R. Manmatha Alex Smola Philipp Krahenbuhl 37 325 0 02 Dec 2017
Learning to Segment Moving Objects P. Tokmakov Cordelia Schmid Alahari Karteek VOS 36 97 0 01 Dec 2017
Label Efficient Learning of Transferable Representations across Domains and Tasks Zelun Luo Yuliang Zou Judy Hoffman Li Fei-Fei 39 275 0 30 Nov 2017
Graph Distillation for Action Detection with Privileged Modalities Zelun Luo Jun-Ting Hsieh Lu Jiang Juan Carlos Niebles Li Fei-Fei 51 104 0 30 Nov 2017
Budget-Aware Activity Detection with A Recurrent Policy Network Behrooz Mahasseni Xiaodong Yang Pavlo Molchanov Jan Kautz 36 6 0 30 Nov 2017
An End-to-end 3D Convolutional Neural Network for Action Detection and Segmentation in Videos Rui Hou Chen Chen M. Shah MedIm 36 60 0 30 Nov 2017
Relation Networks for Object Detection Han Hu Jiayuan Gu Zheng-Wei Zhang Jifeng Dai Yichen Wei ObjD 42 1,222 0 30 Nov 2017
A Closer Look at Spatiotemporal Convolutions for Action Recognition Du Tran Heng Wang Lorenzo Torresani Jamie Ray Yann LeCun Manohar Paluri 153 2,990 0 30 Nov 2017
Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition Shuyang Sun Zhanghui Kuang Wanli Ouyang Lu Sheng Wayne Zhang 40 293 0 29 Nov 2017
Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks Zhaofan Qiu Ting Yao Tao Mei 39 1,651 0 28 Nov 2017
Revisiting hand-crafted feature for action recognition: a set of improved dense trajectories K. Matsui Toru Tamaki Gwladys Auffret B. Raytchev K. Kaneda 26 0 0 28 Nov 2017
Hierarchical Video Generation from Orthogonal Information: Optical Flow and Texture Katsunori Ohnishi Shohei Yamamoto Yoshitaka Ushiku Tatsuya Harada VGen GAN 45 59 0 27 Nov 2017
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet? Kensho Hara Hirokatsu Kataoka Y. Satoh 3DPC 72 1,914 0 27 Nov 2017
Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification Xiang Long Chuang Gan Gerard de Melo Jiajun Wu Xiao-Chang Liu Shilei Wen 34 208 0 27 Nov 2017
Predictive Learning: Using Future Representation Learning Variantial Autoencoder for Human Action Prediction Runsheng Yu Zhenyu Shi Qiongxiong Ma Laiyun Qing 3DH DRL 31 4 0 25 Nov 2017
Appearance-and-Relation Networks for Video Classification Limin Wang Wei Li Wen Li Luc Van Gool 39 350 0 24 Nov 2017
Summarizing First-Person Videos from Third Persons' Points of Views Hsuan-I Ho Wei-Chen Chiu Y. Wang EgoV 3DH 37 28 0 24 Nov 2017
Deep Video Generation, Prediction and Completion of Human Action Sequences Haoye Cai Chunyan Bai Yu-Wing Tai Chi-Keung Tang VGen 29 143 0 23 Nov 2017
Temporal Relational Reasoning in Videos Bolei Zhou A. Andonian Aude Oliva Antonio Torralba NAI 41 1,033 0 22 Nov 2017
Three-Stream Convolutional Networks for Video-based Person Re-Identification Zeng Yu Tianrui Li Ning Yu Xun Gong Ke Chen Yi Pan 16 6 0 22 Nov 2017
Multi-Level Recurrent Residual Networks for Action Recognition Zhenxing Zheng Gaoyun An Q. Ruan 13 12 0 22 Nov 2017
Temporal 3D ConvNets: New Architecture and Transfer Learning for Video Classification Ali Diba Mohsen Fayyaz Vivek Sharma A. Karami M. M. Arzani Rahman Yousefzadeh Luc Van Gool 26 241 0 22 Nov 2017
Non-local Neural Networks Xinyu Wang Ross B. Girshick Abhinav Gupta Kaiming He OffRL 115 8,829 0 21 Nov 2017
Action Recognition with Coarse-to-Fine Deep Feature Integration and Asynchronous Fusion Weiyao Lin Yang Mi Jianxin Wu K. Lu H. Xiong 27 37 0 20 Nov 2017
Attend and Interact: Higher-Order Object Interactions for Video Understanding Chih-Yao Ma Asim Kadav I. Melvin Z. Kira G. Al-Regib H. Graf 33 145 0 16 Nov 2017
Occlusion Aware Unsupervised Learning of Optical Flow Yang Wang Yezhou Yang Zhenheng Yang Liang Zhao Peng Wang Wei Xu SSL 30 308 0 16 Nov 2017
A Correlation Based Feature Representation for First-Person Activity Recognition R. Kahani Alireza Talebpour Ahmad Mahmoudi-Aznaveh 30 12 0 15 Nov 2017
End-to-end Video-level Representation Learning for Action Recognition Jiagang Zhu Wei Zou Zheng Zhu 25 89 0 11 Nov 2017
Two-stream Collaborative Learning with Spatial-Temporal Attention for Video Classification Yuxin Peng Yunzhen Zhao Junchao Zhang 32 115 0 09 Nov 2017
Attentional Pooling for Action Recognition Rohit Girdhar Deva Ramanan 24 319 0 04 Nov 2017
Dual Skipping Networks Changmao Cheng Yanwei Fu Yu-Gang Jiang Wei Liu Wenlian Lu Jianfeng Feng Xiangyang Xue 35 1 0 28 Oct 2017
Class Correlation affects Single Object Localization using Pre-trained ConvNets P. H. Vardhan Kunal Sekhri Dipan K. Pal Marios Savvides 13 0 0 26 Oct 2017
ContextVP: Fully Context-Aware Video Prediction Wonmin Byeon Qin Wang R. Srivastava Petros Koumoutsakos 28 8 0 23 Oct 2017
ActivityNet Challenge 2017 Summary Guohao Li Juan Carlos Niebles Cees G. M. Snoek Fabian Caba Heilbron Humam Alwassel Ranjay Krishna Victor Escorcia Kenji Hata S. Buch 50 48 0 22 Oct 2017
Generalized Zero-Shot Learning for Action Recognition with Web-Scale Video Data Kun Liu Wu Liu Huadong Ma Wenbing Huang Xiongxiong Dong 36 40 0 20 Oct 2017
Learning to Recognize Actions from Limited Training Examples Using a Recurrent Spiking Neural Model Priyadarshini Panda N. Srinivasa 12 27 0 19 Oct 2017
Pose-based Deep Gait Recognition Anna Sokolova Anton Konushin CVBM 28 43 0 17 Oct 2017
Single Shot Temporal Action Detection Tianwei Lin Xu Zhao Zheng Shou 36 451 0 17 Oct 2017
Video Classification With CNNs: Using The Codec As A Spatio-Temporal Activity Sensor Aaron Chadha Alhabib Abbas Y. Andreopoulos 29 37 0 14 Oct 2017
Detect to Track and Track to Detect Christoph Feichtenhofer A. Pinz Andrew Zisserman VOT 36 561 0 11 Oct 2017
Real-Time Action Detection in Video Surveillance using Sub-Action Descriptor with Multi-CNN Cheng-Bin Jin Shengzhe Li Hakil Kim 38 26 0 10 Oct 2017
Detecting the Moment of Completion: Temporal Models for Localising Action Completion Farnoosh Heidarivincheh Majid Mirmehdi Dima Damen 39 5 0 06 Oct 2017
Monitoring tool usage in surgery videos using boosted convolutional and recurrent neural networks Hassan Al Hajj M. Lamard Pierre-Henri Conze B. Cochener G. Quellec 41 3 0 04 Oct 2017
Translating Videos to Commands for Robotic Manipulation with Deep Recurrent Neural Networks Anh Nguyen Dimitrios Kanoulas L. Muratore D. Caldwell Nikos G. Tsagarakis 29 71 0 01 Oct 2017