1406.2199
Cited By
Two-Stream Convolutional Networks for Action Recognition in Videos
9 June 2014
Karen Simonyan
Andrew Zisserman
Papers citing
"Two-Stream Convolutional Networks for Action Recognition in Videos"
50 / 2,275 papers shown
Title
Temporal Action Detection with Multi-level Supervision
Baifeng Shi
Qi Dai
Judy Hoffman
Kate Saenko
Trevor Darrell
Huijuan Xu
18
13
0
24 Nov 2020
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks
Humam Alwassel
Silvio Giancola
Guohao Li
35
123
0
23 Nov 2020
We don't Need Thousand Proposals: Single Shot Actor-Action Detection in Videos
A. J. Rana
Yogesh S Rawat
ViT
13
11
0
22 Nov 2020
Boundary-sensitive Pre-training for Temporal Localization in Videos
Mengmeng Xu
Juan-Manuel Perez-Rua
Victor Escorcia
Brais Martínez
Xiatian Zhu
Li Zhang
Guohao Li
Tao Xiang
33
61
0
21 Nov 2020
Visual Recognition of Great Ape Behaviours in the Wild
Faizaan Sakib
T. Burghardt
27
24
0
21 Nov 2020
3D attention mechanism for fine-grained classification of table tennis strokes using a Twin Spatio-Temporal Convolutional Neural Networks
Pierre-Etienne Martin
J. Benois-Pineau
Renaud Péteri
J. Morlier
3DPC
30
12
0
20 Nov 2020
Consistency-Aware Graph Network for Human Interaction Understanding
Zhenhua Wang
Jiajun Meng
Dongyan Guo
Jianhua Zhang
Javen Qinfeng Shi
Shengyong Chen
GNN
14
3
0
20 Nov 2020
Action Duration Prediction for Segment-Level Alignment of Weakly-Labeled Videos
Reza Ghoddoosian
S. Sayed
V. Athitsos
AI4TS
14
7
0
20 Nov 2020
HMFlow: Hybrid Matching Optical Flow Network for Small and Fast-Moving Objects
Suihanjin Yu
Youming Zhang
Chen Wang
Xiao Bai
Liang Zhang
Edwin R. Hancock
27
3
0
19 Nov 2020
TRAT: Tracking by Attention Using Spatio-Temporal Features
Hasan Saribas
Hakan Çevikalp
Okan Kopuklu
Bedirhan Uzun
21
25
0
18 Nov 2020
Continuous Emotion Recognition with Spatiotemporal Convolutional Neural Networks
Thomas Teixeira
Eric Granger
Alessandro Lameiras Koerich
CVBM
33
9
0
18 Nov 2020
Double-Prong ConvLSTM for Spatiotemporal Occupancy Prediction in Dynamic Environments
Maneekwan Toyungyernsub
Masha Itkina
Ransalu Senanayake
Mykel J. Kochenderfer
42
22
0
18 Nov 2020
Video Big Data Analytics in the Cloud: A Reference Architecture, Survey, Opportunities, and Open Research Issues
A. Alam
I. Ullah
Young-Koo Lee
47
22
0
16 Nov 2020
JOLO-GCN: Mining Joint-Centered Light-Weight Information for Skeleton-Based Action Recognition
Jinmiao Cai
Nianjuan Jiang
Xiaoguang Han
Kui Jia
Jiangbo Lu
27
84
0
16 Nov 2020
SALAD: Self-Assessment Learning for Action Detection
Guillaume Vaudaux-Ruth
Adrien Chan-Hon-Tong
Catherine Achard
19
8
0
13 Nov 2020
Universal Embeddings for Spatio-Temporal Tagging of Self-Driving Logs
Sean Segal
Eric Kee
Wenjie Luo
Abbas Sadat
Ersin Yumer
R. Urtasun
30
11
0
12 Nov 2020
Transformers for One-Shot Visual Imitation
Sudeep Dasari
Abhinav Gupta
LM&Ro
34
91
0
11 Nov 2020
Skeleton-based Relational Reasoning for Group Activity Analysis
Mauricio Perez
Jun Liu
Alex C. Kot
10
43
0
11 Nov 2020
Selective Spatio-Temporal Aggregation Based Pose Refinement System: Towards Understanding Human Activities in Real-World Videos
Di Yang
Rui Dai
Yaohui Wang
Rupayan Mallick
Luca Minciullo
Gianpiero Francesca
Francois Bremond
38
16
0
10 Nov 2020
Temporal Stochastic Softmax for 3D CNNs: An Application in Facial Expression Recognition
T. Ayral
M. Pedersoli
Simon L Bacon
Eric Granger
CVBM
3DH
13
11
0
10 Nov 2020
Multi-modal Fusion for Single-Stage Continuous Gesture Recognition
Harshala Gammulle
Simon Denman
Sridha Sridharan
Clinton Fookes
SLR
32
29
0
10 Nov 2020
STCNet: Spatio-Temporal Cross Network for Industrial Smoke Detection
Yichao Cao
Qingfei Tang
Xiaobo Lu
Fan Li
Jinde Cao
22
3
0
10 Nov 2020
An Empirical Study of Visual Features for DNN based Audio-Visual Speech Enhancement in Multi-talker Environments
Shrishti Saha Shetu
Soumitro Chakrabarty
Emanuel Habets
28
2
0
09 Nov 2020
FlowCaps: Optical Flow Estimation with Capsule Networks For Action Recognition
Vinoj Jayasundara
D. Roy
Basura Fernando
3DPC
26
3
0
08 Nov 2020
Multi-Temporal Convolutions for Human Action Recognition in Videos
Alexandros Stergiou
R. Poppe
29
1
0
08 Nov 2020
Mutual Modality Learning for Video Action Classification
Stepan Alekseevich Komkov
Maksim Dzabraev
Aleksandr Petiushko
27
9
0
04 Nov 2020
Deep Multimodality Learning for UAV Video Aesthetic Quality Assessment
Qi Kuang
Xin Jin
Qinping Zhao
Bin Zhou
39
29
0
04 Nov 2020
S3-Net: A Fast and Lightweight Video Scene Understanding Network by Single-shot Segmentation
Yuan Cheng
Yuchao Yang
Hai-Bao Chen
Ngai Wong
Hao Yu
3DPC
40
3
0
04 Nov 2020
PV-NAS: Practical Neural Architecture Search for Video Recognition
Zihao Wang
Chen Lin
Lu Sheng
Junjie Yan
Jing Shao
ViT
25
7
0
02 Nov 2020
Actor and Action Modular Network for Text-based Video Segmentation
Jianhua Yang
Yan Huang
K. Niu
Linjiang Huang
Zhanyu Ma
Liang Wang
19
9
0
02 Nov 2020
Multimodal and self-supervised representation learning for automatic gesture recognition in surgical robotics
Aniruddha Tamhane
J. Wu
Mathias Unberath
SSL
11
0
0
31 Oct 2020
Pose-based Body Language Recognition for Emotion and Psychiatric Symptom Interpretation
Zhengyuan Yang
Amanda Kay
Yuncheng Li
Wendi F. Cross
Jiebo Luo
33
18
0
30 Oct 2020
Exploring Dynamic Context for Multi-path Trajectory Prediction
Hao Cheng
Wentong Liao
Xuejiao Tang
M. Yang
Monika Sester
Bodo Rosenhahn
48
32
0
30 Oct 2020
SAR-NAS: Skeleton-based Action Recognition via Neural Architecture Searching
Haoyuan Zhang
Yonghong Hou
Pichao Wang
Zihui Guo
Wanqing Li
34
15
0
29 Oct 2020
ElderSim: A Synthetic Data Generation Platform for Human Action Recognition in Eldercare Applications
Hochul Hwang
Cheongjae Jang
Geonwoo Park
Junghyun Cho
Ig-Jae Kim
37
70
0
28 Oct 2020
Triple-view Convolutional Neural Networks for COVID-19 Diagnosis with Chest X-ray
Jianjia Zhang
19
2
0
27 Oct 2020
Object-aware Feature Aggregation for Video Object Detection
Qichuan Geng
Hong Zhang
Na Jiang
Xiaojuan Qi
Liangjun Zhang
Zhongjun Zhou
VOS
48
3
0
23 Oct 2020
Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition
Chun-Fu Chen
Yikang Shen
K. Ramakrishnan
Rogerio Feris
J. M. Cohn
A. Oliva
Quanfu Fan
23
96
0
22 Oct 2020
Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization
Yuanhao Zhai
Le Wang
Wei Tang
Qilin Zhang
Junsong Yuan
G. Hua
38
135
0
22 Oct 2020
BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues
Hung Le
Doyen Sahoo
Nancy F. Chen
Guosheng Lin
55
30
0
20 Oct 2020
Depth Guided Adaptive Meta-Fusion Network for Few-shot Video Recognition
Yuqian Fu
Li Zhang
Junke Wang
Yanwei Fu
Yu-Gang Jiang
30
95
0
20 Oct 2020
A Grid-based Representation for Human Action Recognition
Soufiane Lamghari
Guillaume-Alexandre Bilodeau
Nicolas Saunier
31
3
0
17 Oct 2020
Self-Selective Context for Interaction Recognition
Mert Kilickaya
Noureldien Hussein
E. Gavves
A. Smeulders
28
2
0
17 Oct 2020
Pose And Joint-Aware Action Recognition
Anshul B. Shah
Shlok Kumar Mishra
Ankan Bansal
Jun-Cheng Chen
Ramalingam Chellappa
Abhinav Shrivastava
50
33
0
16 Oct 2020
Egok360: A 360 Egocentric Kinetic Human Activity Video Dataset
Keshav Bhandari
Mario A. DeLaGarza
Ziliang Zong
Hugo Latapie
Yan Yan
EgoV
21
5
0
15 Oct 2020
HS-ResNet: Hierarchical-Split Block on Convolutional Neural Network
P. Yuan
Shufei Lin
Cheng Cui
Yuning Du
Ruoyu Guo
Dongliang He
Errui Ding
Shumin Han
29
43
0
15 Oct 2020
Unsupervised Video Anomaly Detection via Normalizing Flows with Implicit Latent Features
Myeongah Cho
Taeoh Kim
Woojin Kim
Suhwan Cho
Sangyoun Lee
22
92
0
15 Oct 2020
Pose Refinement Graph Convolutional Network for Skeleton-based Action Recognition
Shijie Li
Jinhui Yi
Yazan Abu Farha
Juergen Gall
14
35
0
14 Oct 2020
Back to the Future: Cycle Encoding Prediction for Self-supervised Contrastive Video Representation Learning
Xinyu Yang
Majid Mirmehdi
T. Burghardt
27
4
0
14 Oct 2020
Video Action Understanding
Matthew Hutchinson
V. Gadepally
43
20
0
13 Oct 2020