ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1406.2199
  4. Cited By
Two-Stream Convolutional Networks for Action Recognition in Videos

Two-Stream Convolutional Networks for Action Recognition in Videos

9 June 2014
Karen Simonyan
Andrew Zisserman
ArXivPDFHTML

Papers citing "Two-Stream Convolutional Networks for Action Recognition in Videos"

50 / 2,275 papers shown
Title
Temporal Action Detection with Multi-level Supervision
Temporal Action Detection with Multi-level Supervision
Baifeng Shi
Qi Dai
Judy Hoffman
Kate Saenko
Trevor Darrell
Huijuan Xu
18
13
0
24 Nov 2020
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization
  Tasks
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks
Humam Alwassel
Silvio Giancola
Guohao Li
35
123
0
23 Nov 2020
We don't Need Thousand Proposals$\colon$ Single Shot Actor-Action
  Detection in Videos
We don't Need Thousand Proposals ⁣:\colon: Single Shot Actor-Action Detection in Videos
A. J. Rana
Yogesh S Rawat
ViT
13
11
0
22 Nov 2020
Boundary-sensitive Pre-training for Temporal Localization in Videos
Boundary-sensitive Pre-training for Temporal Localization in Videos
Mengmeng Xu
Juan-Manuel Perez-Rua
Victor Escorcia
Brais Martínez
Xiatian Zhu
Li Zhang
Guohao Li
Tao Xiang
33
61
0
21 Nov 2020
Visual Recognition of Great Ape Behaviours in the Wild
Visual Recognition of Great Ape Behaviours in the Wild
Faizaan Sakib
T. Burghardt
27
24
0
21 Nov 2020
3D attention mechanism for fine-grained classification of table tennis
  strokes using a Twin Spatio-Temporal Convolutional Neural Networks
3D attention mechanism for fine-grained classification of table tennis strokes using a Twin Spatio-Temporal Convolutional Neural Networks
Pierre-Etienne Martin
J. Benois-Pineau
Renaud Péteri
J. Morlier
3DPC
30
12
0
20 Nov 2020
Consistency-Aware Graph Network for Human Interaction Understanding
Consistency-Aware Graph Network for Human Interaction Understanding
Zhenhua Wang
Jiajun Meng
Dongyan Guo
Jianhua Zhang
Javen Qinfeng Shi
Shengyong Chen
GNN
14
3
0
20 Nov 2020
Action Duration Prediction for Segment-Level Alignment of Weakly-Labeled
  Videos
Action Duration Prediction for Segment-Level Alignment of Weakly-Labeled Videos
Reza Ghoddoosian
S. Sayed
V. Athitsos
AI4TS
14
7
0
20 Nov 2020
HMFlow: Hybrid Matching Optical Flow Network for Small and Fast-Moving
  Objects
HMFlow: Hybrid Matching Optical Flow Network for Small and Fast-Moving Objects
Suihanjin Yu
Youming Zhang
Chen Wang
Xiao Bai
Liang Zhang
Edwin R. Hancock
27
3
0
19 Nov 2020
TRAT: Tracking by Attention Using Spatio-Temporal Features
TRAT: Tracking by Attention Using Spatio-Temporal Features
Hasan Saribas
Hakan Çevikalp
Okan Kopuklu
Bedirhan Uzun
21
25
0
18 Nov 2020
Continuous Emotion Recognition with Spatiotemporal Convolutional Neural
  Networks
Continuous Emotion Recognition with Spatiotemporal Convolutional Neural Networks
Thomas Teixeira
Eric Granger
Alessandro Lameiras Koerich
CVBM
33
9
0
18 Nov 2020
Double-Prong ConvLSTM for Spatiotemporal Occupancy Prediction in Dynamic
  Environments
Double-Prong ConvLSTM for Spatiotemporal Occupancy Prediction in Dynamic Environments
Maneekwan Toyungyernsub
Masha Itkina
Ransalu Senanayake
Mykel J. Kochenderfer
42
22
0
18 Nov 2020
Video Big Data Analytics in the Cloud: A Reference Architecture, Survey,
  Opportunities, and Open Research Issues
Video Big Data Analytics in the Cloud: A Reference Architecture, Survey, Opportunities, and Open Research Issues
A. Alam
I. Ullah
Young-Koo Lee
47
22
0
16 Nov 2020
JOLO-GCN: Mining Joint-Centered Light-Weight Information for
  Skeleton-Based Action Recognition
JOLO-GCN: Mining Joint-Centered Light-Weight Information for Skeleton-Based Action Recognition
Jinmiao Cai
Nianjuan Jiang
Xiaoguang Han
Kui Jia
Jiangbo Lu
27
84
0
16 Nov 2020
SALAD: Self-Assessment Learning for Action Detection
SALAD: Self-Assessment Learning for Action Detection
Guillaume Vaudaux-Ruth
Adrien Chan-Hon-Tong
Catherine Achard
19
8
0
13 Nov 2020
Universal Embeddings for Spatio-Temporal Tagging of Self-Driving Logs
Universal Embeddings for Spatio-Temporal Tagging of Self-Driving Logs
Sean Segal
Eric Kee
Wenjie Luo
Abbas Sadat
Ersin Yumer
R. Urtasun
30
11
0
12 Nov 2020
Transformers for One-Shot Visual Imitation
Transformers for One-Shot Visual Imitation
Sudeep Dasari
Abhinav Gupta
LM&Ro
34
91
0
11 Nov 2020
Skeleton-based Relational Reasoning for Group Activity Analysis
Skeleton-based Relational Reasoning for Group Activity Analysis
Mauricio Perez
Jun Liu
Alex C. Kot
10
43
0
11 Nov 2020
Selective Spatio-Temporal Aggregation Based Pose Refinement System:
  Towards Understanding Human Activities in Real-World Videos
Selective Spatio-Temporal Aggregation Based Pose Refinement System: Towards Understanding Human Activities in Real-World Videos
Di Yang
Rui Dai
Yaohui Wang
Rupayan Mallick
Luca Minciullo
Gianpiero Francesca
Francois Bremond
38
16
0
10 Nov 2020
Temporal Stochastic Softmax for 3D CNNs: An Application in Facial
  Expression Recognition
Temporal Stochastic Softmax for 3D CNNs: An Application in Facial Expression Recognition
T. Ayral
M. Pedersoli
Simon L Bacon
Eric Granger
CVBM
3DH
13
11
0
10 Nov 2020
Multi-modal Fusion for Single-Stage Continuous Gesture Recognition
Multi-modal Fusion for Single-Stage Continuous Gesture Recognition
Harshala Gammulle
Simon Denman
Sridha Sridharan
Clinton Fookes
SLR
32
29
0
10 Nov 2020
STCNet: Spatio-Temporal Cross Network for Industrial Smoke Detection
STCNet: Spatio-Temporal Cross Network for Industrial Smoke Detection
Yichao Cao
Qingfei Tang
Xiaobo Lu
Fan Li
Jinde Cao
22
3
0
10 Nov 2020
An Empirical Study of Visual Features for DNN based Audio-Visual Speech
  Enhancement in Multi-talker Environments
An Empirical Study of Visual Features for DNN based Audio-Visual Speech Enhancement in Multi-talker Environments
Shrishti Saha Shetu
Soumitro Chakrabarty
Emanuel Habets
28
2
0
09 Nov 2020
FlowCaps: Optical Flow Estimation with Capsule Networks For Action
  Recognition
FlowCaps: Optical Flow Estimation with Capsule Networks For Action Recognition
Vinoj Jayasundara
D. Roy
Basura Fernando
3DPC
26
3
0
08 Nov 2020
Multi-Temporal Convolutions for Human Action Recognition in Videos
Multi-Temporal Convolutions for Human Action Recognition in Videos
Alexandros Stergiou
R. Poppe
29
1
0
08 Nov 2020
Mutual Modality Learning for Video Action Classification
Mutual Modality Learning for Video Action Classification
Stepan Alekseevich Komkov
Maksim Dzabraev
Aleksandr Petiushko
27
9
0
04 Nov 2020
Deep Multimodality Learning for UAV Video Aesthetic Quality Assessment
Deep Multimodality Learning for UAV Video Aesthetic Quality Assessment
Qi Kuang
Xin Jin
Qinping Zhao
Bin Zhou
39
29
0
04 Nov 2020
S3-Net: A Fast and Lightweight Video Scene Understanding Network by
  Single-shot Segmentation
S3-Net: A Fast and Lightweight Video Scene Understanding Network by Single-shot Segmentation
Yuan Cheng
Yuchao Yang
Hai-Bao Chen
Ngai Wong
Hao Yu
3DPC
40
3
0
04 Nov 2020
PV-NAS: Practical Neural Architecture Search for Video Recognition
PV-NAS: Practical Neural Architecture Search for Video Recognition
Zihao Wang
Chen Lin
Lu Sheng
Junjie Yan
Jing Shao
ViT
25
7
0
02 Nov 2020
Actor and Action Modular Network for Text-based Video Segmentation
Actor and Action Modular Network for Text-based Video Segmentation
Jianhua Yang
Yan Huang
K. Niu
Linjiang Huang
Zhanyu Ma
Liang Wang
19
9
0
02 Nov 2020
Multimodal and self-supervised representation learning for automatic
  gesture recognition in surgical robotics
Multimodal and self-supervised representation learning for automatic gesture recognition in surgical robotics
Aniruddha Tamhane
J. Wu
Mathias Unberath
SSL
11
0
0
31 Oct 2020
Pose-based Body Language Recognition for Emotion and Psychiatric Symptom
  Interpretation
Pose-based Body Language Recognition for Emotion and Psychiatric Symptom Interpretation
Zhengyuan Yang
Amanda Kay
Yuncheng Li
Wendi F. Cross
Jiebo Luo
33
18
0
30 Oct 2020
Exploring Dynamic Context for Multi-path Trajectory Prediction
Exploring Dynamic Context for Multi-path Trajectory Prediction
Hao Cheng
Wentong Liao
Xuejiao Tang
M. Yang
Monika Sester
Bodo Rosenhahn
48
32
0
30 Oct 2020
SAR-NAS: Skeleton-based Action Recognition via Neural Architecture
  Searching
SAR-NAS: Skeleton-based Action Recognition via Neural Architecture Searching
Haoyuan Zhang
Yonghong Hou
Pichao Wang
Zihui Guo
Wanqing Li
34
15
0
29 Oct 2020
ElderSim: A Synthetic Data Generation Platform for Human Action
  Recognition in Eldercare Applications
ElderSim: A Synthetic Data Generation Platform for Human Action Recognition in Eldercare Applications
Hochul Hwang
Cheongjae Jang
Geonwoo Park
Junghyun Cho
Ig-Jae Kim
37
70
0
28 Oct 2020
Triple-view Convolutional Neural Networks for COVID-19 Diagnosis with
  Chest X-ray
Triple-view Convolutional Neural Networks for COVID-19 Diagnosis with Chest X-ray
Jianjia Zhang
19
2
0
27 Oct 2020
Object-aware Feature Aggregation for Video Object Detection
Object-aware Feature Aggregation for Video Object Detection
Qichuan Geng
Hong Zhang
Na Jiang
Xiaojuan Qi
Liangjun Zhang
Zhongjun Zhou
VOS
48
3
0
23 Oct 2020
Deep Analysis of CNN-based Spatio-temporal Representations for Action
  Recognition
Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition
Chun-Fu Chen
Yikang Shen
K. Ramakrishnan
Rogerio Feris
J. M. Cohn
A. Oliva
Quanfu Fan
23
96
0
22 Oct 2020
Two-Stream Consensus Network for Weakly-Supervised Temporal Action
  Localization
Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization
Yuanhao Zhai
Le Wang
Wei Tang
Qilin Zhang
Junsong Yuan
G. Hua
38
135
0
22 Oct 2020
BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded
  Dialogues
BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues
Hung Le
Doyen Sahoo
Nancy F. Chen
Guosheng Lin
55
30
0
20 Oct 2020
Depth Guided Adaptive Meta-Fusion Network for Few-shot Video Recognition
Depth Guided Adaptive Meta-Fusion Network for Few-shot Video Recognition
Yuqian Fu
Li Zhang
Junke Wang
Yanwei Fu
Yu-Gang Jiang
30
95
0
20 Oct 2020
A Grid-based Representation for Human Action Recognition
A Grid-based Representation for Human Action Recognition
Soufiane Lamghari
Guillaume-Alexandre Bilodeau
Nicolas Saunier
31
3
0
17 Oct 2020
Self-Selective Context for Interaction Recognition
Self-Selective Context for Interaction Recognition
Mert Kilickaya
Noureldien Hussein
E. Gavves
A. Smeulders
28
2
0
17 Oct 2020
Pose And Joint-Aware Action Recognition
Pose And Joint-Aware Action Recognition
Anshul B. Shah
Shlok Kumar Mishra
Ankan Bansal
Jun-Cheng Chen
Ramalingam Chellappa
Abhinav Shrivastava
50
33
0
16 Oct 2020
Egok360: A 360 Egocentric Kinetic Human Activity Video Dataset
Egok360: A 360 Egocentric Kinetic Human Activity Video Dataset
Keshav Bhandari
Mario A. DeLaGarza
Ziliang Zong
Hugo Latapie
Yan Yan
EgoV
21
5
0
15 Oct 2020
HS-ResNet: Hierarchical-Split Block on Convolutional Neural Network
HS-ResNet: Hierarchical-Split Block on Convolutional Neural Network
P. Yuan
Shufei Lin
Cheng Cui
Yuning Du
Ruoyu Guo
Dongliang He
Errui Ding
Shumin Han
29
43
0
15 Oct 2020
Unsupervised Video Anomaly Detection via Normalizing Flows with Implicit
  Latent Features
Unsupervised Video Anomaly Detection via Normalizing Flows with Implicit Latent Features
Myeongah Cho
Taeoh Kim
Woojin Kim
Suhwan Cho
Sangyoun Lee
22
92
0
15 Oct 2020
Pose Refinement Graph Convolutional Network for Skeleton-based Action
  Recognition
Pose Refinement Graph Convolutional Network for Skeleton-based Action Recognition
Shijie Li
Jinhui Yi
Yazan Abu Farha
Juergen Gall
14
35
0
14 Oct 2020
Back to the Future: Cycle Encoding Prediction for Self-supervised
  Contrastive Video Representation Learning
Back to the Future: Cycle Encoding Prediction for Self-supervised Contrastive Video Representation Learning
Xinyu Yang
Majid Mirmehdi
T. Burghardt
27
4
0
14 Oct 2020
Video Action Understanding
Video Action Understanding
Matthew Hutchinson
V. Gadepally
43
20
0
13 Oct 2020
Previous
123...181920...444546
Next