Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1406.2199
Cited By
Two-Stream Convolutional Networks for Action Recognition in Videos
9 June 2014
Karen Simonyan
Andrew Zisserman
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Two-Stream Convolutional Networks for Action Recognition in Videos"
50 / 2,275 papers shown
Title
Video-based estimation of pain indicators in dogs
Hongyi Zhu
Y. Salgirli
P. Can
Durmucs Atilgan
A. A. Salah
38
3
0
27 Sep 2022
FuTH-Net: Fusing Temporal Relations and Holistic Features for Aerial Video Classification
P. Jin
Lichao Mou
Yuansheng Hua
Gui-Song Xia
Xiao Xiang Zhu
AI4TS
32
8
0
22 Sep 2022
Multi-level Adversarial Spatio-temporal Learning for Footstep Pressure based FoG Detection
Kun Hu
Shaohui Mei
Wei Wang
K. E. Martens
Liang Wang
S. Lewis
Dagan Feng
Zhiyong Wang
37
5
0
22 Sep 2022
Hierarchical Temporal Transformer for 3D Hand Pose Estimation and Action Recognition from Egocentric RGB Videos
Yilin Wen
Hao Pan
Lei Yang
Jia Pan
Taku Komura
Wenping Wang
54
27
0
20 Sep 2022
Mitigating Representation Bias in Action Recognition: Algorithms and Benchmarks
Haodong Duan
Yue Zhao
Kai-xiang Chen
Yu Xiong
Dahua Lin
21
7
0
20 Sep 2022
MECCANO: A Multimodal Egocentric Dataset for Humans Behavior Understanding in the Industrial-like Domain
Francesco Ragusa
Antonino Furnari
G. Farinella
EgoV
51
24
0
19 Sep 2022
Differentiable Frequency-based Disentanglement for Aerial Video Action Recognition
D. Kothandaraman
Ming-Shun Lin
Tianyi Zhou
32
6
0
15 Sep 2022
On the Surprising Effectiveness of Transformers in Low-Labeled Video Recognition
Farrukh Rahman
Ömer Mubarek
Z. Kira
ViT
18
2
0
15 Sep 2022
Action-based Early Autism Diagnosis Using Contrastive Feature Learning
Asha Rani
Pankaj Yadav
Yashaswi Verma
26
3
0
12 Sep 2022
Interpretations Steered Network Pruning via Amortized Inferred Saliency Maps
Alireza Ganjdanesh
Shangqian Gao
Heng-Chiao Huang
FAtt
AAML
35
19
0
07 Sep 2022
Real-Time Cattle Interaction Recognition via Triple-stream Network
Yang Yang
Mizuka Komatsu
K. Oyama
T. Ohkawa
33
3
0
06 Sep 2022
Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action Recognition
Tianjiao Li
Lin Geng Foo
Qiuhong Ke
Hossein Rahmani
Anran Wang
Jinghua Wang
Jing Liu
19
21
0
03 Sep 2022
Progressive Fusion for Multimodal Integration
Shiv Shankar
Laure Thompson
M. Fiterau
38
3
0
01 Sep 2022
ViA: View-invariant Skeleton Action Representation Learning via Motion Retargeting
Di Yang
Yaohui Wang
A. Dantcheva
Lorenzo Garattoni
Gianpiero Francesca
Francois Bremond
27
8
0
31 Aug 2022
Attentive pooling for Group Activity Recognition
Ding Li
Yuan Xie
Wensheng Zhang
Yongqiang Tang
Zhizhong Zhang
23
0
0
31 Aug 2022
A Circular Window-based Cascade Transformer for Online Action Detection
Shuyuan Cao
Weihua Luo
Bairui Wang
Wei Emma Zhang
Lin Ma
47
6
0
30 Aug 2022
Survey: Exploiting Data Redundancy for Optimization of Deep Learning
Jou-An Chen
Wei Niu
Bin Ren
Yanzhi Wang
Xipeng Shen
25
24
0
29 Aug 2022
Actor-identified Spatiotemporal Action Detection -- Detecting Who Is Doing What in Videos
Fan Yang
Norimichi Ukita
S. Sakti
Satoshi Nakamura
24
0
0
27 Aug 2022
Adaptive Perception Transformer for Temporal Action Localization
Yizheng Ouyang
Tianjin Zhang
Weibo Gu
Hongfa Wang
26
3
0
25 Aug 2022
Lane Change Classification and Prediction with Action Recognition Networks
Kai-Bin Liang
Jun Wang
A. Bhalerao
24
2
0
24 Aug 2022
Modality Mixer for Multi-modal Action Recognition
Sumin Lee
Sangmin Woo
Yeonju Park
Muhammad Adi Nugroho
Changick Kim
26
10
0
24 Aug 2022
Identifying Auxiliary or Adversarial Tasks Using Necessary Condition Analysis for Adversarial Multi-task Video Understanding
Stephen Su
Sam Kwong
Qingyu Zhao
De-An Huang
Juan Carlos Niebles
Ehsan Adeli
27
0
0
22 Aug 2022
Hierarchical Compositional Representations for Few-shot Action Recognition
Chang-bo Li
Jie Zhang
Shuzhe Wu
Xin Jin
Shiguang Shan
32
20
0
19 Aug 2022
Motion Sensitive Contrastive Learning for Self-supervised Video Representation
Jingcheng Ni
Nana Zhou
Jie Qin
Qianrun Wu
Junqi Liu
Boxun Li
Di Huang
SSL
47
16
0
12 Aug 2022
Human Activity Recognition Using Cascaded Dual Attention CNN and Bi-Directional GRU Framework
Hayat Ullah
Arslan Munir
HAI
29
27
0
09 Aug 2022
BabyNet: A Lightweight Network for Infant Reaching Action Recognition in Unconstrained Environments to Support Future Pediatric Rehabilitation Applications
Amel Dechemi
Vikarn Bhakri
Ipsita Sahin
Arjun Modi
Julya Mestas
Pamodya Peiris
Dannya Enriquez Barrundia
Elena Kokkoni
Konstantinos Karydis
30
10
0
09 Aug 2022
Video-based Human Action Recognition using Deep Learning: A Review
Hieu H. Pham
L. Khoudour
Alain Crouzil
Pablo Zegers
S. Velastín
35
34
0
07 Aug 2022
Frozen CLIP Models are Efficient Video Learners
Ziyi Lin
Shijie Geng
Renrui Zhang
Peng Gao
Gerard de Melo
Xiaogang Wang
Jifeng Dai
Yu Qiao
Hongsheng Li
CLIP
VLM
25
200
0
06 Aug 2022
AFE-CNN: 3D Skeleton-based Action Recognition with Action Feature Enhancement
S. Guan
Haiyan Lu
Linchao Zhu
Gengfa Fang
21
20
0
06 Aug 2022
Two-Stream Transformer Architecture for Long Video Understanding
Edward Fish
Jon Weinbren
Andrew Gilbert
ViT
33
6
0
02 Aug 2022
Video Question Answering with Iterative Video-Text Co-Tokenization
A. Piergiovanni
K. Morton
Weicheng Kuo
Michael S. Ryoo
A. Angelova
39
18
0
01 Aug 2022
Two-Stream UNET Networks for Semantic Segmentation in Medical Images
Xin Chen
Ke Ding
16
0
0
27 Jul 2022
Spatiotemporal Self-attention Modeling with Temporal Patch Shift for Action Recognition
Wangmeng Xiang
Chong Li
Biao Wang
Xihan Wei
Xiangpei Hua
Lei Zhang
ViT
30
27
0
27 Jul 2022
Compositional Human-Scene Interaction Synthesis with Semantic Control
Kaifeng Zhao
Shaofei Wang
Yan Zhang
Thabo Beeler
Siyu Tang
35
65
0
26 Jul 2022
Bodily Behaviors in Social Interaction: Novel Annotations and State-of-the-Art Evaluation
Michal Balazia
Philippe Muller
Ákos Levente Tánczos
A. V. Liechtenstein
Franccois Brémond
22
22
0
26 Jul 2022
P2ANet: A Dataset and Benchmark for Dense Action Detection from Table Tennis Match Broadcasting Videos
Jiang Bian
Xuhong Li
Tao Wang
Qingzhong Wang
Jun Huang
Chen Liu
Jun Zhao
Feixiang Lu
Dejing Dou
Haoyi Xiong
31
10
0
26 Jul 2022
Intelligent 3D Network Protocol for Multimedia Data Classification using Deep Learning
A. Syed
Eman A. Aldhahri
M. Iqbal
Abid Ali
Ammar Muthanna
Harun Jamil
F. Jamil
3DH
23
2
0
23 Jul 2022
Zero-Shot Video Captioning with Evolving Pseudo-Tokens
Yoad Tewel
Yoav Shalev
Roy Nadler
Idan Schwartz
Lior Wolf
37
27
0
22 Jul 2022
An Efficient Spatio-Temporal Pyramid Transformer for Action Detection
Yuetian Weng
Zizheng Pan
Mingfei Han
Xiaojun Chang
Bohan Zhuang
ViT
19
25
0
21 Jul 2022
GOCA: Guided Online Cluster Assignment for Self-Supervised Video Representation Learning
Huseyin Coskun
Alireza Zareian
Joshua L. Moore
F. Tombari
Chen Wang
SSL
53
3
0
20 Jul 2022
A Generalized & Robust Framework For Timestamp Supervision in Temporal Action Segmentation
R. Rahaman
Dipika Singhania
Alexandre Hoang Thiery
Angela Yao
33
2
0
20 Jul 2022
Is an Object-Centric Video Representation Beneficial for Transfer?
Chuhan Zhang
Ankush Gupta
Andrew Zisserman
ViT
37
27
0
20 Jul 2022
ViGAT: Bottom-up event recognition and explanation in video using factorized graph attention network
Nikolaos Gkalelis
Dimitrios Daskalakis
Vasileios Mezaris
19
10
0
20 Jul 2022
Learning Sequence Representations by Non-local Recurrent Neural Memory
Wenjie Pei
Xin Feng
Canmiao Fu
Qi Cao
Guangming Lu
Yu-Wing Tai
AI4TS
27
1
0
20 Jul 2022
Time Is MattEr: Temporal Self-supervision for Video Transformers
Sukmin Yun
Jaehyung Kim
Dongyoon Han
Hwanjun Song
Jung-Woo Ha
Jinwoo Shin
ViT
19
12
0
19 Jul 2022
Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based Action Recognition
Yansong Tang
Xingyu Liu
Xumin Yu
Danyang Zhang
Jiwen Lu
Jie Zhou
27
20
0
17 Jul 2022
Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models
Rui Qian
Yeqing Li
Zheng Xu
Ming-Hsuan Yang
Serge Belongie
Huayu Chen
VLM
41
22
0
15 Jul 2022
Is Appearance Free Action Recognition Possible?
Filip Ilic
Thomas Pock
Richard P. Wildes
19
15
0
13 Jul 2022
Robotic Detection of a Human-Comprehensible Gestural Language for Underwater Multi-Human-Robot Collaboration
Sadman Sakib Enan
Michael Fulton
Junaed Sattar
39
8
0
12 Jul 2022
Trusted Multi-Scale Classification Framework for Whole Slide Image
Ming Feng
Kele Xu
Na Wu
Weiquan Huang
Yan Bai
Changjian Wang
Huaimin Wang
45
5
0
12 Jul 2022
Previous
1
2
3
...
8
9
10
...
44
45
46
Next