Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1406.2199
Cited By
Two-Stream Convolutional Networks for Action Recognition in Videos
9 June 2014
Karen Simonyan
Andrew Zisserman
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Two-Stream Convolutional Networks for Action Recognition in Videos"
50 / 2,275 papers shown
Title
Efficient Human Vision Inspired Action Recognition using Adaptive Spatiotemporal Sampling
Khoi-Nguyen C. Mac
Minh Do
Minh Vo
TTA
19
1
0
12 Jul 2022
Pixel-level Correspondence for Self-Supervised Learning from Video
Yash Sharma
Yi Zhu
Chris Russell
Thomas Brox
SSL
25
4
0
08 Jul 2022
GraphVid: It Only Takes a Few Nodes to Understand a Video
Eitan Kosman
Dotan Di Castro
GNN
43
5
0
04 Jul 2022
OS-MSL: One Stage Multimodal Sequential Link Framework for Scene Segmentation and Classification
Ye Liu
Lingfeng Qiao
Di Yin
Zhuoxuan Jiang
Xinghua Jiang
Deqiang Jiang
Bo Ren
23
7
0
04 Jul 2022
Automated Classification of General Movements in Infants Using a Two-stream Spatiotemporal Fusion Network
Yuki Hashimoto
Akira Furui
K. Shimatani
M. Casadio
P. Moretti
P. Morasso
Toshio Tsuji
8
3
0
04 Jul 2022
Exploring Temporally Dynamic Data Augmentation for Video Recognition
Taeoh Kim
Jinhyung Kim
Minho Shim
Sangdoo Yun
Myunggu Kang
Dongyoon Wee
Sangyoun Lee
AI4TS
23
10
0
30 Jun 2022
A Comprehensive Survey on Deep Gait Recognition: Algorithms, Datasets and Challenges
Chuanfu Shen
Shiqi Yu
Jilong Wang
George Q. Huang
Liang Wang
CVBM
47
54
0
28 Jun 2022
Programmatic Concept Learning for Human Motion Description and Synthesis
Sumith Kulal
Jiayuan Mao
A. Aiken
Jiajun Wu
33
7
0
27 Jun 2022
Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition
Zhan Chen
Sicheng Li
Bing Yang
Qinghan Li
Hong Liu
35
255
0
27 Jun 2022
VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning
Kashu Yamazaki
Sang Truong
Khoa T. Vo
Michael Kidd
Chase Rainwater
Khoa Luu
Ngan Le
VLM
CoGe
13
25
0
26 Jun 2022
Review on Social Behavior Analysis of Laboratory Animals: From Methodologies to Applications
Ziping Jiang
Paul L. Chazot
Richard Jiang
33
1
0
25 Jun 2022
SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos
S. H. Khorasgani
Yuxuan Chen
Florian Shkurti
SSL
57
23
0
25 Jun 2022
Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization
Kun Xia
Le Wang
Sanping Zhou
Nanning Zheng
Wei Tang
44
36
0
23 Jun 2022
Motion Gait: Gait Recognition via Motion Excitation
Yunpeng Zhang
Zhengyou Wang
Shanna Zhuang
Hui Wang
CVBM
16
1
0
22 Jun 2022
Bi-Calibration Networks for Weakly-Supervised Video Representation Learning
Fuchen Long
Ting Yao
Zhaofan Qiu
Xinmei Tian
Jiebo Luo
Tao Mei
46
6
0
21 Jun 2022
Pyramid Region-based Slot Attention Network for Temporal Action Proposal Generation
Shuaicheng Li
Feng Zhang
Ruiwei Zhao
Rui Feng
Kunlin Yang
Lin-Na Liu
Jun Hou
ViT
31
5
0
21 Jun 2022
One-stage Action Detection Transformer
Lijun Li
Lian Zhuo
Bangyin Zhang
ViT
34
0
0
21 Jun 2022
M&M Mix: A Multimodal Multiview Transformer Ensemble
Xuehan Xiong
Anurag Arnab
Arsha Nagrani
Cordelia Schmid
ViT
23
19
0
20 Jun 2022
Scalable Temporal Localization of Sensitive Activities in Movies and TV Episodes
Xiang Hao
Jingxiang Chen
Shixing Chen
Ahmed Saad
Raffay Hamid
AI4TS
28
0
0
16 Jun 2022
OmniMAE: Single Model Masked Pretraining on Images and Videos
Rohit Girdhar
Alaaeldin El-Nouby
Mannat Singh
Kalyan Vasudev Alwala
Armand Joulin
Ishan Misra
ViT
42
98
0
16 Jun 2022
Stand-Alone Inter-Frame Attention in Video Models
Fuchen Long
Zhaofan Qiu
Yingwei Pan
Ting Yao
Jiebo Luo
Tao Mei
ViT
33
46
0
14 Jun 2022
RF-Next: Efficient Receptive Field Search for Convolutional Neural Networks
Shanghua Gao
Zhong-Yu Li
Qi Han
Ming-Ming Cheng
Liang Wang
42
35
0
14 Jun 2022
Real-time Hyper-Dimensional Reconfiguration at the Edge using Hardware Accelerators
Indhumathi Kandaswamy
Saurabh Farkya
Z. Daniels
G. V. D. Wal
Aswin Raghavan
...
Jun Hu
M. Lomnitz
M. Isnardi
David C. Zhang
M. Piacentino
BDL
17
3
0
10 Jun 2022
Dual Windows Are Significant: Learning from Mediastinal Window and Focusing on Lung Window
Qiuli Wang
Xin Tan
Chen Liu
26
0
0
08 Jun 2022
Depth Estimation Matters Most: Improving Per-Object Depth Estimation for Monocular 3D Detection and Tracking
Longlong Jing
Ruichi Yu
Henrik Kretzschmar
Kang Li
C. Qi
...
Yingwei Li
Yurong You
Han Deng
Congcong Li
Drago Anguelov
3DPC
MDE
42
18
0
08 Jun 2022
A Deeper Dive Into What Deep Spatiotemporal Networks Encode: Quantifying Static vs. Dynamic Information
M. Kowal
Mennatullah Siam
Md. Amirul Islam
Neil D. B. Bruce
Richard P. Wildes
Konstantinos G. Derpanis
26
25
0
06 Jun 2022
3D Convolutional with Attention for Action Recognition
Labina Shrestha
Shikha Dubey
Farrukh Olimov
M. Rafique
M. Jeon
29
0
0
05 Jun 2022
Revisiting the "Video" in Video-Language Understanding
S. Buch
Cristobal Eyzaguirre
Adrien Gaidon
Jiajun Wu
L. Fei-Fei
Juan Carlos Niebles
41
158
0
03 Jun 2022
A Survey on Video Action Recognition in Sports: Datasets, Methods and Applications
Fei Wu
Qingzhong Wang
Jian Bian
Haoyi Xiong
Ning Ding
Feixiang Lu
Junqing Cheng
Dejing Dou
AI4TS
44
53
0
02 Jun 2022
Dual-stream spatiotemporal networks with feature sharing for monitoring animals in the home cage
Ezechukwu I. Nwokedi
R. Bains
L. Bidaut
Xujiong Ye
Sara Wells
James M. Brown
27
2
0
01 Jun 2022
IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation
Lingtong Kong
Boyuan Jiang
Donghao Luo
Wenqing Chu
Xiaoming Huang
Ying Tai
Chengjie Wang
Jie Yang
76
146
0
29 May 2022
PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences
Hehe Fan
Xin Yu
Yuhang Ding
Yi Yang
Mohan Kankanhalli
3DPC
131
111
0
27 May 2022
Cross-Architecture Self-supervised Video Representation Learning
Sheng Guo
Zihua Xiong
Yujie Zhong
Limin Wang
Xiaobo Guo
Bing Han
Weilin Huang
SSL
AI4TS
76
24
0
26 May 2022
Learning Muti-expert Distribution Calibration for Long-tailed Video Classification
Yufan Hu
Junyu Gao
Changsheng Xu
19
4
0
22 May 2022
Scalable and Efficient Training of Large Convolutional Neural Networks with Differential Privacy
Zhiqi Bu
Jialin Mao
Shiyun Xu
139
48
0
21 May 2022
Action parsing using context features
N. Mehrseresht
22
0
0
20 May 2022
PillarNet: Real-Time and High-Performance Pillar-based 3D Object Detection
Guang-Hui Shi
Ruifeng Li
Chaoxiang Ma
3DPC
74
134
0
16 May 2022
Deep Decomposition and Bilinear Pooling Network for Blind Night-Time Image Quality Evaluation
Qiuping Jiang
Jiawu Xu
Yudong Mao
Wei Zhou
Xiongkuo Min
Guangtao Zhai
24
4
0
12 May 2022
Past and Future Motion Guided Network for Audio Visual Event Localization
Ting-Yen Chen
Jianqin Yin
Jin Tang
26
2
0
08 May 2022
Representation Learning for Compressed Video Action Recognition via Attentive Cross-modal Interaction with Motion Enhancement
Bing Li
Jiaxin Chen
Dongming Zhang
Xiuguo Bao
Di Huang
23
15
0
07 May 2022
BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection
Mingdong Yang
Guo Chen
Yin-Dong Zheng
Tong Lu
Limin Wang
46
45
0
05 May 2022
Deep Neural Network approaches for Analysing Videos of Music Performances
F. Liwicki
Richa Upadhyay
Prakash Chandra Chhipa
Killian Murphy
F. Visi
S. Östersjö
Marcus Liwicki
21
1
0
05 May 2022
CoCa: Contrastive Captioners are Image-Text Foundation Models
Jiahui Yu
Zirui Wang
Vijay Vasudevan
Legg Yeung
Mojtaba Seyedhosseini
Yonghui Wu
VLM
CLIP
OffRL
85
1,265
0
04 May 2022
In Defense of Image Pre-Training for Spatiotemporal Recognition
Xianhang Li
Huiyu Wang
Chen Wei
Jieru Mei
Alan Yuille
Yuyin Zhou
Cihang Xie
30
0
0
03 May 2022
Exposing Deepfake Face Forgeries with Guided Residuals
Zhiqing Guo
Gaobo Yang
Jiyou Chen
Xingming Sun
CVBM
31
26
0
02 May 2022
Preserve Pre-trained Knowledge: Transfer Learning With Self-Distillation For Action Recognition
Yang Zhou
Zhanhao He
Ke Lu
Guanhong Wang
Gaoang Wang
CLL
SLR
19
2
0
01 May 2022
RADNet: A Deep Neural Network Model for Robust Perception in Moving Autonomous Systems
B. Mudassar
Sho Ko
Maojingjing Li
Priyabrata Saha
Saibal Mukhopadhyay
16
2
0
30 Apr 2022
Self-supervised Contrastive Learning for Audio-Visual Action Recognition
Yang Liu
Y. Tan
Haoyu Lan
SSL
47
6
0
28 Apr 2022
Human-Centered Prior-Guided and Task-Dependent Multi-Task Representation Learning for Action Recognition Pre-Training
Guanhong Wang
Ke Lu
Yang Zhou
Zhanhao He
Gaoang Wang
SSL
32
3
0
27 Apr 2022
Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Han Cai
Ji Lin
Chengyue Wu
Zhijian Liu
Haotian Tang
Hanrui Wang
Ligeng Zhu
Song Han
29
108
0
25 Apr 2022
Previous
1
2
3
...
9
10
11
...
44
45
46
Next