1406.2199
Two-Stream Convolutional Networks for Action Recognition in Videos
Neural Information Processing Systems (NeurIPS), 2014
9 June 2014
Karen Simonyan
Andrew Zisserman
Papers citing "Two-Stream Convolutional Networks for Action Recognition in Videos"
50 / 2,340 papers shown
Beyond still images: Temporal features and input variance resilience
Scientific Reports (Sci Rep), 2023
AmirHosein Fadaei
M. Dehaqani
270
0
0
01 Nov 2023
1DFormer: a Transformer Architecture Learning 1D Landmark Representations for Facial Landmark Tracking
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Shi Yin
Shijie Huan
Shangfei Wang
Jinshui Hu
Tao Guo
Bing Yin
Baocai Yin
Cong Liu
ViT
151
0
0
01 Nov 2023
CHAMMI: A benchmark for channel-adaptive models in microscopy imaging
Neural Information Processing Systems (NeurIPS), 2023
Zitong S. Chen
Chau Pham
Siqi Wang
Michael Doron
Nikita Moshkov
Bryan A. Plummer
Juan C. Caicedo
184
14
0
30 Oct 2023
On the Relevance of Temporal Features for Medical Ultrasound Video Recognition
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2023
D. H. Smith
J. P. Lineberger
G. H. Baker
169
4
0
16 Oct 2023
Watt For What: Rethinking Deep Learning's Energy-Performance Relationship
Shreyank N. Gowda
Xinyue Hao
Gen Li
Laura Sevilla-Lara
Shashank Narayana Gowda
HAI
179
17
0
10 Oct 2023
LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
International Conference on Learning Representations (ICLR), 2023
Bin Zhu
Bin Lin
Munan Ning
Yang Yan
Jiaxi Cui
...
Zongwei Li
Wancai Zhang
Zhifeng Li
Wei Liu
Liejie Yuan
VLM
MLLM
740
336
0
03 Oct 2023
Telling Stories for Common Sense Zero-Shot Action Recognition
Asian Conference on Computer Vision (ACCV), 2023
Shreyank N. Gowda
Carolina Scarton
LM&Ro
195
3
0
29 Sep 2023
A Survey on Deep Learning Techniques for Action Anticipation
Zeyun Zhong
Manuel Martin
Michael Voit
Juergen Gall
Jürgen Beyerer
300
15
0
29 Sep 2023
Training a Large Video Model on a Single Machine in a Day
Yue Zhao
Philipp Krahenbuhl
VLM
273
23
0
28 Sep 2023
Local Compressed Video Stream Learning for Generic Event Boundary Detection
International Journal of Computer Vision (IJCV), 2023
Libo Zhang
Xin Gu
Congcong Li
Tiejian Luo
Hengrui Fan
228
6
0
27 Sep 2023
CPR-Coach: Recognizing Composite Error Actions based on Single-class Training
Computer Vision and Pattern Recognition (CVPR), 2023
Shunli Wang
Qing Yu
Shuai Wang
Dingkang Yang
Liuzhen Su
Xiao Zhao
Haopeng Kuang
Pei Zhang
Peng Zhai
Lihua Zhang
345
5
0
21 Sep 2023
Exploring Self-supervised Skeleton-based Action Recognition in Occluded Environments
Yifei Chen
Kunyu Peng
Alina Roitberg
David Schneider
Kailai Li
Junwei Zheng
R. Liu
Yufan Chen
Kailun Yang
Rainer Stiefelhagen
318
0
0
21 Sep 2023
SkeleTR: Towards Skeleton-based Action Recognition in the Wild
Haodong Duan
Mingze Xu
Bing Shuai
Davide Modolo
Zhuowen Tu
Joseph Tighe
Alessandro Bergamo
ViT
245
1
0
20 Sep 2023
Selective Volume Mixup for Video Action Recognition
Yi Tan
Zhaofan Qiu
Y. Hao
Ting Yao
Xiangnan He
Tao Mei
ViT
212
4
0
18 Sep 2023
Emerging Approaches for THz Array Imaging: A Tutorial Review and Software Tool
Josiah W. Smith
Murat Torlak
187
1
0
16 Sep 2023
Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
IEEE International Conference on Computer Vision (ICCV), 2023
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Yingya Zhang
Changxin Gao
Deli Zhao
Nong Sang
216
31
0
14 Sep 2023
Judging a video by its bitstream cover
Data Compression Conference (DCC), 2023
Yuxing Han
Yunan Ding
Jiangtao Wen
Chen Ye Gan
117
0
0
14 Sep 2023
TransNet: A Transfer Learning-Based Network for Human Action Recognition
International Conference on Machine Learning and Applications (ICMLA), 2023
Khaled Alomar
Xiaohao Cai
236
1
0
13 Sep 2023
Enhancing multimodal cooperation via sample-level modality valuation
Computer Vision and Pattern Recognition (CVPR), 2023
Yake Wei
Ruoxuan Feng
Zihe Wang
Di Hu
468
50
0
12 Sep 2023
EgoPCA: A New Framework for Egocentric Hand-Object Interaction Understanding
IEEE International Conference on Computer Vision (ICCV), 2023
Yue Xu
Yong-Lu Li
Zhemin Huang
Michael Xu Liu
Cewu Lu
Yu-Wing Tai
Chi-Keung Tang
EgoV
175
12
0
05 Sep 2023
SOAR: Scene-debiasing Open-set Action Recognition
IEEE International Conference on Computer Vision (ICCV), 2023
Yuanhao Zhai
Ziyi Liu
Zhenyu Wu
Yi Wu
Chunluan Zhou
David Doermann
Junsong Yuan
Gang Hua
316
13
0
03 Sep 2023
Towards Contrastive Learning in Music Video Domain
Karel Veldkamp
Mariya Hendriksen
Zoltán Szlávik
Alexander Keijser
SSL
210
3
0
01 Sep 2023
Uncovering the Unseen: Discover Hidden Intentions by Micro-Behavior Graph Reasoning
ACM Multimedia (ACM MM), 2023
Zhuo Zhou
Wenxuan Liu
Danni Xu
Zheng Wang
Jian Zhao
203
9
0
29 Aug 2023
Evaluation of Key Spatiotemporal Learners for Print Track Anomaly Classification Using Melt Pool Image Streams
IFAC-PapersOnLine, 2023
Lynn Cherif
Mutahar Safdar
Guy Lamouche
P. Wanjara
P. Paul
G. Wood
Max Zimmermann
F. Hannesen
Yao Zhao
103
4
0
28 Aug 2023
LAC: Latent Action Composition for Skeleton-based Action Segmentation
IEEE International Conference on Computer Vision (ICCV), 2023
Di Yang
Yaohui Wang
A. Dantcheva
Quan Kong
Lorenzo Garattoni
Gianpiero Francesca
Francois Bremond
538
18
0
28 Aug 2023
Improving Video Violence Recognition with Human Interaction Learning on 3D Skeleton Point Clouds
Qingxin Xiao
Guosheng Lin
Qingyao Wu
3DH
3DPC
196
5
0
26 Aug 2023
TriGait: Aligning and Fusing Skeleton and Silhouette Gait Data via a Tri-Branch Network
Yangwei Sun
Xu Feng
Liyan Ma
Long Hu
Mark Nixon
CVBM
271
9
0
25 Aug 2023
AccFlow: Backward Accumulation for Long-Range Optical Flow
IEEE International Conference on Computer Vision (ICCV), 2023
Guangyang Wu
Xiaohong Liu
Kunming Luo
Xi Liu
Qingqing Zheng
Shuaicheng Liu
Xinyang Jiang
Guangtao Zhai
Wenyi Wang
155
27
0
25 Aug 2023
MOFO: MOtion FOcused Self-Supervision for Video Understanding
Mona Ahmadian
Frank Guerin
Andrew Gilbert
307
4
0
23 Aug 2023
Sign Language Translation with Iterative Prototype
IEEE International Conference on Computer Vision (ICCV), 2023
Huijie Yao
Wen-gang Zhou
Hao Feng
Hezhen Hu
Hao Zhou
Houqiang Li
SLR
85
26
0
23 Aug 2023
Temporal-Distributed Backdoor Attack Against Video Based Action Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2023
Xi Li
Songhe Wang
Rui Huang
Mahanth K. Gowda
G. Kesidis
AAML
380
7
0
21 Aug 2023
MGMAE: Motion Guided Masking for Video Masked Autoencoding
IEEE International Conference on Computer Vision (ICCV), 2023
Bingkun Huang
Zhiyu Zhao
Guozhen Zhang
Yu Qiao
Limin Wang
151
50
0
21 Aug 2023
Visual Crowd Analysis: Open Research Problems
The AI Magazine (AI Mag.), 2023
Muhammad Asif Khan
Hamid Menouar
R. Hamila
HAI
287
9
0
21 Aug 2023
Joint learning of images and videos with a single Vision Transformer
Shuki Shimizu
Toru Tamaki
ViT
181
0
0
21 Aug 2023
Spatial-Temporal Alignment Network for Action Recognition
Jinhui Ye
Junwei Liang
3DPC
161
2
0
19 Aug 2023
Unlimited Knowledge Distillation for Action Recognition in the Dark
Ruibing Jin
Guosheng Lin
Ruibing Jin
Jie Lin
Zhengguo Li
Xiaoli Li
Zhenghua Chen
155
2
0
18 Aug 2023
Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos
IEEE International Conference on Computer Vision (ICCV), 2023
Zhiqiang Shen
Xiaoxiao Sheng
Hehe Fan
Longguang Wang
Yike Guo
Qiong Liu
Hao Wen
Xiaoping Zhou
3DPC
149
20
0
18 Aug 2023
Event-Guided Procedure Planning from Instructional Videos with Text Supervision
IEEE International Conference on Computer Vision (ICCV), 2023
Ante Wang
Kun-Li Channing Lin
Jiachen Du
Jingke Meng
Wei-Shi Zheng
144
18
0
17 Aug 2023
FOLT: Fast Multiple Object Tracking from UAV-captured Videos Based on Optical Flow
ACM Multimedia (ACM MM), 2023
Mufeng Yao
Jiaqi Wang
Jinlong Peng
M. Chi
Chao Liu
VOT
144
27
0
14 Aug 2023
Vision Backbone Enhancement via Multi-Stage Cross-Scale Attention
Liang Shang
Yanli Liu
Zhengyang Lou
Shuxue Quan
N. Adluru
Bochen Guan
W. Sethares
297
5
0
10 Aug 2023
Temporally-Adaptive Models for Efficient Video Understanding
Ziyuan Huang
Shiwei Zhang
Liang Pan
Zhiwu Qing
Yingya Zhang
Ziwei Liu
Marcelo H. Ang
205
17
0
10 Aug 2023
JEDI: Joint Expert Distillation in a Semi-Supervised Multi-Dataset Student-Teacher Scenario for Video Action Recognition
L. Bicsi
B. Alexe
Radu Tudor Ionescu
Marius Leordeanu
257
2
0
09 Aug 2023
View while Moving: Efficient Video Recognition in Long-untrimmed Videos
ACM Multimedia (ACM MM), 2023
Ye Tian
Meng Yang
Lanshan Zhang
Zhizhen Zhang
Yang Liu
Xiao-Zhu Xie
Xirong Que
Wendong Wang
261
10
0
09 Aug 2023
ViLP: Knowledge Exploration using Vision, Language, and Pose Embeddings for Video Action Recognition
Indian Conference on Computer Vision, Graphics & Image Processing (ICVGIP), 2023
S. Chaudhuri
Saumik Bhattacharya
175
5
0
07 Aug 2023
Incorporating Pre-training Data Matters in Unsupervised Domain Adaptation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yinsong Xu
Aidong Men
Yang Liu
Xiahai Zhuang
Qingchao Chen
313
3
0
06 Aug 2023
SkateboardAI: The Coolest Video Action Recognition for Skateboarding
AAAI Conference on Artificial Intelligence (AAAI), 2023
Hanxiao Chen
ViT
118
4
0
02 Aug 2023
Sample Less, Learn More: Efficient Action Recognition via Frame Feature Restoration
ACM Multimedia (ACM MM), 2023
Harry Cheng
Yangyang Guo
Liqiang Nie
Zhiyong Cheng
Mohan S. Kankanhalli
220
9
0
27 Jul 2023
Unlocking the Emotional World of Visual Media: An Overview of the Science, Research, and Impact of Understanding Emotion
Proceedings of the IEEE (Proc. IEEE), 2023
James Z. Wang
Sicheng Zhao
Chenyan Wu
Reginald B. Adams
M. Newman
T. Shafir
Rachelle Tsachor
335
54
0
25 Jul 2023
Keyword-Aware Relative Spatio-Temporal Graph Networks for Video Question Answering
IEEE transactions on multimedia (IEEE TMM), 2023
Yi Cheng
Hehe Fan
Dongyun Lin
Ying Sun
Mohan S. Kankanhalli
J. Lim
211
10
0
25 Jul 2023
On the Connection between Pre-training Data Diversity and Fine-tuning Robustness
Neural Information Processing Systems (NeurIPS), 2023
Vivek Ramanujan
Thao Nguyen
Sewoong Oh
Ludwig Schmidt
Ali Farhadi
OOD
190
33
0
24 Jul 2023