Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1406.2199
Cited By
v1
v2 (latest)
Two-Stream Convolutional Networks for Action Recognition in Videos
Neural Information Processing Systems (NeurIPS), 2014
9 June 2014
Karen Simonyan
Andrew Zisserman
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Two-Stream Convolutional Networks for Action Recognition in Videos"
50 / 2,338 papers shown
Title
AugmentGest: Can Random Data Cropping Augmentation Boost Gesture Recognition Performance?
Nada Aboudeshish
D. Ignatov
Radu Timofte
181
5
0
08 Jun 2025
Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric Vision
Yuping He
Yifei Huang
Guo Chen
Lidong Lu
Baoqi Pei
Jilan Xu
Tong Lu
Yoichi Sato
EgoV
361
2
0
06 Jun 2025
Efficient Egocentric Action Recognition with Multimodal Data
Marco Calzavara
Ard Kastrati
Matteo Macchini
Dushan Vasilevski
Roger Wattenhofer
EgoV
227
0
0
02 Jun 2025
3D Skeleton-Based Action Recognition: A Review
Mengyuan Liu
Hong Liu
Qianshuo Hu
Bin Ren
Junsong Yuan
Jiaying Lin
Jiajun Wen
212
2
0
01 Jun 2025
Scene Detection Policies and Keyframe Extraction Strategies for Large-Scale Video Analysis
Vasilii Korolkov
123
1
0
31 May 2025
Spatiotemporal Analysis of Forest Machine Operations Using 3D Video Classification
Maciej Wielgosz
Simon Berg
Heikki Korpunen
Stephan Hoffmann
165
0
0
30 May 2025
VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning
Liyun Zhu
Qixiang Chen
Xi Shen
Xiaodong Cun
AI4TS
LRM
202
6
0
29 May 2025
Vid-SME: Membership Inference Attacks against Large Video Understanding Models
Qi Li
Runpeng Yu
Xinchao Wang
271
4
0
29 May 2025
CA3D: Convolutional-Attentional 3D Nets for Efficient Video Activity Recognition on the Edge
Gabriele Lagani
Fabrizio Falchi
Claudio Gennaro
Giuseppe Amato
146
1
0
26 May 2025
Multi-task Learning For Joint Action and Gesture Recognition
Konstantinos Spathis
N. Kardaris
Petros Maragos
138
0
0
23 May 2025
Are Spatial-Temporal Graph Convolution Networks for Human Action Recognition Over-Parameterized?
Computer Vision and Pattern Recognition (CVPR), 2025
Jianyang Xie
Yitian Zhao
Y. Meng
He Zhao
Anh Nguyen
Yalin Zheng
215
2
0
15 May 2025
Read My Ears! Horse Ear Movement Detection for Equine Affective State Assessment
João Alves
Pia Haubro Andersen
Rikke Gade
152
0
0
06 May 2025
ZS-VCOS: Zero-Shot Video Camouflaged Object Segmentation By Optical Flow and Open Vocabulary Object Detection
Wenqi Guo
Mohamed Shehata
Shan Du
VLM
352
0
0
10 Apr 2025
Two is Better than One: Efficient Ensemble Defense for Robust and Compact Models
Computer Vision and Pattern Recognition (CVPR), 2025
Yoojin Jung
Byung Cheol Song
AAML
VLM
MQ
182
1
0
07 Apr 2025
Is Temporal Prompting All We Need For Limited Labeled Action Recognition?
Shreyank N. Gowda
Boyan Gao
Xiao Gu
Xiaobo Jin
VLM
307
0
0
02 Apr 2025
A Survey on Music Generation from Single-Modal, Cross-Modal, and Multi-Modal Perspectives
Shuyu Li
Shulei Ji
Zihao Wang
Songruoyao Wu
Jiaxing Yu
Jianchao Tan
MGen
VGen
503
3
0
01 Apr 2025
Sample-level Adaptive Knowledge Distillation for Action Recognition
Ping Li
Chenhao Ping
Wenxiao Wang
Mingli Song
326
1
0
01 Apr 2025
Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs
Computer Vision and Pattern Recognition (CVPR), 2025
Lucas Ventura
Antoine Yang
Cordelia Schmid
Gül Varol
242
2
0
31 Mar 2025
CA^2ST: Cross-Attention in Audio, Space, and Time for Holistic Video Recognition
Jongseo Lee
Joohyun Chang
Dongho Lee
Jinwoo Choi
476
0
0
30 Mar 2025
CMD-HAR: Cross-Modal Disentanglement for Wearable Human Activity Recognition
Xiaoyang Li
Siyao Li
Ying Yu
Yixuan Jiang
Hang Xiao
Jingxi Long
Haotian Tang
Chao Li
362
0
0
27 Mar 2025
BEAR: A Video Dataset For Fine-grained Behaviors Recognition Oriented with Action and Environment Factors
Chengyang Hu
Yuduo Chen
Lizhuang Ma
165
0
0
26 Mar 2025
STOP: Integrated Spatial-Temporal Dynamic Prompting for Video Understanding
Computer Vision and Pattern Recognition (CVPR), 2025
Zichen Liu
Kunlun Xu
Fuchun Sun
Xu Zou
Yuxin Peng
Jiahuan Zhou
VLM
AI4TS
385
9
0
20 Mar 2025
Long-VMNet: Accelerating Long-Form Video Understanding via Fixed Memory
Saket Gurukar
Asim Kadav
VLM
333
1
0
17 Mar 2025
Towards Scalable Modeling of Compressed Videos for Efficient Action Recognition
Shristi Das Biswas
Efstathia Soufleri
Arani Roy
Kaushik Roy
290
1
0
17 Mar 2025
Gate-Shift-Pose: Enhancing Action Recognition in Sports with Skeleton Information
Edoardo Bianchi
Oswald Lanz
3DH
239
3
0
06 Mar 2025
BdSLW401: Transformer-Based Word-Level Bangla Sign Language Recognition Using Relative Quantization Encoding (RQE)
Husne Ara Rubaiyeat
Njayou Youssouf
Md. Kamrul Hasan
H. Mahmud
SLR
227
3
0
04 Mar 2025
2DMCG:2DMambawith Change Flow Guidance for Change Detection in Remote Sensing
JunYao Kaung
HongWei Ge
Mamba
864
1
0
01 Mar 2025
Solar Multimodal Transformer: Intraday Solar Irradiance Predictor using Public Cameras and Time Series
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Yanan Niu
Roy Sarkis
D. Psaltis
Mario Paolone
Christophe Moser
Luisa Lambertini
223
3
0
28 Feb 2025
Rethinking Multimodal Learning from the Perspective of Mitigating Classification Ability Disproportion
Qingyuan Jiang
Longfei Huang
Yang Yang
252
1
0
27 Feb 2025
Reliable Multimodal Learning Via Multi-Level Adaptive DeConfusion
Tianze Zhang
Shu Shen
Chao Chen
337
0
0
27 Feb 2025
ASurvey: Spatiotemporal Consistency in Video Generation
Zhiyu Yin
Kehai Chen
Xuefeng Bai
Ruili Jiang
Junlin Li
Hongdong Li
Jin Liu
Yang Xiang
Jun Yu
Min Zhang
EGVM
VGen
AI4TS
238
0
0
25 Feb 2025
Balanced Representation Learning for Long-tailed Skeleton-based Action Recognition
Machine Intelligence Research (MIR), 2023
Hongda Liu
Yunlong Wang
Min Ren
Junxing Hu
Zhengquan Luo
Guangqi Hou
Zhe Sun
258
3
0
24 Feb 2025
Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis
Amir Hosein Fadaei
M. Dehaqani
305
0
0
11 Feb 2025
Seeing in the Dark: A Teacher-Student Framework for Dark Video Action Recognition via Knowledge Distillation and Contrastive Learning
Sharana Dharshikgan Suresh Dass
H. Barua
Ganesh Krishnasamy
Raveendran Paramesran
Raphael C.-W. Phan
331
0
0
06 Feb 2025
BRIDLE: Generalized Self-supervised Learning with Quantization
Hoang M. Nguyen
Satya Narayan Shukla
Qiang Zhang
Hanchao Yu
Sreya D. Roy
Taipeng Tian
Lingjiong Zhu
Yuchen Liu
SSL
MQ
308
0
0
04 Feb 2025
Can masking background and object reduce static bias for zero-shot action recognition?
Conference on Multimedia Modeling (MMM), 2025
Takumi Fukuzawa
Kensho Hara
Hirokatsu Kataoka
Toru Tamaki
425
4
0
22 Jan 2025
High-Performance Inference Graph Convolutional Networks for Skeleton-Based Action Recognition
Ziao Li
Junyi Wang
Bangli Liu
Haibin Cai
Mohamad Saada
Guhong Nie
3DH
267
0
0
08 Jan 2025
Multiscaled Multi-Head Attention-based Video Transformer Network for Hand Gesture Recognition
IEEE Signal Processing Letters (IEEE SPL), 2025
Mallika Garg
Debashis Ghosh
P. M. Pradhan
SLR
247
23
0
03 Jan 2025
A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames
Computer Vision and Pattern Recognition (CVPR), 2023
Pinelopi Papalampidi
Skanda Koppula
Shreya Pathak
Celine Lee
Joseph Heyward
Viorica Patraucean
Jiajun Shen
Antoine Miech
Andrew Zisserman
Aida Nematzdeh
VLM
248
38
0
31 Dec 2024
Scaling 4D Representations
João Carreira
Dilara Gokay
Michael King
Chuhan Zhang
Ignacio Rocco
...
Viorica Patraucean
Dima Damen
Pauline Luc
Mehdi S. M. Sajjadi
Andrew Zisserman
391
19
0
19 Dec 2024
CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile Devices
Andrei Znobishchev
Valerii Filev
Oleg Kudashev
Nikita Orlov
Humphrey Shi
242
1
0
17 Dec 2024
Future Aspects in Human Action Recognition: Exploring Emerging Techniques and Ethical Influences
Antonios Gasteratos
Stavros N. Moutsis
Konstantinos A. Tsintotas
Yiannis Aloimonos
167
0
0
17 Dec 2024
EdgeOAR: Real-time Online Action Recognition On Edge Devices
Wei Luo
Deyu Zhang
Ying Tang
Fan Wu
Yaoxue Zhang
215
0
0
02 Dec 2024
Learning Visual Abstract Reasoning through Dual-Stream Networks
AAAI Conference on Artificial Intelligence (AAAI), 2024
Kai Zhao
Chang Xu
Bailu Si
331
8
0
29 Nov 2024
A Novel Approach to Image Steganography Using Generative Adversarial Networks
Waheed Rehman
GAN
249
5
0
27 Nov 2024
A Survey of Recent Advances and Challenges in Deep Audio-Visual Correlation Learning
ACM Computing Surveys (ACM CSUR), 2024
Luis Vilaca
Yi Yu
Paula Vinan
442
3
0
24 Nov 2024
When Spatial meets Temporal in Action Recognition
Huajun Chen
Lei Wang
Yuxiao Chen
Tom Gedeon
Piotr Koniusz
256
3
0
22 Nov 2024
Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections
Youwei Zhou
Tianyang Xu
Cong Wu
Xiaojun Wu
J. Kittler
3DH
487
4
0
22 Nov 2024
Video-to-Task Learning via Motion-Guided Attention for Few-Shot Action Recognition
Hanyu Guo
Wanchuan Yu
Suzhou Que
Kaiwen Du
Yan Yan
Hanzi Wang
370
2
0
18 Nov 2024
Weakly-Supervised Anomaly Detection in Surveillance Videos Based on Two-Stream I3D Convolution Network
Sareh Nejad
Anwar Haque
159
5
0
13 Nov 2024
Previous
1
2
3
4
5
...
45
46
47
Next