ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1406.2199
  4. Cited By
Two-Stream Convolutional Networks for Action Recognition in Videos

Two-Stream Convolutional Networks for Action Recognition in Videos

9 June 2014
Karen Simonyan
Andrew Zisserman
ArXivPDFHTML

Papers citing "Two-Stream Convolutional Networks for Action Recognition in Videos"

50 / 2,275 papers shown
Title
Are Spatial-Temporal Graph Convolution Networks for Human Action Recognition Over-Parameterized?
Are Spatial-Temporal Graph Convolution Networks for Human Action Recognition Over-Parameterized?
Jianyang Xie
Yitian Zhao
Y. Meng
He Zhao
Anh Nguyen
Yalin Zheng
22
0
0
15 May 2025
Read My Ears! Horse Ear Movement Detection for Equine Affective State Assessment
Read My Ears! Horse Ear Movement Detection for Equine Affective State Assessment
João Alves
Pia Haubro Andersen
Rikke Gade
37
0
0
06 May 2025
ZS-VCOS: Zero-Shot Outperforms Supervised Video Camouflaged Object Segmentation
ZS-VCOS: Zero-Shot Outperforms Supervised Video Camouflaged Object Segmentation
Wenqi Guo
Shan Du
VLM
62
0
0
10 Apr 2025
Two is Better than One: Efficient Ensemble Defense for Robust and Compact Models
Two is Better than One: Efficient Ensemble Defense for Robust and Compact Models
Yoojin Jung
Byung Cheol Song
AAML
VLM
MQ
41
0
0
07 Apr 2025
Is Temporal Prompting All We Need For Limited Labeled Action Recognition?
Is Temporal Prompting All We Need For Limited Labeled Action Recognition?
Shreyank N. Gowda
Boyan Gao
Xiao Gu
Xiaobo Jin
VLM
47
0
0
02 Apr 2025
A Survey on Music Generation from Single-Modal, Cross-Modal, and Multi-Modal Perspectives
A Survey on Music Generation from Single-Modal, Cross-Modal, and Multi-Modal Perspectives
Shuyu Li
Shulei Ji
Zihao Wang
Songruoyao Wu
Jiaxing Yu
Kaipeng Zhang
MGen
VGen
73
1
0
01 Apr 2025
Sample-level Adaptive Knowledge Distillation for Action Recognition
Sample-level Adaptive Knowledge Distillation for Action Recognition
Ping Li
Chenhao Ping
Wenxiao Wang
Mingli Song
49
0
0
01 Apr 2025
Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs
Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs
Lucas Ventura
Antoine Yang
Cordelia Schmid
Gül Varol
41
0
0
31 Mar 2025
CA^2ST: Cross-Attention in Audio, Space, and Time for Holistic Video Recognition
CA^2ST: Cross-Attention in Audio, Space, and Time for Holistic Video Recognition
Jongseo Lee
Joohyun Chang
Dongho Lee
Jinwoo Choi
56
0
0
30 Mar 2025
CMD-HAR: Cross-Modal Disentanglement for Wearable Human Activity Recognition
CMD-HAR: Cross-Modal Disentanglement for Wearable Human Activity Recognition
Hanyu Liu
Siyao Li
Ying Yu
Yixuan Jiang
Hang Xiao
Jingxi Long
Haotian Tang
51
0
0
27 Mar 2025
BEAR: A Video Dataset For Fine-grained Behaviors Recognition Oriented with Action and Environment Factors
BEAR: A Video Dataset For Fine-grained Behaviors Recognition Oriented with Action and Environment Factors
Chengyang Hu
Yuduo Chen
Lizhuang Ma
81
0
0
26 Mar 2025
STOP: Integrated Spatial-Temporal Dynamic Prompting for Video Understanding
STOP: Integrated Spatial-Temporal Dynamic Prompting for Video Understanding
Zichen Liu
Kunlun Xu
Fuchun Sun
Xu Zou
Yuxin Peng
Jiahuan Zhou
VLM
AI4TS
71
1
0
20 Mar 2025
Towards Scalable Modeling of Compressed Videos for Efficient Action Recognition
Towards Scalable Modeling of Compressed Videos for Efficient Action Recognition
Shristi Das Biswas
Efstathia Soufleri
Arani Roy
Kaushik Roy
59
0
0
17 Mar 2025
Long-VMNet: Accelerating Long-Form Video Understanding via Fixed Memory
Long-VMNet: Accelerating Long-Form Video Understanding via Fixed Memory
Saket Gurukar
Asim Kadav
VLM
55
0
0
17 Mar 2025
Gate-Shift-Pose: Enhancing Action Recognition in Sports with Skeleton Information
Edoardo Bianchi
Oswald Lanz
3DH
68
1
0
06 Mar 2025
BdSLW401: Transformer-Based Word-Level Bangla Sign Language Recognition Using Relative Quantization Encoding (RQE)
Husne Ara Rubaiyeat
Njayou Youssouf
Md. Kamrul Hasan
H. Mahmud
SLR
57
0
0
04 Mar 2025
Solar Multimodal Transformer: Intraday Solar Irradiance Predictor using Public Cameras and Time Series
Yanan Niu
Roy Sarkis
D. Psaltis
Mario Paolone
Christophe Moser
Luisa Lambertini
38
0
0
28 Feb 2025
MICINet: Multi-Level Inter-Class Confusing Information Removal for Reliable Multimodal Classification
MICINet: Multi-Level Inter-Class Confusing Information Removal for Reliable Multimodal Classification
Tao Zhang
Shu Shen
Chao Chen
76
0
0
27 Feb 2025
Rethinking Multimodal Learning from the Perspective of Mitigating Classification Ability Disproportion
Rethinking Multimodal Learning from the Perspective of Mitigating Classification Ability Disproportion
Qingyuan Jiang
Longfei Huang
Yang Yang
62
0
0
27 Feb 2025
ASurvey: Spatiotemporal Consistency in Video Generation
ASurvey: Spatiotemporal Consistency in Video Generation
Zhiyu Yin
Kehai Chen
Xuefeng Bai
Ruili Jiang
J. Li
Hongdong Li
Jin Liu
Yang Xiang
Jun Yu
Min Zhang
EGVM
VGen
AI4TS
64
0
0
25 Feb 2025
Balanced Representation Learning for Long-tailed Skeleton-based Action Recognition
Balanced Representation Learning for Long-tailed Skeleton-based Action Recognition
Hongda Liu
Yunlong Wang
Min Ren
Junxing Hu
Zhengquan Luo
Guangqi Hou
Zhe Sun
55
0
0
24 Feb 2025
Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis
Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis
Amir Hosein Fadaei
M. Dehaqani
45
0
0
11 Feb 2025
MD-BERT: Action Recognition in Dark Videos via Dynamic Multi-Stream Fusion and Temporal Modeling
MD-BERT: Action Recognition in Dark Videos via Dynamic Multi-Stream Fusion and Temporal Modeling
Sharana Dharshikgan Suresh Dass
H. Barua
Ganesh Krishnasamy
Raveendran Paramesran
Raphael C.-W. Phan
69
0
0
06 Feb 2025
BRIDLE: Generalized Self-supervised Learning with Quantization
BRIDLE: Generalized Self-supervised Learning with Quantization
Hoang M. Nguyen
Satya Narayan Shukla
Qiang Zhang
Hanchao Yu
Sreya D. Roy
Taipeng Tian
Lingjiong Zhu
Yuchen Liu
SSL
MQ
84
0
0
04 Feb 2025
Can masking background and object reduce static bias for zero-shot action recognition?
Can masking background and object reduce static bias for zero-shot action recognition?
Takumi Fukuzawa
Kensho Hara
Hirokatsu Kataoka
Toru Tamaki
43
0
0
22 Jan 2025
High-Performance Inference Graph Convolutional Networks for Skeleton-Based Action Recognition
High-Performance Inference Graph Convolutional Networks for Skeleton-Based Action Recognition
Ziao Li
Junyi Wang
Bangli Liu
Haibin Cai
Mohamad Saada
Guhong Nie
3DH
60
0
0
08 Jan 2025
Multiscaled Multi-Head Attention-based Video Transformer Network for Hand Gesture Recognition
Mallika Garg
Debashis Ghosh
P. M. Pradhan
SLR
41
16
0
03 Jan 2025
A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames
A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames
Pinelopi Papalampidi
Skanda Koppula
Shreya Pathak
Justin T Chiu
Joseph Heyward
Viorica Patraucean
Jiajun Shen
Antoine Miech
Andrew Zisserman
Aida Nematzdeh
VLM
72
24
0
31 Dec 2024
Scaling 4D Representations
Scaling 4D Representations
João Carreira
Dilara Gokay
Michael King
Chuhan Zhang
Ignacio Rocco
...
Viorica Patraucean
Dima Damen
Pauline Luc
Mehdi S. M. Sajjadi
Andrew Zisserman
90
3
0
19 Dec 2024
CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile
  Devices
CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile Devices
Andrei Znobishchev
Valerii Filev
Oleg Kudashev
Nikita Orlov
Humphrey Shi
79
0
0
17 Dec 2024
Future Aspects in Human Action Recognition: Exploring Emerging
  Techniques and Ethical Influences
Future Aspects in Human Action Recognition: Exploring Emerging Techniques and Ethical Influences
Antonios Gasteratos
Stavros N. Moutsis
Konstantinos A. Tsintotas
Yiannis Aloimonos
62
0
0
17 Dec 2024
EdgeOAR: Real-time Online Action Recognition On Edge Devices
EdgeOAR: Real-time Online Action Recognition On Edge Devices
Wei Luo
Deyu Zhang
Ying Tang
Fan Wu
Yaoxue Zhang
80
0
0
02 Dec 2024
Learning Visual Abstract Reasoning through Dual-Stream Networks
Learning Visual Abstract Reasoning through Dual-Stream Networks
Kai Zhao
Chang Xu
Bailu Si
112
4
0
29 Nov 2024
A Novel Approach to Image Steganography Using Generative Adversarial Networks
Waheed Rehman
GAN
79
0
0
27 Nov 2024
A Survey of Recent Advances and Challenges in Deep Audio-Visual Correlation Learning
Luis Vilaca
Yi Yu
Paula Vinan
79
0
0
24 Nov 2024
When Spatial meets Temporal in Action Recognition
When Spatial meets Temporal in Action Recognition
H. Chen
Lei Wang
Yuhang Chen
Tom Gedeon
Piotr Koniusz
105
2
0
22 Nov 2024
Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections
Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections
Youwei Zhou
Tianyang Xu
Cong Wu
Xiaojun Wu
J. Kittler
3DH
80
0
0
22 Nov 2024
Video-to-Task Learning via Motion-Guided Attention for Few-Shot Action Recognition
Hanyu Guo
Wanchuan Yu
Suzhou Que
Kaiwen Du
Yan Yan
Hanzi Wang
70
1
0
18 Nov 2024
Weakly-Supervised Anomaly Detection in Surveillance Videos Based on
  Two-Stream I3D Convolution Network
Weakly-Supervised Anomaly Detection in Surveillance Videos Based on Two-Stream I3D Convolution Network
Sareh Nejad
Anwar Haque
21
1
0
13 Nov 2024
Don't Look Twice: Faster Video Transformers with Run-Length Tokenization
Don't Look Twice: Faster Video Transformers with Run-Length Tokenization
Rohan Choudhury
Guanglei Zhu
Sihan Liu
Koichiro Niinuma
Kris M. Kitani
László A. Jeni
34
11
0
07 Nov 2024
Learning Video Representations without Natural Videos
Learning Video Representations without Natural Videos
Xueyang Yu
Xinlei Chen
Yossi Gandelsman
VGen
AI4TS
54
0
0
31 Oct 2024
DIP: Diffusion Learning of Inconsistency Pattern for General DeepFake
  Detection
DIP: Diffusion Learning of Inconsistency Pattern for General DeepFake Detection
Fan Nie
Jiangqun Ni
Jian Zhang
Bin Zhang
Weizhe Zhang
DiffM
39
1
0
31 Oct 2024
SparseTem: Boosting the Efficiency of CNN-Based Video Encoders by
  Exploiting Temporal Continuity
SparseTem: Boosting the Efficiency of CNN-Based Video Encoders by Exploiting Temporal Continuity
Kailong Wang
Jieru Zhao
Shuo Yang
Wenchao Ding
M. Guo
30
0
0
28 Oct 2024
That was not what I was aiming at! Differentiating human intent and
  outcome in a physically dynamic throwing task
That was not what I was aiming at! Differentiating human intent and outcome in a physically dynamic throwing task
Vidullan Surendran
Alan R. Wagner
28
0
0
26 Oct 2024
Enhancing SNN-based Spatio-Temporal Learning: A Benchmark Dataset and
  Cross-Modality Attention Model
Enhancing SNN-based Spatio-Temporal Learning: A Benchmark Dataset and Cross-Modality Attention Model
Shibo Zhou
Bo Yang
Mengwen Yuan
Runhao Jiang
Rui Yan
Gang Pan
Huajin Tang
37
4
0
21 Oct 2024
LocoMotion: Learning Motion-Focused Video-Language Representations
LocoMotion: Learning Motion-Focused Video-Language Representations
Hazel Doughty
Fida Mohammad Thoker
Cees G. M. Snoek
58
2
0
15 Oct 2024
On-the-fly Modulation for Balanced Multimodal Learning
On-the-fly Modulation for Balanced Multimodal Learning
Yake Wei
D. Hu
Henghui Du
Zhicheng Dou
34
7
0
15 Oct 2024
The Ingredients for Robotic Diffusion Transformers
The Ingredients for Robotic Diffusion Transformers
Sudeep Dasari
Oier Mees
Sebastian Zhao
Mohan Kumar Srirama
Sergey Levine
56
20
0
14 Oct 2024
Fourier-based Action Recognition for Wildlife Behavior Quantification
  with Event Cameras
Fourier-based Action Recognition for Wildlife Behavior Quantification with Event Cameras
Friedhelm Hamann
Suman Ghosh
Ignacio Juarez Martinez
Tom Hart
Alex Kacelnik
Guillermo Gallego
36
0
0
09 Oct 2024
Cefdet: Cognitive Effectiveness Network Based on Fuzzy Inference for
  Action Detection
Cefdet: Cognitive Effectiveness Network Based on Fuzzy Inference for Action Detection
Zhe Luo
Weina Fu
Shuai Liu
Saeed Anwar
Muhammad Saqib
Sambit Bakshi
Khan Muhammad
43
2
0
08 Oct 2024
1234...444546
Next