ResearchTrend.AI
Two-Stream Convolutional Networks for Action Recognition in Videos

9 June 2014
Karen Simonyan
Andrew Zisserman

Papers citing "Two-Stream Convolutional Networks for Action Recognition in Videos"

50 / 2,275 papers shown
Vision-Language Models can Identify Distracted Driver Behavior from Naturalistic Videos
Md Zahid Hasan
Jiajing Chen
Jiyang Wang
Mohammed Shaiqur Rahman
Ameya Joshi
Senem Velipasalar
C. Hegde
Anuj Sharma
S. Sarkar
VLM
55
18
0
16 Jun 2023
Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers
Dominick Reilly
Aman Chadha
Srijan Das
ViT
33
4
0
15 Jun 2023
E2E-LOAD: End-to-End Long-form Online Action Detection
Shuyuan Cao
Weihua Luo
Bairui Wang
Wei Emma Zhang
Lin Ma
33
5
0
13 Jun 2023
Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment
Zihui Xue
Kristen Grauman
EgoV
45
31
0
08 Jun 2023
Optimizing ViViT Training: Time and Memory Reduction for Action Recognition
Shreyank N. Gowda
Anurag Arnab
Jonathan Huang
ViT
31
4
0
07 Jun 2023
Atrial Septal Defect Detection in Children Based on Ultrasound Video Using Multiple Instances Learning
Yiman Liu
Qingming Huang
Xiaoxiang Han
Tongtong Liang
Zhi-fang Zhang
...
Angelos Stefanidis
Jionglong Su
Jiangang Chen
Qingli Li
Yuqi Zhang
27
7
0
06 Jun 2023
Human-Object Interaction Prediction in Videos through Gaze Following
Zhifan Ni
Esteve Valls Mascaro
Hyemin Ahn
Dongheui Lee
35
10
0
06 Jun 2023
A Multi-Modal Transformer Network for Action Detection
Matthew Korban
Scott T. Acton
Peter Youngs
ViT
43
15
0
31 May 2023
Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning
Sanjoy Kundu
Shubham Trehan
Sathyanarayanan N. Aakur
LRM
LM&Ro
27
1
0
26 May 2023
Action Sensitivity Learning for Temporal Action Localization
Jiayi Shao
Xiaohan Wang
Ruijie Quan
Junjun Zheng
Jiang Yang
Yezhou Yang
33
22
0
25 May 2023
Cross-view Action Recognition Understanding From Exocentric to Egocentric Perspective
Thanh-Dat Truong
Khoa Luu
EgoV
46
10
0
25 May 2023
Continual Learning through Human-Robot Interaction: Human Perceptions of a Continual Learning Robot in Repeated Interactions
Ali Ayub
Zachary De Francesco
Patrick Holthaus
Chrystopher L. Nehaniv
Kerstin Dautenhahn
CLL
HAI
44
5
0
22 May 2023
Exploring Few-Shot Adaptation for Activity Recognition on Diverse Domains
Kunyu Peng
Di Wen
David Schneider
Jiaming Zhang
Kailun Yang
M. Sarfraz
Rainer Stiefelhagen
Alina Roitberg
41
2
0
15 May 2023
Is end-to-end learning enough for fitness activity recognition?
Antoine Mercier
Guillaume Berger
Sunny Panchal
Florian Letsch
Cornelius Boehm
Nahua Kang
Ingo Bax
Roland Memisevic
28
2
0
14 May 2023
Lightweight Delivery Detection on Doorbell Cameras
Pirazh Khorramshahi
Zhe Wu
Tianchen Wang
Luke Deluccia
Hongcheng Wang
19
0
0
13 May 2023
Active Semantic Localization with Graph Neural Embedding
Mitsuki Yoshida
Kanji Tanaka
Ryo Yamamoto
Daiki Iwata
22
1
0
10 May 2023
Group Activity Recognition via Dynamic Composition and Interaction
Youliang Zhang
Zhuo Zhou
Wenxuan Liu
Danni Xu
Zheng Wang
35
0
0
09 May 2023
Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Action Localization
Xijun Wang
Aggelos K. Katsaggelos
34
0
0
07 May 2023
ItoV: Efficiently Adapting Deep Learning-based Image Watermarking to Video Watermarking
Guanhui Ye
Jiashi Gao
Yuchen Wang
Liyan Song
Xue-Ming Wei
35
3
0
04 May 2023
Weakly-supervised Micro- and Macro-expression Spotting Based on Multi-level Consistency
Wang-Wang Yu
Kai-Fu Yang
Hong-Mei Yan
Yong-Jie Li
37
2
0
04 May 2023
Local and Global Contextual Features Fusion for Pedestrian Intention Prediction
Mohsen Azarmi
Mahdi Rezaei
Tanveer Hussain
Chenghao Qian
46
8
0
01 May 2023
Physical Adversarial Attacks for Surveillance: A Survey
Kien Nguyen Thanh
Tharindu Fernando
Clinton Fookes
Sridha Sridharan
AAML
36
8
0
01 May 2023
Weakly-Supervised Temporal Action Localization with Bidirectional Semantic Consistency Constraint
Guozhang Li
De Cheng
Xinpeng Ding
N. Wang
Jie Li
Xinbo Gao
25
6
0
25 Apr 2023
MRSN: Multi-Relation Support Network for Video Action Detection
Yin-Dong Zheng
Guo Chen
Minglei Yuan
Tong Lu
36
8
0
24 Apr 2023
Implicit Temporal Modeling with Learnable Alignment for Video Recognition
S. Tu
Qi Dai
Zuxuan Wu
Zhi-Qi Cheng
Hang-Rui Hu
Yu-Gang Jiang
46
35
0
20 Apr 2023
Search-Map-Search: A Frame Selection Paradigm for Action Recognition
Mingjun Zhao
Yu
Xiaoli Wang
Lei Yang
Di Niu
26
5
0
20 Apr 2023
Video-based Contrastive Learning on Decision Trees: from Action Recognition to Autism Diagnosis
Mindi Ruan
Xiang Yu
Naifeng Zhang
Chuanbo Hu
Shuo Wang
Xin Li
36
8
0
20 Apr 2023
Self-Supervised 3D Action Representation Learning with Skeleton Cloud Colorization
Siyuan Yang
Jun Liu
Shijian Lu
Er Meng Hwa
Yongjian Hu
Alex C. Kot
3DPC
3DH
38
16
0
18 Apr 2023
Multimodal Short Video Rumor Detection System Based on Contrastive Learning
Yuxing Yang
Junhao Zhao
Siyi Wang
Xiangyu Min
Peifeng Wang
Haizhou Wang
19
2
0
17 Apr 2023
Unsupervised Learning Optical Flow in Multi-frame Dynamic Environment Using Temporal Dynamic Modeling
Zitang Sun
Shin'ya Nishida
Zhengbo Luo
21
1
0
14 Apr 2023
Explaining, Analyzing, and Probing Representations of Self-Supervised Learning Models for Sensor-based Human Activity Recognition
Bulat Khaertdinov
S. Asteriadis
34
3
0
14 Apr 2023
PMI Sampler: Patch Similarity Guided Frame Selection for Aerial Action Recognition
Ruiqi Xian
Xijun Wang
D. Kothandaraman
Tianyi Zhou
25
7
0
14 Apr 2023
DNeRV: Modeling Inherent Dynamics via Difference Neural Representation for Videos
Qi Zhao
Ulugbek S. Kamilov
Zhan Ma
26
32
0
13 Apr 2023
VARS: Video Assistant Referee System for Automated Soccer Decision Making from Multiple Views
Jan Held
A. Cioppa
Silvio Giancola
Abdullah Hamdi
Guohao Li
Marc Van Droogenbroeck
27
29
0
10 Apr 2023
On the Benefits of 3D Pose and Tracking for Human Action Recognition
Jathushan Rajasegaran
Georgios Pavlakos
Angjoo Kanazawa
Christoph Feichtenhofer
Jitendra Malik
44
30
0
03 Apr 2023
AutoLabel: CLIP-based framework for Open-set Video Domain Adaptation
Giacomo Zara
Subhankar Roy
Paolo Rota
Elisa Ricci
VLM
21
13
0
03 Apr 2023
MoLo: Motion-augmented Long-short Contrastive Learning for Few-shot Action Recognition
Xiang Wang
Shiwei Zhang
Zhiwu Qing
Changxin Gao
Yingya Zhang
Deli Zhao
Nong Sang
24
40
0
03 Apr 2023
From Isolated Islands to Pangea: Unifying Semantic Space for Human Action Understanding
Yong-Lu Li
Xiaoqian Wu
Xinpeng Liu
Zehao Wang
Yiming Dou
...
Junyi Zhang
Yixing Li
Jingru Tan
Xudong Lu
Cewu Lu
27
17
0
02 Apr 2023
DOAD: Decoupled One Stage Action Detection Network
Shuning Chang
Pichao Wang
Fan Wang
Jiashi Feng
Mike Zheng Shou
26
4
0
01 Apr 2023
Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection
Pilhyeon Lee
Taeoh Kim
Minho Shim
Dongyoon Wee
H. Byun
41
11
0
30 Mar 2023
CycleACR: Cycle Modeling of Actor-Context Relations for Video Action Detection
Lei Chen
Zhan Tong
Yibing Song
Gangshan Wu
Limin Wang
25
3
0
28 Mar 2023
Unified Keypoint-based Action Recognition Framework via Structured Keypoint Pooling
Ryo Hachiuma
Fumiaki Sato
Taiki Sekii
3DPC
29
37
0
27 Mar 2023
Selective Structured State-Spaces for Long-Form Video Understanding
Jue Wang
Wenjie Zhu
Pichao Wang
Xiang Yu
Linda Liu
Mohamed Omar
Raffay Hamid
41
95
0
25 Mar 2023
Enlarging Instance-specific and Class-specific Information for Open-set Action Recognition
Jun Cen
Shiwei Zhang
Xiang Wang
Yixuan Pei
Zhiwu Qing
Yingya Zhang
Qifeng Chen
44
3
0
25 Mar 2023
A Large-scale Study of Spatiotemporal Representation Learning with a New Benchmark on Action Recognition
Andong Deng
Taojiannan Yang
Chong Chen
AI4TS
32
13
0
23 Mar 2023
The effectiveness of MAE pre-pretraining for billion-scale pretraining
Mannat Singh
Quentin Duval
Kalyan Vasudev Alwala
Haoqi Fan
Vaibhav Aggarwal
...
Piotr Dollár
Christoph Feichtenhofer
Ross B. Girshick
Rohit Girdhar
Ishan Misra
LRM
126
63
0
23 Mar 2023
Automatic evaluation of herding behavior in towed fishing gear using end-to-end training of CNN and attention-based networks
Orri Steinn Guðfinnsson
Týr Vilhjálmsson
Martin Eineborg
T. Thórhallsson
11
0
0
21 Mar 2023
Propagate And Calibrate: Real-time Passive Non-line-of-sight Tracking
Yihao Wang
Zhigang Wang
Bin Zhao
Dong Wang
Mulin Chen
Xuelong Li
19
2
0
21 Mar 2023
Tubelet-Contrastive Self-Supervision for Video-Efficient Generalization
Fida Mohammad Thoker
Hazel Doughty
Cees G. M. Snoek
ViT
50
9
0
20 Mar 2023
Synthetic-to-Real Domain Adaptation for Action Recognition: A Dataset and Baseline Performances
Arun V. Reddy
Ketul Shah
William Paul
Rohita Mocharla
Judy Hoffman
Kapil D. Katyal
Dinesh Manocha
Celso M. de Melo
Ramalingam Chellappa
31
17
0
17 Mar 2023