ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1406.2199
  4. Cited By
Two-Stream Convolutional Networks for Action Recognition in Videos

Two-Stream Convolutional Networks for Action Recognition in Videos

9 June 2014
Karen Simonyan
Andrew Zisserman
ArXivPDFHTML

Papers citing "Two-Stream Convolutional Networks for Action Recognition in Videos"

50 / 2,275 papers shown
Title
Exploiting long-term temporal dynamics for video captioning
Exploiting long-term temporal dynamics for video captioning
Yuyu Guo
Jingqiu Zhang
Lianli Gao
19
18
0
22 Feb 2022
3DRM:Pair-wise relation module for 3D object detection
3DRM:Pair-wise relation module for 3D object detection
Yuqing Lan
Yao Duan
Yifei Shi
Hui Huang
Kai Xu
3DPC
30
4
0
20 Feb 2022
Student Dangerous Behavior Detection in School
Student Dangerous Behavior Detection in School
Huayi Zhou
Fei Jiang
Hongtao Lu
12
4
0
19 Feb 2022
Going Deeper into Recognizing Actions in Dark Environments: A
  Comprehensive Benchmark Study
Going Deeper into Recognizing Actions in Dark Environments: A Comprehensive Benchmark Study
Yuecong Xu
Jianfei Yang
Haozhi Cao
Jianxiong Yin
Zhenghua Chen
Xiaoli Li
Zhengguo Li
Qiaoqiao Xu
43
2
0
19 Feb 2022
ActionFormer: Localizing Moments of Actions with Transformers
ActionFormer: Localizing Moments of Actions with Transformers
Chen-Da Liu-Zhang
Jianxin Wu
Yin Li
ViT
31
333
0
16 Feb 2022
Integration of knowledge and data in machine learning
Integration of knowledge and data in machine learning
Yuntian Chen
Dongxiao Zhang
PINN
28
31
0
15 Feb 2022
HAKE: A Knowledge Engine Foundation for Human Activity Understanding
HAKE: A Knowledge Engine Foundation for Human Activity Understanding
Yong-Lu Li
Xinpeng Liu
Xiaoqian Wu
Yizhuo Li
Zuoyu Qiu
Liang Xu
Yue Xu
Haoshu Fang
Cewu Lu
32
38
0
14 Feb 2022
CZU-MHAD: A multimodal dataset for human action recognition utilizing a
  depth camera and 10 wearable inertial sensors
CZU-MHAD: A multimodal dataset for human action recognition utilizing a depth camera and 10 wearable inertial sensors
Xin Chao
Zhenjie Hou
Yu Mo
35
20
0
07 Feb 2022
A Coding Framework and Benchmark towards Compressed Video Understanding
A Coding Framework and Benchmark towards Compressed Video Understanding
Yuan Tian
Guo Lu
Yichao Yan
Guangtao Zhai
L. Chen
Zhiyong Gao
41
21
0
06 Feb 2022
Towards To-a-T Spatio-Temporal Focus for Skeleton-Based Action
  Recognition
Towards To-a-T Spatio-Temporal Focus for Skeleton-Based Action Recognition
Lipeng Ke
Kuan-Chuan Peng
Siwei Lyu
3DPC
34
33
0
04 Feb 2022
Video Violence Recognition and Localization Using a Semi-Supervised Hard
  Attention Model
Video Violence Recognition and Localization Using a Semi-Supervised Hard Attention Model
Hamid Reza Mohammadi
Ehsan Nazerfard
32
24
0
04 Feb 2022
Human Activity Recognition Using Tools of Convolutional Neural Networks:
  A State of the Art Review, Data Sets, Challenges and Future Prospects
Human Activity Recognition Using Tools of Convolutional Neural Networks: A State of the Art Review, Data Sets, Challenges and Future Prospects
Md. Milon Islam
Sheikh Nooruddin
Fakhri Karray
Muhammad Ghulam
BDL
29
129
0
02 Feb 2022
An Eye for an Eye: Defending against Gradient-based Attacks with
  Gradients
An Eye for an Eye: Defending against Gradient-based Attacks with Gradients
Hanbin Hong
Yuan Hong
Yu Kong
AAML
35
2
0
02 Feb 2022
MMSys'22 Grand Challenge on AI-based Video Production for Soccer
MMSys'22 Grand Challenge on AI-based Video Production for Soccer
Cise Midoglu
Steven A. Hicks
Vajira Thambawita
T. Kupka
Pål Halvorsen
VGen
49
14
0
02 Feb 2022
Should I take a walk? Estimating Energy Expenditure from Video Data
Should I take a walk? Estimating Energy Expenditure from Video Data
Kunyu Peng
Alina Roitberg
Kailun Yang
Jiaming Zhang
Rainer Stiefelhagen
18
4
0
01 Feb 2022
Capturing Temporal Information in a Single Frame: Channel Sampling
  Strategies for Action Recognition
Capturing Temporal Information in a Single Frame: Channel Sampling Strategies for Action Recognition
Kiyoon Kim
Shreyank N. Gowda
Oisin Mac Aodha
Laura Sevilla-Lara
34
9
0
25 Jan 2022
LTC-GIF: Attracting More Clicks on Feature-length Sports Videos
LTC-GIF: Attracting More Clicks on Feature-length Sports Videos
G. Mujtaba
Jaehyuk Choi
Eun‐Seok Ryu
16
0
0
22 Jan 2022
LTC-SUM: Lightweight Client-driven Personalized Video Summarization
  Framework Using 2D CNN
LTC-SUM: Lightweight Client-driven Personalized Video Summarization Framework Using 2D CNN
G. Mujtaba
A. Malik
Eun‐Seok Ryu
32
15
0
22 Jan 2022
Omnivore: A Single Model for Many Visual Modalities
Omnivore: A Single Model for Many Visual Modalities
Rohit Girdhar
Mannat Singh
Nikhil Ravi
Laurens van der Maaten
Armand Joulin
Ishan Misra
229
226
0
20 Jan 2022
Autoencoding Video Latents for Adversarial Video Generation
Autoencoding Video Latents for Adversarial Video Generation
Sai Hemanth Kasaraneni
VGen
30
3
0
18 Jan 2022
Action Keypoint Network for Efficient Video Recognition
Action Keypoint Network for Efficient Video Recognition
Xu Chen
Yahong Han
Xiaohan Wang
Yifang Sun
Yi Yang
3DPC
32
6
0
17 Jan 2022
Video Transformers: A Survey
Video Transformers: A Survey
Javier Selva
A. S. Johansen
Sergio Escalera
Kamal Nasrollahi
T. Moeslund
Albert Clapés
ViT
29
103
0
16 Jan 2022
Hand-Object Interaction Reasoning
Hand-Object Interaction Reasoning
Jian Ma
Dima Damen
27
7
0
13 Jan 2022
OCSampler: Compressing Videos to One Clip with Single-step Sampling
OCSampler: Compressing Videos to One Clip with Single-step Sampling
Jintao Lin
Haodong Duan
Kai-xiang Chen
Dahua Lin
Limin Wang
47
24
0
12 Jan 2022
Multiview Transformers for Video Recognition
Multiview Transformers for Video Recognition
Shen Yan
Xuehan Xiong
Anurag Arnab
Zhichao Lu
Mi Zhang
Chen Sun
Cordelia Schmid
ViT
26
212
0
12 Jan 2022
Incidents1M: a large-scale dataset of images with natural disasters,
  damage, and incidents
Incidents1M: a large-scale dataset of images with natural disasters, damage, and incidents
Ethan Weber
Dim P. Papadopoulos
Àgata Lapedriza
Ferda Ofli
Muhammad Imran
Antonio Torralba
75
23
0
11 Jan 2022
Motion-Focused Contrastive Learning of Video Representations
Motion-Focused Contrastive Learning of Video Representations
Rui Li
Yiheng Zhang
Zhaofan Qiu
Ting Yao
Dong Liu
Tao Mei
SSL
39
34
0
11 Jan 2022
Representing Videos as Discriminative Sub-graphs for Action Recognition
Representing Videos as Discriminative Sub-graphs for Action Recognition
Dong Li
Zhaofan Qiu
Yingwei Pan
Ting Yao
Houqiang Li
Tao Mei
44
26
0
11 Jan 2022
Boosting Video Representation Learning with Multi-Faceted Integration
Boosting Video Representation Learning with Multi-Faceted Integration
Zhaofan Qiu
Ting Yao
Chong-Wah Ngo
Xiaoping Zhang
Dong Wu
Tao Mei
31
8
0
11 Jan 2022
Condensing a Sequence to One Informative Frame for Video Recognition
Condensing a Sequence to One Informative Frame for Video Recognition
Zhaofan Qiu
Ting Yao
Y. Shu
Chong-Wah Ngo
Tao Mei
42
9
0
11 Jan 2022
Optimization Planning for 3D ConvNets
Optimization Planning for 3D ConvNets
Zhaofan Qiu
Ting Yao
Chong-Wah Ngo
Tao Mei
3DPC
3DH
42
9
0
11 Jan 2022
TSA-Net: Tube Self-Attention Network for Action Quality Assessment
TSA-Net: Tube Self-Attention Network for Action Quality Assessment
Shunli Wang
Dingkang Yang
Peng Zhai
Chixiao Chen
Lihua Zhang
ViT
37
63
0
11 Jan 2022
A ConvNet for the 2020s
A ConvNet for the 2020s
Zhuang Liu
Hanzi Mao
Chaozheng Wu
Christoph Feichtenhofer
Trevor Darrell
Saining Xie
ViT
42
4,989
0
10 Jan 2022
TVNet: Temporal Voting Network for Action Localization
TVNet: Temporal Voting Network for Action Localization
Hanyuan Wang
Dima Damen
Majid Mirmehdi
Toby Perrett
30
6
0
02 Jan 2022
Robust and Precise Facial Landmark Detection by Self-Calibrated Pose
  Attention Network
Robust and Precise Facial Landmark Detection by Self-Calibrated Pose Attention Network
Jun Wan
Hui Xi
Jie Zhou
Zhihui Lai
Witold Pedrycz
Xu Wang
Hang Sun
CVBM
14
17
0
23 Dec 2021
Expansion-Squeeze-Excitation Fusion Network for Elderly Activity
  Recognition
Expansion-Squeeze-Excitation Fusion Network for Elderly Activity Recognition
Xiangbo Shu
Jiawen Yang
Rui Yan
Yan Song
25
147
0
21 Dec 2021
Precondition and Effect Reasoning for Action Recognition
Precondition and Effect Reasoning for Action Recognition
Hongsang Yoo
Haopeng Li
Qiuhong Ke
Liangchen Liu
Rui Zhang
CML
51
4
0
19 Dec 2021
Adversarial Memory Networks for Action Prediction
Adversarial Memory Networks for Action Prediction
Zhiqiang Tao
Yue Bai
Handong Zhao
Sheng Li
Yuanyuan Kong
Y. Fu
GAN
18
2
0
18 Dec 2021
Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition
Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition
Yinghao Xu
Fangyun Wei
Xiao Sun
Ceyuan Yang
Yujun Shen
Bo Dai
Bolei Zhou
Stephen Lin
VLM
33
52
0
17 Dec 2021
Distillation of Human-Object Interaction Contexts for Action Recognition
Distillation of Human-Object Interaction Contexts for Action Recognition
Muna Almushyti
Frederick W. Li
39
3
0
17 Dec 2021
Spatio-Temporal CNN baseline method for the Sports Video Task of
  MediaEval 2021 benchmark
Spatio-Temporal CNN baseline method for the Sports Video Task of MediaEval 2021 benchmark
Pierre-Etienne Martin
22
7
0
16 Dec 2021
Rethinking Nearest Neighbors for Visual Classification
Rethinking Nearest Neighbors for Visual Classification
Menglin Jia
Bor-Chun Chen
Zuxuan Wu
Claire Cardie
Serge Belongie
Ser-Nam Lim
SSL
41
10
0
15 Dec 2021
Temporal Action Proposal Generation with Background Constraint
Temporal Action Proposal Generation with Background Constraint
Haosen Yang
Wenhao Wu
Lining Wang
Sheng Jin
Boyang Xia
Huanjin Yao
Hujie Huang
23
27
0
15 Dec 2021
Temporal Transformer Networks with Self-Supervision for Action
  Recognition
Temporal Transformer Networks with Self-Supervision for Action Recognition
Yongkang Zhang
Jun Li
Guoming Wu
Hanjie Zhang
Zhiping Shi
Zhaoxun Liu
Zizhang Wu
ViT
32
4
0
14 Dec 2021
Co-training Transformer with Videos and Images Improves Action
  Recognition
Co-training Transformer with Videos and Images Improves Action Recognition
Bowen Zhang
Jiahui Yu
Christopher Fifty
Wei Han
Andrew M. Dai
Ruoming Pang
Fei Sha
ViT
28
54
0
14 Dec 2021
SVIP: Sequence VerIfication for Procedures in Videos
SVIP: Sequence VerIfication for Procedures in Videos
Yichen Qian
Weixin Luo
Dongze Lian
Xu Tang
P. Zhao
Shenghua Gao
ViT
36
17
0
13 Dec 2021
Progressive Attention on Multi-Level Dense Difference Maps for Generic
  Event Boundary Detection
Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection
Jiaqi Tang
Zhaoyang Liu
Chao Qian
Wayne Wu
Limin Wang
17
17
0
09 Dec 2021
Prompting Visual-Language Models for Efficient Video Understanding
Prompting Visual-Language Models for Efficient Video Understanding
Chen Ju
Tengda Han
Kunhao Zheng
Ya Zhang
Weidi Xie
VPVLM
VLM
33
364
0
08 Dec 2021
ViewCLR: Learning Self-supervised Video Representation for Unseen
  Viewpoints
ViewCLR: Learning Self-supervised Video Representation for Unseen Viewpoints
Srijan Das
Michael S. Ryoo
SSL
44
17
0
07 Dec 2021
DCAN: Improving Temporal Action Detection via Dual Context Aggregation
DCAN: Improving Temporal Action Detection via Dual Context Aggregation
Guo Chen
Yin-Dong Zheng
Limin Wang
Tong Lu
AI4TS
37
70
0
07 Dec 2021
Previous
123...111213...444546
Next