1406.2199
Cited By
Two-Stream Convolutional Networks for Action Recognition in Videos
9 June 2014
Karen Simonyan
Andrew Zisserman
Papers citing
"Two-Stream Convolutional Networks for Action Recognition in Videos"
50 / 2,275 papers shown
Title
Exploiting long-term temporal dynamics for video captioning
Yuyu Guo
Jingqiu Zhang
Lianli Gao
19
18
0
22 Feb 2022
3DRM:Pair-wise relation module for 3D object detection
Yuqing Lan
Yao Duan
Yifei Shi
Hui Huang
Kai Xu
3DPC
30
4
0
20 Feb 2022
Student Dangerous Behavior Detection in School
Huayi Zhou
Fei Jiang
Hongtao Lu
12
4
0
19 Feb 2022
Going Deeper into Recognizing Actions in Dark Environments: A Comprehensive Benchmark Study
Yuecong Xu
Jianfei Yang
Haozhi Cao
Jianxiong Yin
Zhenghua Chen
Xiaoli Li
Zhengguo Li
Qiaoqiao Xu
43
2
0
19 Feb 2022
ActionFormer: Localizing Moments of Actions with Transformers
Chenlin Zhang
Jianxin Wu
Yin Li
ViT
31
333
0
16 Feb 2022
Integration of knowledge and data in machine learning
Yuntian Chen
Dongxiao Zhang
PINN
28
31
0
15 Feb 2022
HAKE: A Knowledge Engine Foundation for Human Activity Understanding
Yong-Lu Li
Xinpeng Liu
Xiaoqian Wu
Yizhuo Li
Zuoyu Qiu
Liang Xu
Yue Xu
Haoshu Fang
Cewu Lu
32
38
0
14 Feb 2022
CZU-MHAD: A multimodal dataset for human action recognition utilizing a depth camera and 10 wearable inertial sensors
Xin Chao
Zhenjie Hou
Yu Mo
35
20
0
07 Feb 2022
A Coding Framework and Benchmark towards Compressed Video Understanding
Yuan Tian
Guo Lu
Yichao Yan
Guangtao Zhai
L. Chen
Zhiyong Gao
41
21
0
06 Feb 2022
Towards To-a-T Spatio-Temporal Focus for Skeleton-Based Action Recognition
Lipeng Ke
Kuan-Chuan Peng
Siwei Lyu
3DPC
34
33
0
04 Feb 2022
Video Violence Recognition and Localization Using a Semi-Supervised Hard Attention Model
Hamid Reza Mohammadi
Ehsan Nazerfard
32
24
0
04 Feb 2022
Human Activity Recognition Using Tools of Convolutional Neural Networks: A State of the Art Review, Data Sets, Challenges and Future Prospects
Md. Milon Islam
Sheikh Nooruddin
Fakhri Karray
Muhammad Ghulam
BDL
29
129
0
02 Feb 2022
An Eye for an Eye: Defending against Gradient-based Attacks with Gradients
Hanbin Hong
Yuan Hong
Yu Kong
AAML
35
2
0
02 Feb 2022
MMSys'22 Grand Challenge on AI-based Video Production for Soccer
Cise Midoglu
Steven A. Hicks
Vajira Thambawita
T. Kupka
Pål Halvorsen
VGen
49
14
0
02 Feb 2022
Should I take a walk? Estimating Energy Expenditure from Video Data
Kunyu Peng
Alina Roitberg
Kailun Yang
Jiaming Zhang
Rainer Stiefelhagen
18
4
0
01 Feb 2022
Capturing Temporal Information in a Single Frame: Channel Sampling Strategies for Action Recognition
Kiyoon Kim
Shreyank N. Gowda
Oisin Mac Aodha
Laura Sevilla-Lara
34
9
0
25 Jan 2022
LTC-GIF: Attracting More Clicks on Feature-length Sports Videos
G. Mujtaba
Jaehyuk Choi
Eun‐Seok Ryu
16
0
0
22 Jan 2022
LTC-SUM: Lightweight Client-driven Personalized Video Summarization Framework Using 2D CNN
G. Mujtaba
A. Malik
Eun‐Seok Ryu
32
15
0
22 Jan 2022
Omnivore: A Single Model for Many Visual Modalities
Rohit Girdhar
Mannat Singh
Nikhil Ravi
Laurens van der Maaten
Armand Joulin
Ishan Misra
229
226
0
20 Jan 2022
Autoencoding Video Latents for Adversarial Video Generation
Sai Hemanth Kasaraneni
VGen
30
3
0
18 Jan 2022
Action Keypoint Network for Efficient Video Recognition
Xu Chen
Yahong Han
Xiaohan Wang
Yifang Sun
Yi Yang
3DPC
32
6
0
17 Jan 2022
Video Transformers: A Survey
Javier Selva
A. S. Johansen
Sergio Escalera
Kamal Nasrollahi
T. Moeslund
Albert Clapés
ViT
29
103
0
16 Jan 2022
Hand-Object Interaction Reasoning
Jian Ma
Dima Damen
27
7
0
13 Jan 2022
OCSampler: Compressing Videos to One Clip with Single-step Sampling
Jintao Lin
Haodong Duan
Kai-xiang Chen
Dahua Lin
Limin Wang
47
24
0
12 Jan 2022
Multiview Transformers for Video Recognition
Shen Yan
Xuehan Xiong
Anurag Arnab
Zhichao Lu
Mi Zhang
Chen Sun
Cordelia Schmid
ViT
26
212
0
12 Jan 2022
Incidents1M: a large-scale dataset of images with natural disasters, damage, and incidents
Ethan Weber
Dim P. Papadopoulos
Àgata Lapedriza
Ferda Ofli
Muhammad Imran
Antonio Torralba
75
23
0
11 Jan 2022
Motion-Focused Contrastive Learning of Video Representations
Rui Li
Yiheng Zhang
Zhaofan Qiu
Ting Yao
Dong Liu
Tao Mei
SSL
39
34
0
11 Jan 2022
Representing Videos as Discriminative Sub-graphs for Action Recognition
Dong Li
Zhaofan Qiu
Yingwei Pan
Ting Yao
Houqiang Li
Tao Mei
44
26
0
11 Jan 2022
Boosting Video Representation Learning with Multi-Faceted Integration
Zhaofan Qiu
Ting Yao
Chong-Wah Ngo
Xiaoping Zhang
Dong Wu
Tao Mei
31
8
0
11 Jan 2022
Condensing a Sequence to One Informative Frame for Video Recognition
Zhaofan Qiu
Ting Yao
Y. Shu
Chong-Wah Ngo
Tao Mei
42
9
0
11 Jan 2022
Optimization Planning for 3D ConvNets
Zhaofan Qiu
Ting Yao
Chong-Wah Ngo
Tao Mei
3DPC
3DH
42
9
0
11 Jan 2022
TSA-Net: Tube Self-Attention Network for Action Quality Assessment
Shunli Wang
Dingkang Yang
Peng Zhai
Chixiao Chen
Lihua Zhang
ViT
37
63
0
11 Jan 2022
A ConvNet for the 2020s
Zhuang Liu
Hanzi Mao
Chao-Yuan Wu
Christoph Feichtenhofer
Trevor Darrell
Saining Xie
ViT
42
4,989
0
10 Jan 2022
TVNet: Temporal Voting Network for Action Localization
Hanyuan Wang
Dima Damen
Majid Mirmehdi
Toby Perrett
30
6
0
02 Jan 2022
Robust and Precise Facial Landmark Detection by Self-Calibrated Pose Attention Network
Jun Wan
Hui Xi
Jie Zhou
Zhihui Lai
Witold Pedrycz
Xu Wang
Hang Sun
CVBM
14
17
0
23 Dec 2021
Expansion-Squeeze-Excitation Fusion Network for Elderly Activity Recognition
Xiangbo Shu
Jiawen Yang
Rui Yan
Yan Song
25
147
0
21 Dec 2021
Precondition and Effect Reasoning for Action Recognition
Hongsang Yoo
Haopeng Li
Qiuhong Ke
Liangchen Liu
Rui Zhang
CML
51
4
0
19 Dec 2021
Adversarial Memory Networks for Action Prediction
Zhiqiang Tao
Yue Bai
Handong Zhao
Sheng Li
Yuanyuan Kong
Y. Fu
GAN
18
2
0
18 Dec 2021
Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition
Yinghao Xu
Fangyun Wei
Xiao Sun
Ceyuan Yang
Yujun Shen
Bo Dai
Bolei Zhou
Stephen Lin
VLM
33
52
0
17 Dec 2021
Distillation of Human-Object Interaction Contexts for Action Recognition
Muna Almushyti
Frederick W. Li
39
3
0
17 Dec 2021
Spatio-Temporal CNN baseline method for the Sports Video Task of MediaEval 2021 benchmark
Pierre-Etienne Martin
22
7
0
16 Dec 2021
Rethinking Nearest Neighbors for Visual Classification
Menglin Jia
Bor-Chun Chen
Zuxuan Wu
Claire Cardie
Serge Belongie
Ser-Nam Lim
SSL
41
10
0
15 Dec 2021
Temporal Action Proposal Generation with Background Constraint
Haosen Yang
Wenhao Wu
Lining Wang
Sheng Jin
Boyang Xia
Huanjin Yao
Hujie Huang
23
27
0
15 Dec 2021
Temporal Transformer Networks with Self-Supervision for Action Recognition
Yongkang Zhang
Jun Li
Guoming Wu
Hanjie Zhang
Zhiping Shi
Zhaoxun Liu
Zizhang Wu
ViT
32
4
0
14 Dec 2021
Co-training Transformer with Videos and Images Improves Action Recognition
Bowen Zhang
Jiahui Yu
Christopher Fifty
Wei Han
Andrew M. Dai
Ruoming Pang
Fei Sha
ViT
28
54
0
14 Dec 2021
SVIP: Sequence VerIfication for Procedures in Videos
Yichen Qian
Weixin Luo
Dongze Lian
Xu Tang
P. Zhao
Shenghua Gao
ViT
36
17
0
13 Dec 2021
Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection
Jiaqi Tang
Zhaoyang Liu
Chao Qian
Wayne Wu
Limin Wang
17
17
0
09 Dec 2021
Prompting Visual-Language Models for Efficient Video Understanding
Chen Ju
Tengda Han
Kunhao Zheng
Ya Zhang
Weidi Xie
VPVLM
VLM
33
364
0
08 Dec 2021
ViewCLR: Learning Self-supervised Video Representation for Unseen Viewpoints
Srijan Das
Michael S. Ryoo
SSL
44
17
0
07 Dec 2021
DCAN: Improving Temporal Action Detection via Dual Context Aggregation
Guo Chen
Yin-Dong Zheng
Limin Wang
Tong Lu
AI4TS
37
70
0
07 Dec 2021