Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1406.2199
Cited By
Two-Stream Convolutional Networks for Action Recognition in Videos
9 June 2014
Karen Simonyan
Andrew Zisserman
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Two-Stream Convolutional Networks for Action Recognition in Videos"
50 / 2,275 papers shown
Title
Representation Learning via Global Temporal Alignment and Cycle-Consistency
Isma Hadji
Konstantinos G. Derpanis
Allan D. Jepson
AI4TS
35
54
0
11 May 2021
AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition
Yikang Shen
Chun-Fu Chen
Quanfu Fan
Ximeng Sun
Kate Saenko
A. Oliva
Rogerio Feris
41
47
0
11 May 2021
Spoken Moments: Learning Joint Audio-Visual Representations from Video Descriptions
Mathew Monfort
SouYoung Jin
Alexander H. Liu
David Harwath
Rogerio Feris
James Glass
Aude Oliva
22
59
0
10 May 2021
Coupling Intent and Action for Pedestrian Crossing Behavior Prediction
Yu Yao
E. Atkins
Matthew Johnson-Roberson
Ram Vasudevan
Xiaoxiao Du
23
33
0
10 May 2021
PLSM: A Parallelized Liquid State Machine for Unintentional Action Detection
Dipayan Das
Saumik Bhattacharya
Umapada Pal
S. Chanda
26
8
0
06 May 2021
Motion-Augmented Self-Training for Video Recognition at Smaller Scale
Kirill Gavrilyuk
Mihir Jain
I. Karmanov
Cees G. M. Snoek
18
21
0
04 May 2021
FedProto: Federated Prototype Learning across Heterogeneous Clients
Yue Tan
Guodong Long
Lu Liu
Tianyi Zhou
Qinghua Lu
Jing Jiang
Chengqi Zhang
FedML
170
460
0
01 May 2021
Unsupervised Discriminative Embedding for Sub-Action Learning in Complex Activities
S. Swetha
Hilde Kuehne
Yogesh S Rawat
M. Shah
29
16
0
30 Apr 2021
Learning Multi-Granular Hypergraphs for Video-Based Person Re-Identification
Yichao Yan
Jie Qin
Jiaxin Chen
Li Liu
Fan Zhu
Ying Tai
Ling Shao
25
130
0
30 Apr 2021
BiCnet-TKS: Learning Efficient Spatial-Temporal Representation for Video Person Re-Identification
Rui Hou
Hong Chang
Bingpeng Ma
Rui Huang
Shiguang Shan
32
85
0
30 Apr 2021
CoCon: Cooperative-Contrastive Learning
Nishant Rai
Ehsan Adeli
Kuan-Hui Lee
Adrien Gaidon
Juan Carlos Niebles
SSL
20
18
0
30 Apr 2021
High-Resolution Optical Flow from 1D Attention and Correlation
Haofei Xu
Jiaolong Yang
Jianfei Cai
Juyong Zhang
Xin Tong
81
76
0
28 Apr 2021
Revisiting Skeleton-based Action Recognition
Haodong Duan
Yue Zhao
Kai-xiang Chen
Dahua Lin
Bo Dai
3DH
37
486
0
28 Apr 2021
Three-stream network for enriched Action Recognition
Ivaxi Sheth
27
4
0
27 Apr 2021
Modeling long-term interactions to enhance action recognition
Alejandro Cartas
Petia Radeva
Mariella Dimiccoli
EgoV
27
6
0
23 Apr 2021
Low Pass Filter for Anti-aliasing in Temporal Action Localization
Cece Jin
Yuanqi Chen
Ge Li
Tao Zhang
Thomas H. Li
19
1
0
23 Apr 2021
Multiscale Vision Transformers
Haoqi Fan
Bo Xiong
K. Mangalam
Yanghao Li
Zhicheng Yan
Jitendra Malik
Christoph Feichtenhofer
ViT
63
1,226
0
22 Apr 2021
H2O: Two Hands Manipulating Objects for First Person Interaction Recognition
Taein Kwon
Bugra Tekin
Jan Stühmer
Federica Bogo
Marc Pollefeys
EgoV
37
169
0
22 Apr 2021
Distilling Audio-Visual Knowledge by Compositional Contrastive Learning
Yanbei Chen
Yongqin Xian
A. Sophia Koepke
Ying Shan
Zeynep Akata
82
82
0
22 Apr 2021
MGSampler: An Explainable Sampling Strategy for Video Action Recognition
Yuan Zhi
Zhan Tong
Limin Wang
Gangshan Wu
TTA
19
72
0
20 Apr 2021
HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition
Zejia Weng
Zuxuan Wu
Hengduo Li
Jingjing Chen
Yu-Gang Jiang
37
4
0
20 Apr 2021
Temporal Query Networks for Fine-grained Video Understanding
Chuhan Zhang
Ankush Gupta
Andrew Zisserman
24
83
0
19 Apr 2021
What can human minimal videos tell us about dynamic recognition models?
Guy Ben-Yosef
Gabriel Kreiman
S. Ullman
24
2
0
19 Apr 2021
BM-NAS: Bilevel Multimodal Neural Architecture Search
Yihang Yin
Siyu Huang
Xiang Zhang
37
27
0
19 Apr 2021
Higher Order Recurrent Space-Time Transformer for Video Action Prediction
Tsung-Ming Tai
G. Fiameni
Cheng-Kuang Lee
Oswald Lanz
41
9
0
17 Apr 2021
Visually Guided Sound Source Separation and Localization using Self-Supervised Motion Representations
Lingyu Zhu
Esa Rahtu
29
25
0
17 Apr 2021
Temporally smooth online action detection using cycle-consistent future anticipation
Young Hwi Kim
Seonghyeon Nam
Seon Joo Kim
OffRL
19
28
0
16 Apr 2021
Adaptive Intermediate Representations for Video Understanding
Juhana Kangaspunta
A. Piergiovanni
Rico Jonschkowski
Michael S. Ryoo
A. Angelova
26
3
0
14 Apr 2021
Weakly But Deeply Supervised Occlusion-Reasoned Parametric Road Layouts
Buyu Liu
Bingbing Zhuang
Manmohan Chandraker
29
4
0
14 Apr 2021
ADNet: Temporal Anomaly Detection in Surveillance Videos
H. Öztürk
Ahmet Burak Can
27
15
0
14 Apr 2021
Learning Log-Determinant Divergences for Positive Definite Matrices
A. Cherian
P. Stanitsas
Jue Wang
Mehrtash Harandi
V. Morellas
Nikolaos Papanikolopoulos
15
4
0
13 Apr 2021
Temporal Consistency Two-Stream CNN for Human Motion Prediction
Jin Tang
Jin Zhang
Jianqin Yin
3DH
32
17
0
11 Apr 2021
Object Priors for Classifying and Localizing Unseen Actions
Pascal Mettes
William Thong
Cees G. M. Snoek
32
20
0
10 Apr 2021
Warp Consistency for Unsupervised Learning of Dense Correspondences
Prune Truong
Martin Danelljan
Feng Yu
Luc Van Gool
30
45
0
07 Apr 2021
Self-Supervised Learning for Semi-Supervised Temporal Action Proposal
Xiang Wang
Shiwei Zhang
Zhiwu Qing
Yuanjie Shao
Changxin Gao
Nong Sang
33
68
0
07 Apr 2021
ACM-Net: Action Context Modeling Network for Weakly-Supervised Temporal Action Localization
Sanqing Qu
Guang Chen
Zhijun Li
Lijun Zhang
Fan Lu
Alois C. Knoll
17
54
0
07 Apr 2021
Multimodal Object Detection via Probabilistic Ensembling
Yi-Ting Chen
Jing Shi
Zelin Ye
Christoph Mertz
Deva Ramanan
Shu Kong
18
104
0
07 Apr 2021
The Multi-Agent Behavior Dataset: Mouse Dyadic Social Interactions
Jennifer J. Sun
Tomomi Karigo
Dipam Chakraborty
Sharada Mohanty
Benjamin Wild
...
Chen Chen
D. Anderson
Pietro Perona
Yisong Yue
Ann Kennedy
39
48
0
06 Apr 2021
Zeus: Efficiently Localizing Actions in Videos using Reinforcement Learning
Pramod Chunduri
J. Bang
Yao Lu
Joy Arulraj
29
11
0
06 Apr 2021
A Video Is Worth Three Views: Trigeminal Transformers for Video-based Person Re-identification
Xuehu Liu
Pingping Zhang
Chenyang Yu
Huchuan Lu
Xuesheng Qian
Xiaoyun Yang
ViT
38
44
0
05 Apr 2021
Beyond Short Clips: End-to-End Video-Level Learning with Collaborative Memories
Xitong Yang
Haoqi Fan
Lorenzo Torresani
L. Davis
Heng Wang
VLM
27
20
0
02 Apr 2021
Self-supervised Video Representation Learning by Context and Motion Decoupling
Lianghua Huang
Yu Liu
Bin Wang
Pan Pan
Yinghui Xu
Rong Jin
SSL
49
51
0
02 Apr 2021
Multiview Pseudo-Labeling for Semi-supervised Learning from Video
Bo Xiong
Haoqi Fan
Kristen Grauman
Christoph Feichtenhofer
SSL
32
49
0
01 Apr 2021
Self-supervised Motion Learning from Static Images
Ziyuan Huang
Shiwei Zhang
Jianwen Jiang
Mingqian Tang
Rong Jin
M. Ang
SSL
26
29
0
01 Apr 2021
DCVNet: Dilated Cost Volume Networks for Fast Optical Flow
Huaizu Jiang
Erik Learned-Miller
3DPC
36
5
0
31 Mar 2021
Embracing Uncertainty: Decoupling and De-bias for Robust Temporal Grounding
Hao Zhou
Chongyang Zhang
Yan Luo
Yanjun Chen
Chuanping Hu
18
52
0
31 Mar 2021
Learning Representational Invariances for Data-Efficient Action Recognition
Yuliang Zou
Jinwoo Choi
Qitong Wang
Jia-Bin Huang
22
40
0
30 Mar 2021
Broaden Your Views for Self-Supervised Video Learning
Adrià Recasens
Pauline Luc
Jean-Baptiste Alayrac
Luyu Wang
Ross Hemsley
...
Florent Altché
M. Valko
Jean-Bastien Grill
Aaron van den Oord
Andrew Zisserman
SSL
AI4TS
35
127
0
30 Mar 2021
Spatiotemporal Transformer for Video-based Person Re-identification
Tianyu Zhang
Longhui Wei
Lingxi Xie
Zijie Zhuang
Yongfei Zhang
Yue Liu
Qi Tian
ViT
34
32
0
30 Mar 2021
Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation
Shuning Chang
Pichao Wang
F. Wang
Hao Li
Jiashi Feng
ViT
52
41
0
30 Mar 2021
Previous
1
2
3
...
15
16
17
...
44
45
46
Next