1607.01794
Cited By
VideoLSTM Convolves, Attends and Flows for Action Recognition
6 July 2016
Zhenyang Li
E. Gavves
Mihir Jain
Cees G. M. Snoek
Papers citing
"VideoLSTM Convolves, Attends and Flows for Action Recognition"
50 / 158 papers shown
Temporal Query Networks for Fine-grained Video Understanding
Chuhan Zhang
Ankush Gupta
Andrew Zisserman
16
82
0
19 Apr 2021
Higher Order Recurrent Space-Time Transformer for Video Action Prediction
Tsung-Ming Tai
G. Fiameni
Cheng-Kuang Lee
O. Lanz
22
9
0
17 Apr 2021
Unsupervised Sound Localization via Iterative Contrastive Learning
Yan-Bo Lin
Hung-Yu Tseng
Hsin-Ying Lee
Yen-Yu Lin
Ming-Hsuan Yang
SSL
19
34
0
01 Apr 2021
Attention, please! A survey of Neural Attention Models in Deep Learning
Alana de Santana Correia
Esther Luna Colombini
HAI
21
175
0
31 Mar 2021
MoViNets: Mobile Video Networks for Efficient Video Recognition
Dan Kondratyuk
Liangzhe Yuan
Yandong Li
Li Zhang
Mingxing Tan
Matthew A. Brown
Boqing Gong
13
228
0
21 Mar 2021
PGT: A Progressive Method for Training Models on Long Videos
Bo Pang
Gao Peng
Yizhuo Li
Cewu Lu
VLM
19
12
0
21 Mar 2021
Crop mapping from image time series: deep learning with multi-scale label hierarchies
Mehmet Özgür Türkoglu
Stefano D'Aronco
Gregor Perich
F. Liebisch
Constantin Streit
Konrad Schindler
Jan Dirk Wegner
87
129
0
17 Feb 2021
Learning to Recognize Actions on Objects in Egocentric Video with Attention Dictionaries
Swathikiran Sudhakaran
Sergio Escalera
O. Lanz
EgoV
25
15
0
16 Feb 2021
2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition
Hengduo Li
Zuxuan Wu
Abhinav Shrivastava
L. Davis
27
35
0
29 Dec 2020
Human Action Recognition from Various Data Modalities: A Review
Zehua Sun
Qiuhong Ke
Hossein Rahmani
Mohammed Bennamoun
Gang Wang
Jun Liu
MU
37
502
0
22 Dec 2020
NUTA: Non-uniform Temporal Aggregation for Action Recognition
Xinyu Li
Chunhui Liu
Bing Shuai
Yi Zhu
Hao Chen
Joseph Tighe
ViT
14
16
0
15 Dec 2020
A Comprehensive Study of Deep Video Action Recognition
Yi Zhu
Xinyu Li
Chunhui Liu
Mohammadreza Zolfaghari
Yuanjun Xiong
Chongruo Wu
Zhi-Li Zhang
Joseph Tighe
R. Manmatha
Mu Li
VLM
AI4TS
30
184
0
11 Dec 2020
A Grid-based Representation for Human Action Recognition
Soufiane Lamghari
Guillaume-Alexandre Bilodeau
Nicolas Saunier
23
3
0
17 Oct 2020
Global-local Enhancement Network for NMFs-aware Sign Language Recognition
Hezhen Hu
Wen-gang Zhou
Junfu Pu
Houqiang Li
SLR
14
51
0
24 Aug 2020
CFAD: Coarse-to-Fine Action Detector for Spatiotemporal Action Localization
Yuxi Li
Weiyao Lin
John See
N. Xu
Shugong Xu
Ke Yan
Cong Yang
56
17
0
19 Aug 2020
Late Temporal Modeling in 3D CNN Architectures with BERT for Action Recognition
M. E. Kalfaoglu
Sinan Kalkan
Aydin Alatan
3DPC
28
140
0
03 Aug 2020
Approximated Bilinear Modules for Temporal Modeling
Xinqi Zhu
Chang Xu
Langwen Hui
Cewu Lu
Dacheng Tao
17
23
0
25 Jul 2020
Leveraging Bottom-Up and Top-Down Attention for Few-Shot Object Detection
Xianyu Chen
Ming Jiang
Qi Zhao
ObjD
8
12
0
23 Jul 2020
Directional Temporal Modeling for Action Recognition
Xinyu Li
Bing Shuai
Joseph Tighe
6
41
0
21 Jul 2020
Region-based Non-local Operation for Video Classification
Guoxi Huang
A. Bors
14
11
0
17 Jul 2020
Universal-to-Specific Framework for Complex Action Recognition
Peisen Zhao
Lingxi Xie
Ya-Qin Zhang
Qi Tian
14
9
0
13 Jul 2020
Joint Learning of Social Groups, Individuals Action and Sub-group Activities in Videos
Mahsa Ehsanpour
Alireza Abedin
F. Saleh
Javen Qinfeng Shi
Ian Reid
Hamid Rezatofighi
29
71
0
06 Jul 2020
X3D: Expanding Architectures for Efficient Video Recognition
Christoph Feichtenhofer
66
998
0
09 Apr 2020
Knowing What, Where and When to Look: Efficient Video Action Modeling with Attention
Juan-Manuel Perez-Rua
Brais Martínez
Xiatian Zhu
Antoine Toisoul
Victor Escorcia
Tao Xiang
37
19
0
02 Apr 2020
Temporal Accumulative Features for Sign Language Recognition
A. Kındıroglu
Ogulcan Özdemir
L. Akarun
SLR
6
18
0
02 Apr 2020
Actor-Transformers for Group Activity Recognition
Kirill Gavrilyuk
Ryan Sanford
Mehrsan Javan
Cees G. M. Snoek
ViT
19
178
0
28 Mar 2020
Action Localization through Continual Predictive Learning
Sathyanarayanan N. Aakur
Sudeep Sarkar
6
12
0
26 Mar 2020
PIC: Permutation Invariant Convolution for Recognizing Long-range Activities
Noureldien Hussein
E. Gavves
A. Smeulders
VLM
18
13
0
18 Mar 2020
Interpreting video features: a comparison of 3D convolutional networks and convolutional LSTM networks
Joonatan Mänttäri
Sofia Broomé
John Folkesson
Hedvig Kjellström
FAtt
14
27
0
02 Feb 2020
Gate-Shift Networks for Video Action Recognition
Swathikiran Sudhakaran
Sergio Escalera
O. Lanz
3DPC
6
155
0
01 Dec 2019
Gating Revisited: Deep Multi-layer RNNs That Can Be Trained
Mehmet Özgür Türkoglu
Stefano D'Aronco
Jan Dirk Wegner
Konrad Schindler
8
47
0
25 Nov 2019
Weakly-Supervised Completion Moment Detection using Temporal Attention
Farnoosh Heidarivincheh
Majid Mirmehdi
Dima Damen
16
9
0
22 Oct 2019
Multitask Learning to Improve Egocentric Action Recognition
G. Kapidis
R. Poppe
E. V. Dam
L. Noldus
R. Veltkamp
EgoV
16
36
0
15 Sep 2019
Multi-Grained Spatio-temporal Modeling for Lip-reading
Chenhao Wang
11
51
0
30 Aug 2019
Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based Mechanism for Videos
Sebastian Agethen
Winston H. Hsu
HAI
17
25
0
30 Jul 2019
Domain-Specific Priors and Meta Learning for Few-Shot First-Person Action Recognition
Huseyin Coskun
Zeeshan Zia
Bugra Tekin
Federica Bogo
Nassir Navab
Federico Tombari
H. Sawhney
17
27
0
22 Jul 2019
TACNet: Transition-Aware Context Network for Spatio-Temporal Action Detection
Lin Song
Shiwei Zhang
Gang Yu
Hongbin Sun
14
82
0
31 May 2019
Exploring Temporal Information for Improved Video Understanding
Yi Zhu
21
0
0
25 May 2019
VideoGraph: Recognizing Minutes-Long Human Activities in Videos
Noureldien Hussein
E. Gavves
A. Smeulders
14
77
0
13 May 2019
Frame-Recurrent Video Inpainting by Robust Optical Flow Inference
Yifan Ding
Chuan Wang
Haibin Huang
Jiaming Liu
Jue Wang
Liqiang Wang
14
12
0
08 May 2019
DeepSignals: Predicting Intent of Drivers Through Visual Signals
Davi Frossard
Eric Kee
R. Urtasun
ViT
7
17
0
03 May 2019
Memory-Augmented Temporal Dynamic Learning for Action Recognition
Yuan Yuan
Dong Wang
Qi Wang
22
13
0
30 Apr 2019
Cross-Modal Message Passing for Two-stream Fusion
Dong Wang
Yuan Yuan
Qi Wang
11
2
0
30 Apr 2019
FACLSTM: ConvLSTM with Focused Attention for Scene Text Recognition
Qingqing Wang
W. Jia
Xiangjian He
Yue Lu
Michael Blumenstein
Ye Huang
19
41
0
20 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog
Idan Schwartz
A. Schwing
Tamir Hazan
19
69
0
11 Apr 2019
Convolutional Relational Machine for Group Activity Recognition
Sina Mokhtarzadeh Azar
Mina Ghadimi Atigh
A. Nickabadi
Alexandre Alahi
BDL
9
105
0
05 Apr 2019
Attention Distillation for Learning Video Representations
Miao Liu
Xin Chen
Yun C. Zhang
Yin Li
James M. Rehg
16
2
0
05 Apr 2019
Dance with Flow: Two-in-One Stream Action Detection
Jiaojiao Zhao
Cees G. M. Snoek
12
83
0
01 Apr 2019
Counting with Focus for Free
Zenglin Shi
Pascal Mettes
Cees G. M. Snoek
3DV
3DPC
9
106
0
28 Mar 2019
Self-supervised Visual Feature Learning with Deep Neural Networks: A Survey
Longlong Jing
Yingli Tian
SSL
20
1,686
0
16 Feb 2019