ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1607.01794
  4. Cited By
VideoLSTM Convolves, Attends and Flows for Action Recognition

VideoLSTM Convolves, Attends and Flows for Action Recognition

6 July 2016
Zhenyang Li
E. Gavves
Mihir Jain
Cees G. M. Snoek
ArXivPDFHTML

Papers citing "VideoLSTM Convolves, Attends and Flows for Action Recognition"

50 / 158 papers shown
Title
Temporal Query Networks for Fine-grained Video Understanding
Temporal Query Networks for Fine-grained Video Understanding
Chuhan Zhang
Ankush Gupta
Andrew Zisserman
16
82
0
19 Apr 2021
Higher Order Recurrent Space-Time Transformer for Video Action
  Prediction
Higher Order Recurrent Space-Time Transformer for Video Action Prediction
Tsung-Ming Tai
G. Fiameni
Cheng-Kuang Lee
O. Lanz
22
9
0
17 Apr 2021
Unsupervised Sound Localization via Iterative Contrastive Learning
Unsupervised Sound Localization via Iterative Contrastive Learning
Yan-Bo Lin
Hung-Yu Tseng
Hsin-Ying Lee
Yen-Yu Lin
Ming-Hsuan Yang
SSL
19
34
0
01 Apr 2021
Attention, please! A survey of Neural Attention Models in Deep Learning
Attention, please! A survey of Neural Attention Models in Deep Learning
Alana de Santana Correia
Esther Luna Colombini
HAI
21
175
0
31 Mar 2021
MoViNets: Mobile Video Networks for Efficient Video Recognition
MoViNets: Mobile Video Networks for Efficient Video Recognition
Dan Kondratyuk
Liangzhe Yuan
Yandong Li
Li Zhang
Mingxing Tan
Matthew A. Brown
Boqing Gong
13
228
0
21 Mar 2021
PGT: A Progressive Method for Training Models on Long Videos
PGT: A Progressive Method for Training Models on Long Videos
Bo Pang
Gao Peng
Yizhuo Li
Cewu Lu
VLM
19
12
0
21 Mar 2021
Crop mapping from image time series: deep learning with multi-scale
  label hierarchies
Crop mapping from image time series: deep learning with multi-scale label hierarchies
Mehmet Özgür Türkoglu
Stefano Dáronco
Gregor Perich
F. Liebisch
Constantin Streit
Konrad Schindler
Jan Dirk Wegner
87
129
0
17 Feb 2021
Learning to Recognize Actions on Objects in Egocentric Video with
  Attention Dictionaries
Learning to Recognize Actions on Objects in Egocentric Video with Attention Dictionaries
Swathikiran Sudhakaran
Sergio Escalera
O. Lanz
EgoV
25
15
0
16 Feb 2021
2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video
  Recognition
2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition
Hengduo Li
Zuxuan Wu
Abhinav Shrivastava
L. Davis
27
35
0
29 Dec 2020
Human Action Recognition from Various Data Modalities: A Review
Human Action Recognition from Various Data Modalities: A Review
Zehua Sun
Qiuhong Ke
Hossein Rahmani
Mohammed Bennamoun
Gang Wang
Jun Liu
MU
37
502
0
22 Dec 2020
NUTA: Non-uniform Temporal Aggregation for Action Recognition
NUTA: Non-uniform Temporal Aggregation for Action Recognition
Xinyu Li
Chunhui Liu
Bing Shuai
Yi Zhu
Hao Chen
Joseph Tighe
ViT
14
16
0
15 Dec 2020
A Comprehensive Study of Deep Video Action Recognition
A Comprehensive Study of Deep Video Action Recognition
Yi Zhu
Xinyu Li
Chunhui Liu
Mohammadreza Zolfaghari
Yuanjun Xiong
Chongruo Wu
Zhi-Li Zhang
Joseph Tighe
R. Manmatha
Mu Li
VLM
AI4TS
30
184
0
11 Dec 2020
A Grid-based Representation for Human Action Recognition
A Grid-based Representation for Human Action Recognition
Soufiane Lamghari
Guillaume-Alexandre Bilodeau
Nicolas Saunier
23
3
0
17 Oct 2020
Global-local Enhancement Network for NMFs-aware Sign Language
  Recognition
Global-local Enhancement Network for NMFs-aware Sign Language Recognition
Hezhen Hu
Wen-gang Zhou
Junfu Pu
Houqiang Li
SLR
14
51
0
24 Aug 2020
CFAD: Coarse-to-Fine Action Detector for Spatiotemporal Action
  Localization
CFAD: Coarse-to-Fine Action Detector for Spatiotemporal Action Localization
Yuxi Li
Weiyao Lin
John See
N. Xu
Shugong Xu
Ke Yan
Cong Yang
56
17
0
19 Aug 2020
Late Temporal Modeling in 3D CNN Architectures with BERT for Action
  Recognition
Late Temporal Modeling in 3D CNN Architectures with BERT for Action Recognition
M. E. Kalfaoglu
Sinan Kalkan
Aydin Alatan
3DPC
28
140
0
03 Aug 2020
Approximated Bilinear Modules for Temporal Modeling
Approximated Bilinear Modules for Temporal Modeling
Xinqi Zhu
Chang Xu
Langwen Hui
Cewu Lu
Dacheng Tao
17
23
0
25 Jul 2020
Leveraging Bottom-Up and Top-Down Attention for Few-Shot Object
  Detection
Leveraging Bottom-Up and Top-Down Attention for Few-Shot Object Detection
Xianyu Chen
Ming Jiang
Qi Zhao
ObjD
8
12
0
23 Jul 2020
Directional Temporal Modeling for Action Recognition
Directional Temporal Modeling for Action Recognition
Xinyu Li
Bing Shuai
Joseph Tighe
6
41
0
21 Jul 2020
Region-based Non-local Operation for Video Classification
Region-based Non-local Operation for Video Classification
Guoxi Huang
A. Bors
14
11
0
17 Jul 2020
Universal-to-Specific Framework for Complex Action Recognition
Universal-to-Specific Framework for Complex Action Recognition
Peisen Zhao
Lingxi Xie
Ya-Qin Zhang
Qi Tian
14
9
0
13 Jul 2020
Joint Learning of Social Groups, Individuals Action and Sub-group
  Activities in Videos
Joint Learning of Social Groups, Individuals Action and Sub-group Activities in Videos
Mahsa Ehsanpour
Alireza Abedin
F. Saleh
Javen Qinfeng Shi
Ian Reid
Hamid Rezatofighi
29
71
0
06 Jul 2020
X3D: Expanding Architectures for Efficient Video Recognition
X3D: Expanding Architectures for Efficient Video Recognition
Christoph Feichtenhofer
66
998
0
09 Apr 2020
Knowing What, Where and When to Look: Efficient Video Action Modeling
  with Attention
Knowing What, Where and When to Look: Efficient Video Action Modeling with Attention
Juan-Manuel Perez-Rua
Brais Martínez
Xiatian Zhu
Antoine Toisoul
Victor Escorcia
Tao Xiang
37
19
0
02 Apr 2020
Temporal Accumulative Features for Sign Language Recognition
Temporal Accumulative Features for Sign Language Recognition
A. Kındıroglu
Ogulcan Özdemir
L. Akarun
SLR
6
18
0
02 Apr 2020
Actor-Transformers for Group Activity Recognition
Actor-Transformers for Group Activity Recognition
Kirill Gavrilyuk
Ryan Sanford
Mehrsan Javan
Cees G. M. Snoek
ViT
19
178
0
28 Mar 2020
Action Localization through Continual Predictive Learning
Action Localization through Continual Predictive Learning
Sathyanarayanan N. Aakur
Sudeep Sarkar
6
12
0
26 Mar 2020
PIC: Permutation Invariant Convolution for Recognizing Long-range
  Activities
PIC: Permutation Invariant Convolution for Recognizing Long-range Activities
Noureldien Hussein
E. Gavves
A. Smeulders
VLM
18
13
0
18 Mar 2020
Interpreting video features: a comparison of 3D convolutional networks
  and convolutional LSTM networks
Interpreting video features: a comparison of 3D convolutional networks and convolutional LSTM networks
Joonatan Mänttäri
Sofia Broomé
John Folkesson
Hedvig Kjellström
FAtt
14
27
0
02 Feb 2020
Gate-Shift Networks for Video Action Recognition
Gate-Shift Networks for Video Action Recognition
Swathikiran Sudhakaran
Sergio Escalera
O. Lanz
3DPC
6
155
0
01 Dec 2019
Gating Revisited: Deep Multi-layer RNNs That Can Be Trained
Gating Revisited: Deep Multi-layer RNNs That Can Be Trained
Mehmet Özgür Türkoglu
Stefano Dáronco
Jan Dirk Wegner
Konrad Schindler
8
47
0
25 Nov 2019
Weakly-Supervised Completion Moment Detection using Temporal Attention
Weakly-Supervised Completion Moment Detection using Temporal Attention
Farnoosh Heidarivincheh
Majid Mirmehdi
Dima Damen
16
9
0
22 Oct 2019
Multitask Learning to Improve Egocentric Action Recognition
Multitask Learning to Improve Egocentric Action Recognition
G. Kapidis
R. Poppe
E. V. Dam
L. Noldus
R. Veltkamp
EgoV
16
36
0
15 Sep 2019
Multi-Grained Spatio-temporal Modeling for Lip-reading
Multi-Grained Spatio-temporal Modeling for Lip-reading
Chenhao Wang
11
51
0
30 Aug 2019
Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based
  Mechanism for Videos
Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based Mechanism for Videos
Sebastian Agethen
Winston H. Hsu
HAI
17
25
0
30 Jul 2019
Domain-Specific Priors and Meta Learning for Few-Shot First-Person
  Action Recognition
Domain-Specific Priors and Meta Learning for Few-Shot First-Person Action Recognition
Huseyin Coskun
Zeeshan Zia
Bugra Tekin
Federica Bogo
Nassir Navab
Federico Tombari
H. Sawhney
17
27
0
22 Jul 2019
TACNet: Transition-Aware Context Network for Spatio-Temporal Action
  Detection
TACNet: Transition-Aware Context Network for Spatio-Temporal Action Detection
Lin Song
Shiwei Zhang
Gang Yu
Hongbin Sun
14
82
0
31 May 2019
Exploring Temporal Information for Improved Video Understanding
Exploring Temporal Information for Improved Video Understanding
Yi Zhu
21
0
0
25 May 2019
VideoGraph: Recognizing Minutes-Long Human Activities in Videos
VideoGraph: Recognizing Minutes-Long Human Activities in Videos
Noureldien Hussein
E. Gavves
A. Smeulders
14
77
0
13 May 2019
Frame-Recurrent Video Inpainting by Robust Optical Flow Inference
Frame-Recurrent Video Inpainting by Robust Optical Flow Inference
Yifan Ding
Chuan Wang
Haibin Huang
Jiaming Liu
Jue Wang
Liqiang Wang
14
12
0
08 May 2019
DeepSignals: Predicting Intent of Drivers Through Visual Signals
DeepSignals: Predicting Intent of Drivers Through Visual Signals
Davi Frossard
Eric Kee
R. Urtasun
ViT
7
17
0
03 May 2019
Memory-Augmented Temporal Dynamic Learning for Action Recognition
Memory-Augmented Temporal Dynamic Learning for Action Recognition
Yuan. Yuan
Dong Wang
Qi. Wang
22
13
0
30 Apr 2019
Cross-Modal Message Passing for Two-stream Fusion
Cross-Modal Message Passing for Two-stream Fusion
Dong Wang
Yuan. Yuan
Qi. Wang
11
2
0
30 Apr 2019
FACLSTM: ConvLSTM with Focused Attention for Scene Text Recognition
FACLSTM: ConvLSTM with Focused Attention for Scene Text Recognition
Qingqing Wang
W. Jia
Xiangjian He
Yue Lu
Michael Blumenstein
Ye Huang
19
41
0
20 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog
A Simple Baseline for Audio-Visual Scene-Aware Dialog
Idan Schwartz
A. Schwing
Tamir Hazan
19
69
0
11 Apr 2019
Convolutional Relational Machine for Group Activity Recognition
Convolutional Relational Machine for Group Activity Recognition
Sina Mokhtarzadeh Azar
Mina Ghadimi Atigh
A. Nickabadi
Alexandre Alahi
BDL
9
105
0
05 Apr 2019
Attention Distillation for Learning Video Representations
Attention Distillation for Learning Video Representations
Miao Liu
Xin Chen
Yun C. Zhang
Yin Li
James M. Rehg
16
2
0
05 Apr 2019
Dance with Flow: Two-in-One Stream Action Detection
Dance with Flow: Two-in-One Stream Action Detection
Jiaojiao Zhao
Cees G. M. Snoek
12
83
0
01 Apr 2019
Counting with Focus for Free
Counting with Focus for Free
Zenglin Shi
Pascal Mettes
Cees G. M. Snoek
3DV
3DPC
9
106
0
28 Mar 2019
Self-supervised Visual Feature Learning with Deep Neural Networks: A
  Survey
Self-supervised Visual Feature Learning with Deep Neural Networks: A Survey
Longlong Jing
Yingli Tian
SSL
20
1,686
0
16 Feb 2019
Previous
1234
Next