ResearchTrend.AI
VideoLSTM Convolves, Attends and Flows for Action Recognition

6 July 2016
Zhenyang Li
E. Gavves
Mihir Jain
Cees G. M. Snoek

Papers citing "VideoLSTM Convolves, Attends and Flows for Action Recognition"

50 / 158 papers shown
Advancing Video Anomaly Detection: A Bi-Directional Hybrid Framework for Enhanced Single- and Multi-Task Approaches
Guodong Shen
Yuqi Ouyang
Junru Lu
Yixuan Yang
Victor Sanchez
33
1
0
20 Apr 2025
Enhancing Action Recognition by Leveraging the Hierarchical Structure of Actions and Textual Context
Manuel Benavent-Lledo
David Mulero-Pérez
David Ortiz-Perez
José García Rodríguez
Antonis Argyros
24
0
0
28 Oct 2024
RNNs, CNNs and Transformers in Human Action Recognition: A Survey and a Hybrid Model
Khaled Alomar
Halil Ibrahim Aysel
Xiaohao Cai
MedIm
ViT
35
7
0
02 Jun 2024
Deep video representation learning: a survey
Elham Ravanbakhsh
Yongqing Liang
J. Ramanujam
Xin Li
46
3
0
10 May 2024
Cutup and Detect: Human Fall Detection on Cutup Untrimmed Videos Using a Large Foundational Video Understanding Model
Till Grutschus
Ola Karrar
Emir Esenov
Ekta Vats
18
0
0
29 Jan 2024
A multimodal gesture recognition dataset for desktop human-computer interaction
Qi Wang
Fengchao Zhu
Guangming Zhu
Liang Zhang
Ning Li
Eryang Gao
24
0
0
08 Jan 2024
Early Action Recognition with Action Prototypes
G. Camporese
Alessandro Bergamo
Xunyu Lin
Joseph Tighe
Davide Modolo
EgoV
16
0
0
11 Dec 2023
Semantic-Aware Frame-Event Fusion based Pattern Recognition via Large Vision-Language Models
Dong Li
Jiandong Jin
Yuhao Zhang
Yanlin Zhong
Yaoyang Wu
Lan Chen
Xiao Wang
Bin Luo
63
5
0
30 Nov 2023
Learning Scene Flow With Skeleton Guidance For 3D Action Recognition
Vasileios Magoulianitis
A. Psaltis
3DH
3DPC
19
0
0
23 Jun 2023
Group Activity Recognition via Dynamic Composition and Interaction
Youliang Zhang
Zhuo Zhou
Wenxuan Liu
Danni Xu
Zheng Wang
25
0
0
09 May 2023
Optical Flow Estimation in 360° Videos: Dataset, Model and Application
Bin Duan
Keshav Bhandari
Gaowen Liu
Yan Yan
19
0
0
27 Jan 2023
Transformers in Action Recognition: A Review on Temporal Modeling
Elham Shabaninia
Hossein Nezamabadi-pour
Fatemeh Shafizadegan
ViT
21
8
0
29 Dec 2022
A Survey on Human Action Recognition
Zhou Shuchang
26
0
0
20 Dec 2022
Inductive Attention for Video Action Anticipation
Tsung-Ming Tai
G. Fiameni
Cheng-Kuang Lee
Simon See
O. Lanz
31
1
0
17 Dec 2022
Re^2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization
Chen Zhao
Shuming Liu
K. Mangalam
Bernard Ghanem
19
17
0
25 Nov 2022
Evaluating the Faithfulness of Saliency-based Explanations for Deep Learning Models for Temporal Colour Constancy
Matteo Rizzo
Cristina Conati
Daesik Jang
Hui Hu
FAtt
13
2
0
15 Nov 2022
SWTF: Sparse Weighted Temporal Fusion for Drone-Based Activity Recognition
Santosh Kumar Yadav
Esha Pahwa
Achleshwar Luthra
K. Tiwari
Hari Mohan Pandey
Peter Corcoran
15
4
0
10 Nov 2022
TAMFormer: Multi-Modal Transformer with Learned Attention Mask for Early Intent Prediction
Nada Osman
Guglielmo Camporese
Lamberto Ballan
23
8
0
26 Oct 2022
Linear Video Transformer with Feature Fixation
Kaiyue Lu
Zexia Liu
Jianyuan Wang
Weixuan Sun
Zhen Qin
...
Xuyang Shen
Huizhong Deng
Xiaodong Han
Yuchao Dai
Yiran Zhong
30
4
0
15 Oct 2022
Compressed Vision for Efficient Video Understanding
Olivia Wiles
João Carreira
Iain Barr
Andrew Zisserman
Mateusz Malinowski
14
7
0
06 Oct 2022
Rethinking Resolution in the Context of Efficient Video Recognition
Chuofan Ma
Qiushan Guo
Yi-Xin Jiang
Zehuan Yuan
Ping Luo
Xiaojuan Qi
60
12
0
26 Sep 2022
On the Surprising Effectiveness of Transformers in Low-Labeled Video Recognition
Farrukh Rahman
Ömer Mubarek
Z. Kira
ViT
10
2
0
15 Sep 2022
Attentive pooling for Group Activity Recognition
Ding Li
Yuan Xie
Wensheng Zhang
Yongqiang Tang
Zhizhong Zhang
15
0
0
31 Aug 2022
Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling
Rui Wang
Zuxuan Wu
Dongdong Chen
Yinpeng Chen
Xiyang Dai
Mengchen Liu
Luowei Zhou
Lu Yuan
Yu-Gang Jiang
ViT
35
4
0
25 Aug 2022
Hierarchical Compositional Representations for Few-shot Action Recognition
Chang-bo Li
Jie M. Zhang
Shuzhe Wu
Xin Jin
Shiguang Shan
22
20
0
19 Aug 2022
ViT-ReT: Vision and Recurrent Transformer Neural Networks for Human Activity Recognition in Videos
James Wensel
Hayat Ullah
Arslan Munir
ViT
16
42
0
16 Aug 2022
Human Activity Recognition Using Cascaded Dual Attention CNN and Bi-Directional GRU Framework
Hayat Ullah
Arslan Munir
HAI
19
27
0
09 Aug 2022
Robotic Detection of a Human-Comprehensible Gestural Language for Underwater Multi-Human-Robot Collaboration
Sadman Sakib Enan
Michael Fulton
Junaed Sattar
31
8
0
12 Jul 2022
Video Anomaly Detection via Prediction Network with Enhanced Spatio-Temporal Memory Exchange
Guodong Shen
Yuqi Ouyang
Victor Sanchez
AI4TS
19
7
0
26 Jun 2022
Hierarchical Self-supervised Representation Learning for Movie Understanding
Fanyi Xiao
Kaustav Kundu
Joseph Tighe
Davide Modolo
SSL
37
24
0
06 Apr 2022
Gate-Shift-Fuse for Video Action Recognition
Swathikiran Sudhakaran
Sergio Escalera
O. Lanz
20
22
0
16 Mar 2022
MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
Chao-Yuan Wu
Yanghao Li
K. Mangalam
Haoqi Fan
Bo Xiong
Jitendra Malik
Christoph Feichtenhofer
ViT
37
198
0
20 Jan 2022
OCSampler: Compressing Videos to One Clip with Single-step Sampling
Jintao Lin
Haodong Duan
Kai-xiang Chen
Dahua Lin
Limin Wang
32
24
0
12 Jan 2022
maskGRU: Tracking Small Objects in the Presence of Large Background Motions
Constantine J. Roros
A. Kak
29
2
0
03 Jan 2022
E²(GO)MOTION: Motion Augmented Event Stream for Egocentric Action Recognition
Chiara Plizzari
M. Planamente
Gabriele Goletto
Marco Cannici
Emanuele Gusso
Matteo Matteucci
Barbara Caputo
EgoV
20
56
0
07 Dec 2021
MViTv2: Improved Multiscale Vision Transformers for Classification and Detection
Yanghao Li
Chaoxia Wu
Haoqi Fan
K. Mangalam
Bo Xiong
Jitendra Malik
Christoph Feichtenhofer
ViT
46
677
0
02 Dec 2021
Stacked Temporal Attention: Improving First-person Action Recognition by Emphasizing Discriminative Clips
Lijin Yang
Yifei Huang
Yusuke Sugano
Yoichi Sato
20
5
0
02 Dec 2021
Attention Mechanisms in Computer Vision: A Survey
Meng-Hao Guo
Tianhan Xu
Jiangjiang Liu
Zheng-Ning Liu
Peng-Tao Jiang
Tai-Jiang Mu
Song-Hai Zhang
Ralph Robert Martin
Ming-Ming Cheng
Shimin Hu
19
1,633
0
15 Nov 2021
Deep Learning-based Action Detection in Untrimmed Videos: A Survey
Elahe Vahdani
Yingli Tian
38
60
0
30 Sep 2021
TSM: Temporal Shift Module for Efficient and Scalable Video Understanding on Edge Device
Ji Lin
Chuang Gan
Kuan-Chieh Jackson Wang
Song Han
38
64
0
27 Sep 2021
Efficient Action Recognition Using Confidence Distillation
Shervin Manzuri Shalmani
Fei Chiang
Ronghuo Zheng
19
6
0
05 Sep 2021
Working Memory Connections for LSTM
Federico Landi
Lorenzo Baraldi
Marcella Cornia
Rita Cucchiara
KELM
13
156
0
31 Aug 2021
Delta Sampling R-BERT for limited data and low-light action recognition
Sanchit Hira
Ritwik Das
Abhinav Modi
D. Pakhomov
75
17
0
12 Jul 2021
VideoLightFormer: Lightweight Action Recognition using Transformers
Raivo Koot
Haiping Lu
ViT
14
6
0
01 Jul 2021
Gradient Forward-Propagation for Large-Scale Temporal Video Modelling
Mateusz Malinowski
Dimitrios Vytiniotis
G. Swirszcz
Viorica Patraucean
João Carreira
22
8
0
15 Jun 2021
Multi-level Attention Fusion Network for Audio-visual Event Recognition
Mathilde Brousmiche
Jean Rouat
Stéphane Dupont
11
11
0
12 Jun 2021
Finding a Needle in a Haystack: Tiny Flying Object Detection in 4K Videos using a Joint Detection-and-Tracking Approach
Ryota Yoshihashi
Rei Kawakami
Shaodi You
T. Trinh
M. Iida
T. Naemura
ObjD
VOT
17
3
0
18 May 2021
Actor-centered Representations for Action Localization in Streaming Videos
Sathyanarayanan N. Aakur
Sudeep Sarkar
24
3
0
29 Apr 2021
Multiscale Vision Transformers
Haoqi Fan
Bo Xiong
K. Mangalam
Yanghao Li
Zhicheng Yan
Jitendra Malik
Christoph Feichtenhofer
ViT
19
1,221
0
22 Apr 2021
HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition
Zejia Weng
Zuxuan Wu
Hengduo Li
Jingjing Chen
Yu-Gang Jiang
18
4
0
20 Apr 2021