Human Action Recognition from Various Data Modalities: A Review


22 December 2020
Zehua Sun
Qiuhong Ke
Hossein Rahmani
Mohammed Bennamoun
Gang Wang
Jun Liu
    MU

Papers citing "Human Action Recognition from Various Data Modalities: A Review"

40 / 40 papers shown
An LLM-Empowered Low-Resolution Vision System for On-Device Human Behavior Understanding
Siyang Jiang
Bufang Yang
Lilin Xu
Mu Yuan
Yeerzhati Abudunuer
...
Liekang Zeng
Hongkai Chen
Zhenyu Yan
Xiaofan Jiang
Guoliang Xing
VLM
37
0
0
03 May 2025
MultiSensor-Home: A Wide-area Multi-modal Multi-view Dataset for Action Recognition and Transformer-based Sensor Fusion
Trung Thanh Nguyen
Yasutomo Kawanishi
Vijay John
Takahiro Komamizu
Ichiro Ide
41
0
0
03 Apr 2025
TDSM: Triplet Diffusion for Skeleton-Text Matching in Zero-Shot Action Recognition
Jeonghyeok Do
Munchurl Kim
42
1
0
16 Nov 2024
Deep Learning for Video Anomaly Detection: A Review
Peng Wu
Chengyu Pan
Yuting Yan
Guansong Pang
Peng Wang
Yanning Zhang
VLM
AI4TS
22
6
0
09 Sep 2024
EPAM-Net: An Efficient Pose-driven Attention-guided Multimodal Network for Video Action Recognition
Ahmed Abdelkawy
Asem A. Ali
Aly A. Farag
3DPC
18
0
0
10 Aug 2024
A Comprehensive Review of Few-shot Action Recognition
Yuyang Wanyan
Xiaoshan Yang
Weiming Dong
Changsheng Xu
VLM
42
3
0
20 Jul 2024
Spatio-Temporal Encoding and Decoding-Based Method for Future Human Activity Skeleton Synthesis
Tingyu Liu
Jun Huang
Chenyi Weng
11
0
0
08 Jul 2024
NarrativeBridge: Enhancing Video Captioning with Causal-Temporal Narrative
Asmar Nadeem
Faegheh Sardari
R. Dawes
Syed Sameed Husain
Adrian Hilton
Armin Mustafa
42
4
0
10 Jun 2024
MuJo: Multimodal Joint Feature Space Learning for Human Activity Recognition
Stefan Gerd Fritsch
Cennet Oğuz
Vitor Fortes Rey
L. Ray
Maximilian Kiefer-Emmanouilidis
P. Lukowicz
HAI
32
0
0
06 Jun 2024
Networking Systems for Video Anomaly Detection: A Tutorial and Survey
Jing Liu
Yang Liu
Jieyu Lin
Jielin Li
Peng Sun
Bo Hu
Liang Song
Azzedine Boukerche
Victor C.M. Leung
39
10
0
16 May 2024
D-STGCNT: A Dense Spatio-Temporal Graph Conv-GRU Network based on transformer for assessment of patient physical rehabilitation
Youssef Mourchid
Rim Slama
MedIm
16
15
0
21 Dec 2023
ConFormer: A Novel Collection of Deep Learning Models to Assist Cardiologists in the Assessment of Cardiac Function
Ethan Thomas
Salman Aslam
MedIm
6
0
0
13 Dec 2023
Non-contact Multimodal Indoor Human Monitoring Systems: A Survey
L. Nguyen
Praneeth Susarla
Anirban Mukherjee
Manuel Lage Cañellas
Constantino Álvarez Casado
Xiaoting Wu
Olli Silvén
D. Jayagopi
Miguel Bordallo López
8
1
0
11 Dec 2023
Large Scale Foundation Models for Intelligent Manufacturing Applications: A Survey
Haotian Zhang
S. D. Semujju
Zhicheng Wang
Xianwei Lv
Kang Xu
...
Jing Wu
Zhuo Long
Wensheng Liang
Xiaoguang Ma
Ruiyan Zhuang
UQCV
AI4TS
AI4CE
20
4
0
11 Dec 2023
M$^3$Net: Multi-view Encoding, Matching, and Fusion for Few-shot Fine-grained Action Recognition
Hao Tang
Jun Liu
Shuanglin Yan
Rui Yan
Zechao Li
Jinhui Tang
6
36
0
06 Aug 2023
Deep Neural Networks in Video Human Action Recognition: A Review
Zihan Wang
Yang Yang
Zhi Liu
Y. Zheng
32
4
0
25 May 2023
Focalized Contrastive View-invariant Learning for Self-supervised Skeleton-based Action Recognition
Qianhui Men
Edmond S. L. Ho
Hubert P. H. Shum
Howard Leung
SSL
12
18
0
03 Apr 2023
SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos
S. H. Khorasgani
Yuxuan Chen
Florian Shkurti
SSL
27
22
0
25 Jun 2022
PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences
Hehe Fan
Xin Yu
Yuhang Ding
Yi Yang
Mohan S. Kankanhalli
3DPC
104
108
0
27 May 2022
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Kristen Grauman
Andrew Westbury
Eugene Byrne
Zachary Chavis
Antonino Furnari
...
Mike Zheng Shou
Antonio Torralba
Lorenzo Torresani
Mingfei Yan
Jitendra Malik
EgoV
212
682
0
13 Oct 2021
Adversarial Bone Length Attack on Action Recognition
Nariki Tanaka
Hiroshi Kera
K. Kawamoto
AAML
11
13
0
13 Sep 2021
STAR: Sparse Transformer-based Action Recognition
Feng Shi
Chonghan Lee
Liang Qiu
Yizhou Zhao
Tianyi Shen
Shivran Muralidhar
Tian Han
Song-Chun Zhu
V. Narayanan
ViT
10
21
0
15 Jul 2021
3D Human Action Representation Learning via Cross-View Consistency Pursuit
Linguo Li
Minsi Wang
Bingbing Ni
Hang Wang
Jiancheng Yang
Wenjun Zhang
108
154
0
29 Apr 2021
VidTr: Video Transformer Without Convolutions
Yanyi Zhang
Xinyu Li
Chunhui Liu
Bing Shuai
Yi Zhu
Biagio Brattoli
Hao Chen
I. Marsic
Joseph Tighe
ViT
116
178
0
23 Apr 2021
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
Hassan Akbari
Liangzhe Yuan
Rui Qian
Wei-Hong Chuang
Shih-Fu Chang
Yin Cui
Boqing Gong
ViT
229
573
0
22 Apr 2021
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
272
1,939
0
09 Feb 2021
Video Transformer Network
Daniel Neimark
Omri Bar
Maya Zohar
Dotan Asselmann
ViT
188
375
0
01 Feb 2021
Trear: Transformer-based RGB-D Egocentric Action Recognition
Xiangyu Li
Yonghong Hou
Pichao Wang
Zhimin Gao
Mingliang Xu
Wanqing Li
ViT
166
88
0
05 Jan 2021
Transformers in Vision: A Survey
Salman Khan
Muzammal Naseer
Munawar Hayat
Syed Waqas Zamir
F. Khan
M. Shah
ViT
216
2,404
0
04 Jan 2021
Temporal Binary Representation for Event-Based Action Recognition
Simone Undri Innocenti
Federico Becattini
F. Pernici
A. Bimbo
39
54
0
18 Oct 2020
Dynamic Multiscale Graph Neural Networks for 3D Skeleton-Based Human Motion Prediction
Maosen Li
Siheng Chen
Yangheng Zhao
Ya-Qin Zhang
Yanfeng Wang
Qi Tian
AI4CE
3DH
85
267
0
17 Mar 2020
Gimme Signals: Discriminative signal encoding for multimodal activity recognition
Raphael Memmesheimer
Nick Theisen
Dietrich Paulus
38
50
0
13 Mar 2020
A Survey on 3D Skeleton-Based Action Recognition Using Learning Method
Bin Ren
Mengyuan Liu
Runwei Ding
Hong Liu
16
118
0
14 Feb 2020
Audiovisual SlowFast Networks for Video Recognition
Fanyi Xiao
Yong Jae Lee
Kristen Grauman
Jitendra Malik
Christoph Feichtenhofer
181
204
0
23 Jan 2020
AdaFrame: Adaptive Frame Selection for Fast Video Recognition
Zuxuan Wu
Caiming Xiong
Chih-Yao Ma
R. Socher
L. Davis
110
194
0
29 Nov 2018
ECO: Efficient Convolutional Network for Online Video Understanding
Mohammadreza Zolfaghari
Kamaljeet Singh
Thomas Brox
111
495
0
24 Apr 2018
Depth Pooling Based Large-scale 3D Action Recognition with Convolutional Neural Networks
Pichao Wang
W. Li
Zhimin Gao
Chang-Fu Tang
P. Ogunbona
3DV
95
137
0
17 Mar 2018
Scene Flow to Action Map: A New Representation for RGB-D based Action Recognition with Convolutional Neural Networks
Pichao Wang
W. Li
Zhimin Gao
Yuyao Zhang
Chang-Fu Tang
P. Ogunbona
3DPC
159
131
0
28 Feb 2017
Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting
Xingjian Shi
Zhourong Chen
Hao Wang
Dit-Yan Yeung
W. Wong
W. Woo
188
7,816
0
13 Jun 2015
Learning Human Activities and Object Affordances from RGB-D Videos
H. Koppula
Rudhir Gupta
Ashutosh Saxena
83
723
0
04 Oct 2012