ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1604.02808
  4. Cited By
NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis

NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis

11 April 2016
Amir Shahroudy
Jun Liu
T. Ng
G. Wang
ArXiv (abs)PDFHTML

Papers citing "NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis"

50 / 848 papers shown
SkeletonX: Data-Efficient Skeleton-based Action Recognition via Cross-sample Feature Aggregation
SkeletonX: Data-Efficient Skeleton-based Action Recognition via Cross-sample Feature AggregationIEEE transactions on multimedia (TMM), 2025
Zongye Zhang
Wenrui Cai
Qingjie Liu
Yanjie Wang
286
0
0
16 Apr 2025
SEVERE++: Evaluating Benchmark Sensitivity in Generalization of Video Representation Learning
SEVERE++: Evaluating Benchmark Sensitivity in Generalization of Video Representation Learning
Fida Mohammad Thoker
Letian Jiang
Chen Zhao
Piyush Bagad
Hazel Doughty
Bernard Ghanem
Cees G. M. Snoek
ViTSSL
343
0
0
08 Apr 2025
PvNeXt: Rethinking Network Design and Temporal Motion for Point Cloud Video Recognition
PvNeXt: Rethinking Network Design and Temporal Motion for Point Cloud Video RecognitionInternational Conference on Learning Representations (ICLR), 2025
Jie Wang
Tingfa Xu
Lihe Ding
Xinjie Zhang
Long Bai
Jianan Li
3DPC
939
2
0
07 Apr 2025
Uni4D: A Unified Self-Supervised Learning Framework for Point Cloud Videos
Uni4D: A Unified Self-Supervised Learning Framework for Point Cloud Videos
Zhi Zuo
Chenyi Zhuang
Zhiqiang Shen
Pan Gao
Jie Qin
Nicu Sebe
3DPC
344
1
0
07 Apr 2025
MultiSensor-Home: A Wide-area Multi-modal Multi-view Dataset for Action Recognition and Transformer-based Sensor Fusion
MultiSensor-Home: A Wide-area Multi-modal Multi-view Dataset for Action Recognition and Transformer-based Sensor FusionIEEE International Conference on Automatic Face & Gesture Recognition (FG), 2025
Trung Thanh Nguyen
Yasutomo Kawanishi
Vijay John
Takahiro Komamizu
Ichiro Ide
464
1
0
03 Apr 2025
Towards Generalizing Temporal Action Segmentation to Unseen Views
Towards Generalizing Temporal Action Segmentation to Unseen Views
Emad Bahrami
Olga Zatsarynna
Gianpiero Francesca
Juergen Gall
EgoV
224
0
0
03 Apr 2025
Dual-stream Transformer-GCN Model with Contextualized Representations Learning for Monocular 3D Human Pose Estimation
Dual-stream Transformer-GCN Model with Contextualized Representations Learning for Monocular 3D Human Pose Estimation
Mingrui Ye
Lianping Yang
Hegui Zhu
Zenghao Zheng
Xin Wang
Yantao Lo
ViT
297
1
0
02 Apr 2025
Learning to Normalize on the SPD Manifold under Bures-Wasserstein Geometry
Learning to Normalize on the SPD Manifold under Bures-Wasserstein GeometryComputer Vision and Pattern Recognition (CVPR), 2025
Rui Wang
Shaocheng Jin
Ziheng Chen
Xiaoqing Luo
Rui Wang
278
2
0
01 Apr 2025
HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled Generation
HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled GenerationComputer Vision and Pattern Recognition (CVPR), 2025
Boyuan Wang
Xiaofeng Wang
Chaojun Ni
Guosheng Zhao
Zhiqin Yang
...
Yukun Zhou
Xinze Chen
Guan Huang
Lihong Liu
Xingang Wang
VGen
389
16
0
31 Mar 2025
Action Recognition in Real-World Ambient Assisted Living Environment
Action Recognition in Real-World Ambient Assisted Living EnvironmentBig Data Mining and Analytics (BDMA), 2025
Vincent Gbouna Zakka
Zhuangzhuang Dai
Luis J. Manso
199
2
0
29 Mar 2025
LLaVAction: evaluating and training multi-modal large language models for action understanding
LLaVAction: evaluating and training multi-modal large language models for action understanding
Shaokai Ye
Haozhe Qi
Alexander Mathis
Mackenzie W. Mathis
354
4
0
24 Mar 2025
Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models
Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language ModelsComputer Vision and Pattern Recognition (CVPR), 2025
Zhihang Liu
Chen-Wei Xie
Nianzu Yang
Liming Zhao
Longxiang Tang
Yun Zheng
Chuanbin Liu
Hongtao Xie
VLM
227
14
0
20 Mar 2025
Body-Hand Modality Expertized Networks with Cross-attention for Fine-grained Skeleton Action Recognition
Body-Hand Modality Expertized Networks with Cross-attention for Fine-grained Skeleton Action RecognitionIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2025
Seungyeon Cho
Tae-Kyun Kim
319
1
0
19 Mar 2025
VA-AR: Learning Velocity-Aware Action Representations with Mixture of Window AttentionAAAI Conference on Artificial Intelligence (AAAI), 2025
Jiangning Wei
Lixiong Qin
Bo Yu
Tianjian Zou
Chuhan Yan
Dandan Xiao
Yang Yu
Lan Yang
Ke Li
Jun Liu
221
3
0
14 Mar 2025
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
Jianhong Bai
Menghan Xia
Xiao Fu
Xintao Wang
Lianrui Mu
...
Zuozhu Liu
Haoji Hu
Xiang Bai
Pengfei Wan
Di Zhang
DiffMVGen
434
96
0
14 Mar 2025
FastVID: Dynamic Density Pruning for Fast Video Large Language Models
FastVID: Dynamic Density Pruning for Fast Video Large Language Models
Leqi Shen
Guoqiang Gong
Tao He
Yifeng Zhang
Pengzhang Liu
Sicheng Zhao
Guiguang Ding
VLM
399
14
0
14 Mar 2025
Reasoning is All You Need for Video Generalization: A Counterfactual Benchmark with Sub-question Evaluation
Reasoning is All You Need for Video Generalization: A Counterfactual Benchmark with Sub-question EvaluationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Qiji Zhou
Yifan Gong
Guangsheng Bao
Hongjie Qiu
Jinqiang Li
Xiangrong Zhu
Huajian Zhang
Yue Zhang
LRM
258
3
0
12 Mar 2025
DreamRelation: Relation-Centric Video Customization
Yujie Wei
Shiwei Zhang
Hangjie Yuan
Biao Gong
Longxiang Tang
...
Haonan Qiu
Hengjia Li
Shuai Tan
Yujiao Shi
Hongming Shan
VGen
261
15
0
10 Mar 2025
Video Action DifferencingInternational Conference on Learning Representations (ICLR), 2025
James Burgess
Xiaohan Wang
Yuhui Zhang
Anita Rau
Alejandro Lozano
Lisa Dunlap
Trevor Darrell
Serena Yeung-Levy
VGen
310
7
0
10 Mar 2025
Modeling Human Skeleton Joint Dynamics for Fall DetectionInternational Conference on Digital Image Computing: Techniques and Applications (DICTA), 2021
Sania Zahan
Ghulam Mubashar Hassan
Lin Wang
3DH
186
3
0
10 Mar 2025
SDFA: Structure Aware Discriminative Feature Aggregation for Efficient Human Fall Detection in VideoIEEE Transactions on Industrial Informatics (IEEE TII), 2023
Sania Zahan
Ghulam Mubashar Hassan
Lin Wang
200
18
0
10 Mar 2025
SGA-INTERACT: A 3D Skeleton-based Benchmark for Group Activity Understanding in Modern Basketball Tactic
Yue Yang
Wei Wang
Yifei Liu
Linfeng Dong
Hao Wu
Mingxin Zhang
Zhihang Zhong
Xiao-Fu Sun
256
5
0
09 Mar 2025
HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models
HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Xiao Wang
Jingyun Hua
Weihong Lin
Yujiao Shi
Fuzheng Zhang
Yue Yu
Di Zhang
Liqiang Nie
VLM
667
1
0
28 Feb 2025
Balanced Representation Learning for Long-tailed Skeleton-based Action Recognition
Balanced Representation Learning for Long-tailed Skeleton-based Action RecognitionMachine Intelligence Research (MIR), 2023
Hongda Liu
Yunlong Wang
Min Ren
Junxing Hu
Zhengquan Luo
Guangqi Hou
Zhe Sun
281
3
0
24 Feb 2025
MimeQA: Towards Socially-Intelligent Nonverbal Foundation Models
MimeQA: Towards Socially-Intelligent Nonverbal Foundation Models
Hengzhi Li
Megan Tjandrasuwita
Yi R. Fung
Armando Solar-Lezama
Paul Pu Liang
477
7
0
23 Feb 2025
MoFM: A Large-Scale Human Motion Foundation Model
MoFM: A Large-Scale Human Motion Foundation Model
Mohammadreza Baharani
Ghazal Alinezhad Noghre
Armin Danesh Pazho
Gabriel Maldonado
Hamed Tabkhi
AI4CE
1.1K
2
0
08 Feb 2025
DSTSA-GCN: Advancing Skeleton-Based Gesture Recognition with Semantic-Aware Spatio-Temporal Topology Modeling
DSTSA-GCN: Advancing Skeleton-Based Gesture Recognition with Semantic-Aware Spatio-Temporal Topology Modeling
Hu Cui
Renjing Huang
Ruoyu Zhang
Tessai Hayama
248
6
0
21 Jan 2025
Initial Findings on Sensor based Open Vocabulary Activity Recognition via Text Embedding Inversion
Initial Findings on Sensor based Open Vocabulary Activity Recognition via Text Embedding Inversion
L. Ray
Bo Zhou
Sungho Suh
P. Lukowicz
VLM
306
0
0
13 Jan 2025
Improving Skeleton-based Action Recognition with Interactive Object Information
Improving Skeleton-based Action Recognition with Interactive Object Information
Hao Wen
Ziqian Lu
Fengli Shen
Zhe-Ming Lu
Jialin Cui
221
0
0
10 Jan 2025
High-Performance Inference Graph Convolutional Networks for Skeleton-Based Action Recognition
High-Performance Inference Graph Convolutional Networks for Skeleton-Based Action Recognition
Ziao Li
Junyi Wang
Bangli Liu
Haibin Cai
Mohamad Saada
Guhong Nie
3DH
319
0
0
08 Jan 2025
Evolving Skeletons: Motion Dynamics in Action Recognition
Evolving Skeletons: Motion Dynamics in Action RecognitionThe Web Conference (WWW), 2025
Jushang Qiu
Lei Wang
506
1
0
05 Jan 2025
FreqMixFormerV2: Lightweight Frequency-aware Mixed Transformer for Human Skeleton Action Recognition
FreqMixFormerV2: Lightweight Frequency-aware Mixed Transformer for Human Skeleton Action RecognitionIEEE International Conference on Automatic Face & Gesture Recognition (FG), 2024
Wenhan Wu
Peng Wang
Chong Chen
Aidong Lu
269
1
0
31 Dec 2024
MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model
MotionLCM: Real-time Controllable Motion Generation via Latent Consistency ModelEuropean Conference on Computer Vision (ECCV), 2024
Wen-Dao Dai
Ling-Hao Chen
Jingbo Wang
Jinpeng Liu
Bo Dai
Yansong Tang
531
113
0
31 Dec 2024
CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action Recognition
CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action RecognitionNeural Information Processing Systems (NeurIPS), 2024
Yuhang Wen
Mengyuan Liu
Songtao Wu
Beichen Ding
324
1
0
31 Dec 2024
HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding
HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language EmbeddingComputer Vision and Pattern Recognition (CVPR), 2024
Chenxin Tao
Shiqian Su
X. Zhu
Chenyu Zhang
Zhe Chen
...
Wenhai Wang
Lewei Lu
Gao Huang
Yu Qiao
Jifeng Dai
MLLMVLM
500
5
0
20 Dec 2024
PVC: Progressive Visual Token Compression for Unified Image and Video
  Processing in Large Vision-Language Models
PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language ModelsComputer Vision and Pattern Recognition (CVPR), 2024
Chenyu Yang
Xuan Dong
X. Zhu
Weijie Su
Jiahao Wang
H. Tian
Zheyu Chen
Wenhai Wang
Lewei Lu
Jifeng Dai
VLM
225
10
0
12 Dec 2024
USDRL: Unified Skeleton-Based Dense Representation Learning with
  Multi-Grained Feature Decorrelation
USDRL: Unified Skeleton-Based Dense Representation Learning with Multi-Grained Feature DecorrelationAAAI Conference on Artificial Intelligence (AAAI), 2024
Wanjiang Weng
Hongsong Wang
Junbo He
Lei He
Guosen Xie
414
12
0
12 Dec 2024
SkelMamba: A State Space Model for Efficient Skeleton Action Recognition
  of Neurological Disorders
SkelMamba: A State Space Model for Efficient Skeleton Action Recognition of Neurological Disorders
N. Martinel
Mariano Serrao
C. Micheloni
288
3
0
29 Nov 2024
OpenHumanVid: A Large-Scale High-Quality Dataset for Enhancing Human-Centric Video GenerationComputer Vision and Pattern Recognition (CVPR), 2024
Hui Li
Mingwang Xu
Yun Zhan
Shan Mu
Jiaye Li
...
Yukang Chen
Tan Chen
Mao Ye
Jingdong Wang
Siyu Zhu
VGen
550
39
0
28 Nov 2024
When Spatial meets Temporal in Action Recognition
When Spatial meets Temporal in Action Recognition
Wei Xu
Lei Wang
Yuxiao Chen
Tom Gedeon
Piotr Koniusz
301
3
0
22 Nov 2024
Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections
Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections
Youwei Zhou
Tianyang Xu
Cong Wu
Xiaojun Wu
J. Kittler
3DH
552
4
0
22 Nov 2024
X as Supervision: Contending with Depth Ambiguity in Unsupervised
  Monocular 3D Pose Estimation
X as Supervision: Contending with Depth Ambiguity in Unsupervised Monocular 3D Pose Estimation
Yuchen Yang
Xuanyi Liu
Xing Gao
Zhihang Zhong
Xingwu Sun
324
0
0
20 Nov 2024
Neuron: Learning Context-Aware Evolving Representations for Zero-Shot Skeleton Action RecognitionComputer Vision and Pattern Recognition (CVPR), 2024
Yang Chen
Jingcai Guo
Song Guo
Dacheng Tao
320
3
0
18 Nov 2024
Bridging the Skeleton-Text Modality Gap: Diffusion-Powered Modality Alignment for Zero-shot Skeleton-based Action Recognition
Jeonghyeok Do
Munchurl Kim
596
1
0
16 Nov 2024
Extended multi-stream temporal-attention module for skeleton-based human
  action recognition (HAR)
Extended multi-stream temporal-attention module for skeleton-based human action recognition (HAR)Computers in Human Behavior (CHB), 2024
Faisal Mehmood
Xin Guo
Enqing Chen
Muhammad Azeem Akbar
A. Khan
Sami Ullah
326
8
0
10 Nov 2024
Human Action Recognition (HAR) Using Skeleton-based Spatial Temporal
  Relative Transformer Network: ST-RTR
Human Action Recognition (HAR) Using Skeleton-based Spatial Temporal Relative Transformer Network: ST-RTR
Faisal Mehmood
Enqing Chen
Touqeer Abbas
Samah M. Alzanin
270
1
0
31 Oct 2024
Recovering Complete Actions for Cross-dataset Skeleton Action
  Recognition
Recovering Complete Actions for Cross-dataset Skeleton Action RecognitionNeural Information Processing Systems (NeurIPS), 2024
Hanchao Liu
Yujiang Li
Tai-Jiang Mu
Shi-Min Hu
240
2
0
31 Oct 2024
LiGAR: LiDAR-Guided Hierarchical Transformer for Multi-Modal Group
  Activity Recognition
LiGAR: LiDAR-Guided Hierarchical Transformer for Multi-Modal Group Activity RecognitionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
N. V. R. Chappa
Khoa Luu
178
2
0
28 Oct 2024
Idempotent Unsupervised Representation Learning for Skeleton-Based
  Action Recognition
Idempotent Unsupervised Representation Learning for Skeleton-Based Action RecognitionEuropean Conference on Computer Vision (ECCV), 2024
Lilang Lin
Lehong Wu
Jiahang Zhang
Jiaying Liu
302
9
0
27 Oct 2024
That was not what I was aiming at! Differentiating human intent and
  outcome in a physically dynamic throwing task
That was not what I was aiming at! Differentiating human intent and outcome in a physically dynamic throwing taskAutonomous Robots (AR), 2022
Vidullan Surendran
Alan R. Wagner
116
0
0
26 Oct 2024
Previous
12345...151617
Next