Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
1604.02808
Cited By

NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis

NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis

11 April 2016

Jun Liu

ArXiv (abs)PDF HTML

Papers citing "NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis"

50 / 848 papers shown

SkeletonX: Data-Efficient Skeleton-based Action Recognition via Cross-sample Feature Aggregation

SkeletonX: Data-Efficient Skeleton-based Action Recognition via Cross-sample Feature AggregationIEEE transactions on multimedia (TMM), 2025

286

0

0

16 Apr 2025

SEVERE++: Evaluating Benchmark Sensitivity in Generalization of Video Representation Learning

SEVERE++: Evaluating Benchmark Sensitivity in Generalization of Video Representation Learning

Fida Mohammad Thoker

Cees G. M. Snoek

343

0

0

08 Apr 2025

PvNeXt: Rethinking Network Design and Temporal Motion for Point Cloud Video Recognition

PvNeXt: Rethinking Network Design and Temporal Motion for Point Cloud Video RecognitionInternational Conference on Learning Representations (ICLR), 2025

939

2

0

07 Apr 2025

Uni4D: A Unified Self-Supervised Learning Framework for Point Cloud Videos

Uni4D: A Unified Self-Supervised Learning Framework for Point Cloud Videos

344

1

0

07 Apr 2025

MultiSensor-Home: A Wide-area Multi-modal Multi-view Dataset for Action Recognition and Transformer-based Sensor Fusion

MultiSensor-Home: A Wide-area Multi-modal Multi-view Dataset for Action Recognition and Transformer-based Sensor FusionIEEE International Conference on Automatic Face & Gesture Recognition (FG), 2025

Trung Thanh Nguyen

Yasutomo Kawanishi

Takahiro Komamizu

464

1

0

03 Apr 2025

Towards Generalizing Temporal Action Segmentation to Unseen Views

Towards Generalizing Temporal Action Segmentation to Unseen Views

Olga Zatsarynna

Gianpiero Francesca

224

0

0

03 Apr 2025

Dual-stream Transformer-GCN Model with Contextualized Representations Learning for Monocular 3D Human Pose Estimation

Dual-stream Transformer-GCN Model with Contextualized Representations Learning for Monocular 3D Human Pose Estimation

297

1

0

02 Apr 2025

Learning to Normalize on the SPD Manifold under Bures-Wasserstein Geometry

Learning to Normalize on the SPD Manifold under Bures-Wasserstein GeometryComputer Vision and Pattern Recognition (CVPR), 2025

278

2

0

01 Apr 2025

HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled Generation

HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled GenerationComputer Vision and Pattern Recognition (CVPR), 2025

...

389

16

0

31 Mar 2025

Action Recognition in Real-World Ambient Assisted Living Environment

Action Recognition in Real-World Ambient Assisted Living EnvironmentBig Data Mining and Analytics (BDMA), 2025

Vincent Gbouna Zakka

Zhuangzhuang Dai

199

2

0

29 Mar 2025

LLaVAction: evaluating and training multi-modal large language models for action understanding

LLaVAction: evaluating and training multi-modal large language models for action understanding

Alexander Mathis

Mackenzie W. Mathis

354

4

0

24 Mar 2025

Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models

Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language ModelsComputer Vision and Pattern Recognition (CVPR), 2025

227

14

0

20 Mar 2025

Body-Hand Modality Expertized Networks with Cross-attention for Fine-grained Skeleton Action Recognition

Body-Hand Modality Expertized Networks with Cross-attention for Fine-grained Skeleton Action RecognitionIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2025

319

1

0

19 Mar 2025

VA-AR: Learning Velocity-Aware Action Representations with Mixture of Window AttentionAAAI Conference on Artificial Intelligence (AAAI), 2025

221

3

0

14 Mar 2025

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

...

434

96

0

14 Mar 2025

FastVID: Dynamic Density Pruning for Fast Video Large Language Models

FastVID: Dynamic Density Pruning for Fast Video Large Language Models

399

14

0

14 Mar 2025

Reasoning is All You Need for Video Generalization: A Counterfactual Benchmark with Sub-question Evaluation

Reasoning is All You Need for Video Generalization: A Counterfactual Benchmark with Sub-question EvaluationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

258

3

0

12 Mar 2025

DreamRelation: Relation-Centric Video Customization

...

261

15

0

10 Mar 2025

Video Action DifferencingInternational Conference on Learning Representations (ICLR), 2025

Alejandro Lozano

Serena Yeung-Levy

310

7

0

10 Mar 2025

Modeling Human Skeleton Joint Dynamics for Fall DetectionInternational Conference on Digital Image Computing: Techniques and Applications (DICTA), 2021

Ghulam Mubashar Hassan

186

3

0

10 Mar 2025

SDFA: Structure Aware Discriminative Feature Aggregation for Efficient Human Fall Detection in VideoIEEE Transactions on Industrial Informatics (IEEE TII), 2023

Ghulam Mubashar Hassan

200

18

0

10 Mar 2025

SGA-INTERACT: A 3D Skeleton-based Benchmark for Group Activity Understanding in Modern Basketball Tactic

256

5

0

09 Mar 2025

HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models

HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

667

1

0

28 Feb 2025

Balanced Representation Learning for Long-tailed Skeleton-based Action Recognition

Balanced Representation Learning for Long-tailed Skeleton-based Action RecognitionMachine Intelligence Research (MIR), 2023

281

3

0

24 Feb 2025

MimeQA: Towards Socially-Intelligent Nonverbal Foundation Models

MimeQA: Towards Socially-Intelligent Nonverbal Foundation Models

Megan Tjandrasuwita

Armando Solar-Lezama

477

7

0

23 Feb 2025

MoFM: A Large-Scale Human Motion Foundation Model

MoFM: A Large-Scale Human Motion Foundation Model

Mohammadreza Baharani

Ghazal Alinezhad Noghre

Armin Danesh Pazho

Gabriel Maldonado

1.1K

2

0

08 Feb 2025

DSTSA-GCN: Advancing Skeleton-Based Gesture Recognition with Semantic-Aware Spatio-Temporal Topology Modeling

DSTSA-GCN: Advancing Skeleton-Based Gesture Recognition with Semantic-Aware Spatio-Temporal Topology Modeling

248

6

0

21 Jan 2025

Initial Findings on Sensor based Open Vocabulary Activity Recognition via Text Embedding Inversion

Initial Findings on Sensor based Open Vocabulary Activity Recognition via Text Embedding Inversion

306

0

0

13 Jan 2025

Improving Skeleton-based Action Recognition with Interactive Object Information

Improving Skeleton-based Action Recognition with Interactive Object Information

221

0

0

10 Jan 2025

High-Performance Inference Graph Convolutional Networks for Skeleton-Based Action Recognition

High-Performance Inference Graph Convolutional Networks for Skeleton-Based Action Recognition

319

0

0

08 Jan 2025

Evolving Skeletons: Motion Dynamics in Action Recognition

Evolving Skeletons: Motion Dynamics in Action RecognitionThe Web Conference (WWW), 2025

506

1

0

05 Jan 2025

FreqMixFormerV2: Lightweight Frequency-aware Mixed Transformer for Human Skeleton Action Recognition

FreqMixFormerV2: Lightweight Frequency-aware Mixed Transformer for Human Skeleton Action RecognitionIEEE International Conference on Automatic Face & Gesture Recognition (FG), 2024

269

1

0

31 Dec 2024

MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model

MotionLCM: Real-time Controllable Motion Generation via Latent Consistency ModelEuropean Conference on Computer Vision (ECCV), 2024

531

113

0

31 Dec 2024

CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action Recognition

CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action RecognitionNeural Information Processing Systems (NeurIPS), 2024

324

1

0

31 Dec 2024

HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding

HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language EmbeddingComputer Vision and Pattern Recognition (CVPR), 2024

...

500

5

0

20 Dec 2024

PVC: Progressive Visual Token Compression for Unified Image and Video
Processing in Large Vision-Language Models

PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language ModelsComputer Vision and Pattern Recognition (CVPR), 2024

225

10

0

12 Dec 2024

USDRL: Unified Skeleton-Based Dense Representation Learning with
Multi-Grained Feature Decorrelation

USDRL: Unified Skeleton-Based Dense Representation Learning with Multi-Grained Feature DecorrelationAAAI Conference on Artificial Intelligence (AAAI), 2024

414

12

0

12 Dec 2024

SkelMamba: A State Space Model for Efficient Skeleton Action Recognition
of Neurological Disorders

SkelMamba: A State Space Model for Efficient Skeleton Action Recognition of Neurological Disorders

288

3

0

29 Nov 2024

OpenHumanVid: A Large-Scale High-Quality Dataset for Enhancing Human-Centric Video GenerationComputer Vision and Pattern Recognition (CVPR), 2024

...

550

39

0

28 Nov 2024

When Spatial meets Temporal in Action Recognition

When Spatial meets Temporal in Action Recognition

301

3

0

22 Nov 2024

Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections

Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections

552

4

0

22 Nov 2024

X as Supervision: Contending with Depth Ambiguity in Unsupervised
Monocular 3D Pose Estimation

X as Supervision: Contending with Depth Ambiguity in Unsupervised Monocular 3D Pose Estimation

324

0

0

20 Nov 2024

Neuron: Learning Context-Aware Evolving Representations for Zero-Shot Skeleton Action RecognitionComputer Vision and Pattern Recognition (CVPR), 2024

320

3

0

18 Nov 2024

Bridging the Skeleton-Text Modality Gap: Diffusion-Powered Modality Alignment for Zero-shot Skeleton-based Action Recognition

596

1

0

16 Nov 2024

Extended multi-stream temporal-attention module for skeleton-based human
action recognition (HAR)

Extended multi-stream temporal-attention module for skeleton-based human action recognition (HAR)Computers in Human Behavior (CHB), 2024

Muhammad Azeem Akbar

326

8

0

10 Nov 2024

Human Action Recognition (HAR) Using Skeleton-based Spatial Temporal
Relative Transformer Network: ST-RTR

Human Action Recognition (HAR) Using Skeleton-based Spatial Temporal Relative Transformer Network: ST-RTR

Samah M. Alzanin

270

1

0

31 Oct 2024

Recovering Complete Actions for Cross-dataset Skeleton Action
Recognition

Recovering Complete Actions for Cross-dataset Skeleton Action RecognitionNeural Information Processing Systems (NeurIPS), 2024

240

2

0

31 Oct 2024

LiGAR: LiDAR-Guided Hierarchical Transformer for Multi-Modal Group
Activity Recognition

LiGAR: LiDAR-Guided Hierarchical Transformer for Multi-Modal Group Activity RecognitionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

N. V. R. Chappa

178

2

0

28 Oct 2024

Idempotent Unsupervised Representation Learning for Skeleton-Based
Action Recognition

Idempotent Unsupervised Representation Learning for Skeleton-Based Action RecognitionEuropean Conference on Computer Vision (ECCV), 2024

302

9

0

27 Oct 2024

That was not what I was aiming at! Differentiating human intent and
outcome in a physically dynamic throwing task

That was not what I was aiming at! Differentiating human intent and outcome in a physically dynamic throwing taskAutonomous Robots (AR), 2022

Vidullan Surendran

116

0

0

26 Oct 2024

1 2 3 4 5...15 16 17