Multiscaled Multi-Head Attention-based Video Transformer Network for Hand Gesture Recognition

IEEE Signal Processing Letters (IEEE SPL), 2025

3 January 2025

Papers citing "Multiscaled Multi-Head Attention-based Video Transformer Network for Hand Gesture Recognition"

MVTN: A Multiscale Video Transformer Network for Hand Gesture Recognition

333

05 Sep 2024

A Methodological and Structural Review of Hand Gesture Recognition Across Diverse Data Modalities

IEEE Access (IEEE Access), 2024

289

10 Aug 2024

GestFormer: Multiscale Wavelet Pooling Transformer Network for Dynamic Hand Gesture Recognition

363

18 May 2024

End-to-end Video Gaze Estimation via Capturing Head-face-eye Spatial-temporal Interaction Context

IEEE Signal Processing Letters (IEEE SPL), 2023

Zhiguo Cao

441

27 Oct 2023

MViTv2: Improved Multiscale Vision Transformers for Classification and Detection

Christoph Feichtenhofer

ViT

588

886

02 Dec 2021

Multi-Task and Multi-Modal Learning for RGB Dynamic Gesture Recognition

IEEE Sensors Journal (IEEE Sens. J.), 2021

258

29 Oct 2021

Multiscale Vision Transformers

IEEE International Conference on Computer Vision (ICCV), 2021

Christoph Feichtenhofer

ViT

607

1,592

22 Apr 2021

CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification

IEEE International Conference on Computer Vision (ICCV), 2021

473

2,043

27 Mar 2021

Video Transformer Network

1.3K

486

01 Feb 2021

Training data-efficient image transformers & distillation through attention

International Conference on Machine Learning (ICML), 2020

Alexandre Sablayrolles

Edouard Grave

ViT

785

8,831

23 Dec 2020

Multi-modal Fusion for Single-Stage Continuous Gesture Recognition

395

10 Nov 2020

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

Alexey Dosovitskiy

...

1.6K

60,663

22 Oct 2020

Searching Multi-Rate and Multi-Modal Temporal Enhanced Networks for Gesture Recognition

Stan Z. Li

288

114

21 Aug 2020

Res3ATN -- Deep 3D Residual Attention Network for Hand Gesture Recognition in Videos

International Conference on 3D Vision (3DV), 2019

Naina Dhingra

A. Kunz

3DPC SLR

330

04 Jan 2020

Real-time Hand Gesture Detection and Classification Using Convolutional Neural Networks

IEEE International Conference on Automatic Face & Gesture Recognition (FG), 2019

548

236

29 Jan 2019

Improving the Performance of Unimodal Dynamic Hand-Gesture Recognition with Multimodal Training

Mahdi Abavisani

Hamid Reza Vaezi Joze

Vishal M. Patel

318

157

14 Dec 2018

Motion Fused Frames: Data Level Fusion Strategy for Hand Gesture Recognition

Okan Kopuklu

Neslihan Köse

Gerhard Rigoll

336

122

19 Apr 2018

Exploiting Recurrent Neural Networks and Leap Motion Controller for Sign Language and Semaphoric Gesture Recognition

300

164

28 Mar 2018

Attention Is All You Need

Neural Information Processing Systems (NeurIPS), 2017

8.4K

172,602

12 Jun 2017

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

João Carreira

Andrew Zisserman

973

9,368

22 May 2017

A robust and efficient video representation for action recognition

Heng Wang

Dan Oneaţă

Jakob Verbeek

Cordelia Schmid

241

338

21 Apr 2015

Two-Stream Convolutional Networks for Action Recognition in Videos

Neural Information Processing Systems (NeurIPS), 2014

Karen Simonyan

Andrew Zisserman

1.1K

8,139

09 Jun 2014