Temporal Segment Networks: Towards Good Practices for Deep Action Recognition

2 August 2016

Limin Wang

Yuanjun Xiong

Zhe Wang

Yu Qiao

Luc Van Gool

Papers citing "Temporal Segment Networks: Towards Good Practices for Deep Action Recognition"

50 / 1,449 papers shown

SlowFastVAD: Video Anomaly Detection via Integrating Simple Detector and RAG-Enhanced Vision-Language Model

242

14 Apr 2025

H-MoRe: Learning Human-centric Motion Representation for Action Analysis

Computer Vision and Pattern Recognition (CVPR), 2025

285

14 Apr 2025

SocialGesture: Delving into Multi-person Gesture Understanding

Computer Vision and Pattern Recognition (CVPR), 2025

230

03 Apr 2025

Is Temporal Prompting All We Need For Limited Labeled Action Recognition?

351

02 Apr 2025

FDDet: Frequency-Decoupling for Boundary Refinement in Temporal Action Detection

International Conference on Intelligent Computing (ICIC), 2025

298

01 Apr 2025

Sample-level Adaptive Knowledge Distillation for Action Recognition

331

01 Apr 2025

OwlSight: A Robust Illumination Adaptation Framework for Dark Video Human Action Recognition

221

30 Mar 2025

BEAR: A Video Dataset For Fine-grained Behaviors Recognition Oriented with Action and Environment Factors

Chengyang Hu

Yuduo Chen

Lizhuang Ma

180

26 Mar 2025

Video-ColBERT: Contextualized Late Interaction for Text-to-Video Retrieval

Computer Vision and Pattern Recognition (CVPR), 2025

356

24 Mar 2025

Context-Enhanced Memory-Refined Transformer for Online Action Detection

Computer Vision and Pattern Recognition (CVPR), 2025

353

24 Mar 2025

Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks

Computer Vision and Pattern Recognition (CVPR), 2025

268

24 Mar 2025

STOP: Integrated Spatial-Temporal Dynamic Prompting for Video Understanding

Computer Vision and Pattern Recognition (CVPR), 2025

430

20 Mar 2025

Quantum EigenGame for excited state calculation

David Quiroga

Jason Han

Anastasios Kyrillidis

280

17 Mar 2025

Towards Scalable Modeling of Compressed Videos for Efficient Action Recognition

332

17 Mar 2025

Elderly Activity Recognition in the Wild: Results from the EAR Challenge

Anh-Kiet Duong

185

10 Mar 2025

OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection

...

Juan Carlos León Alcázar

297

27 Feb 2025

Automatic Temporal Segmentation for Post-Stroke Rehabilitation: A Keypoint Detection and Temporal Segmentation Approach for Small Datasets

234

27 Feb 2025

Online Meta-learning for AutoML in Real-time (OnMAR)

Mia Gerber

Anna Sergeevna Bosman

J. D. Villiers

OffRL

259

27 Feb 2025

EgoSpeak: Learning When to Speak for Egocentric Conversational Agents in the Wild

North American Chapter of the Association for Computational Linguistics (NAACL), 2025

161

17 Feb 2025

Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis

Amir Hosein Fadaei

M. Dehaqani

328

11 Feb 2025

Conformal Predictions for Human Action Recognition with Vision-Language Models

357

10 Feb 2025

Seeing in the Dark: A Teacher-Student Framework for Dark Video Action Recognition via Knowledge Distillation and Contrastive Learning

Sharana Dharshikgan Suresh Dass

H. Barua

Ganesh Krishnasamy

Raveendran Paramesran

Raphael C.-W. Phan

397

06 Feb 2025

Collaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics

AAAI Conference on Artificial Intelligence (AAAI), 2025

464

13 Jan 2025

Optimizing Multitask Industrial Processes with Predictive Action Guidance

IEEE Transactions on Automation Science and Engineering (T-ASE), 2025

162

10 Jan 2025

An Efficient Adaptive Compression Method for Human Perception and Machine Vision Tasks

269

08 Jan 2025

High-Performance Inference Graph Convolutional Networks for Skeleton-Based Action Recognition

321

08 Jan 2025

Future Aspects in Human Action Recognition: Exploring Emerging Techniques and Ethical Influences

Antonios Gasteratos

Stavros N. Moutsis

Konstantinos A. Tsintotas

Yiannis Aloimonos

194

17 Dec 2024

Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

286

15 Dec 2024

A Decade of Deep Learning: A Survey on The Magnificent Seven

Dilshod Azizov

Muhammad Arslan Manzoor

...

300

13 Dec 2024

EdgeOAR: Real-time Online Action Recognition On Edge Devices

241

02 Dec 2024

When Spatial meets Temporal in Action Recognition

301

22 Nov 2024

Privacy-Preserving Video Anomaly Detection: A Survey

524

21 Nov 2024

Video-to-Task Learning via Motion-Guided Attention for Few-Shot Action Recognition

428

18 Nov 2024

OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing

European Conference on Computer Vision (ECCV), 2024

Pranav Gupta

Rishubh Singh

Pradeep Shenoy

Ravikiran Sarvadevabhatla

221

05 Nov 2024

Lost in Context: The Influence of Context on Feature Attribution Methods for Object Recognition

Indian Conference on Computer Vision, Graphics & Image Processing (ICVGIP), 2024

Sayanta Adhikari

Rishav Kumar

Konda Reddy Mopuri

Rajalakshmi Pachamuthu

240

05 Nov 2024

Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge

Neural Information Processing Systems (NeurIPS), 2024

...

479

04 Nov 2024

STAA: Spatio-Temporal Attention Attribution for Real-Time Interpreting Transformer-based Video Models

Zerui Wang

Yan Liu

313

01 Nov 2024

Recovering Complete Actions for Cross-dataset Skeleton Action Recognition

Neural Information Processing Systems (NeurIPS), 2024

246

31 Oct 2024

Multi-Level Feature Distillation of Joint Teachers Trained on Distinct Image Datasets

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

Adrian Iordache

B. Alexe

Radu Tudor Ionescu

306

29 Oct 2024

Enhancing Action Recognition by Leveraging the Hierarchical Structure of Actions and Textual Context

Computer Vision and Image Understanding (CVIU), 2024

Manuel Benavent-Lledo

David Mulero-Pérez

David Ortiz-Perez

José García Rodríguez

Antonis Argyros

320

28 Oct 2024

MM-WLAuslan: Multi-View Multi-Modal Word-Level Australian Sign Language Recognition Dataset

Neural Information Processing Systems (NeurIPS), 2024

Xin Shen

...

188

25 Oct 2024

Making Every Frame Matter: Continuous Activity Recognition in Streaming Video via Adaptive Video Context Modeling

Hao Wu

Yunxin Liu

Fengyuan Xu

588

19 Oct 2024

Pseudo Dataset Generation for Out-of-Domain Multi-Camera View Recommendation

Visual Communications and Image Processing (VCIP), 2024

Kuan-Ying Lee

Qian Zhou

Klara Nahrstedt

265

17 Oct 2024

On-the-fly Modulation for Balanced Multimodal Learning

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

240

15 Oct 2024

VidCompress: Memory-Enhanced Temporal Compression for Video Understanding in Large Language Models

181

15 Oct 2024

Movie Trailer Genre Classification Using Multimodal Pretrained Features

Expert Systems with Applications (ESWA), 2024

214

11 Oct 2024

Fourier-based Action Recognition for Wildlife Behavior Quantification with Event Cameras

Advanced Intelligent Systems (AIS), 2024

Friedhelm Hamann

Suman Ghosh

Ignacio Juarez Martinez

Tom Hart

Alex Kacelnik

Guillermo Gallego

211

09 Oct 2024

Cefdet: Cognitive Effectiveness Network Based on Fuzzy Inference for Action Detection

ACM Multimedia (MM), 2024

252

08 Oct 2024

Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models

263

04 Oct 2024

Loose Social-Interaction Recognition in Real-world Therapy Scenarios

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

Francois Bremond

283

30 Sep 2024