Temporal Segment Networks: Towards Good Practices for Deep Action Recognition

2 August 2016

Limin Wang

Yuanjun Xiong

Zhe Wang

Yu Qiao

Luc Van Gool

Papers citing "Temporal Segment Networks: Towards Good Practices for Deep Action Recognition"

50 / 1,449 papers shown

Revisiting the Spatial and Temporal Modeling for Few-shot Action RecognitionAAAI Conference on Artificial Intelligence (AAAI), 2023

Mengmeng Wang

213

19 Jan 2023

Temporal Perceiving Video-Language Pre-training

Heng Wang

Yi Yang

206

18 Jan 2023

CNN-Based Action Recognition and Pose Estimation for Classifying Animal Behavior from Videos: A Survey

Michael Perez

Corey Toler-Franklin

MedIm

194

15 Jan 2023

ViTs for SITS: Vision Transformers for Satellite Image Time SeriesComputer Vision and Pattern Recognition (CVPR), 2023

283

12 Jan 2023

HierVL: Learning Hierarchical Video-Language EmbeddingsComputer Vision and Pattern Recognition (CVPR), 2023

440

05 Jan 2023

Look, Listen, and Attack: Backdoor Attacks Against Video Action Recognition

280

03 Jan 2023

Efficient Robustness Assessment via Adversarial Spatial-Temporal Focus on VideosIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

311

03 Jan 2023

Hierarchical Explanations for Video Action Recognition

354

01 Jan 2023

Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language ModelsComputer Vision and Pattern Recognition (CVPR), 2022

Jingdong Wang

Wanli Ouyang

395

31 Dec 2022

Representation Learning in Deep RL via Discrete Information BottleneckInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022

...

177

28 Dec 2022

Deep set conditioned latent representations for action recognitionVISIGRAPP (VISIGRAPP), 2022

168

21 Dec 2022

C2F-TCN: A Framework for Semi and Fully Supervised Temporal Action SegmentationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Dipika Singhania

R. Rahaman

Angela Yao

193

20 Dec 2022

A Survey on Human Action Recognition

Zhou Shuchang

226

20 Dec 2022

Egocentric Video Task TranslationComputer Vision and Pattern Recognition (CVPR), 2022

267

13 Dec 2022

Contextual Explainable Video Representation: Human Perception-based UnderstandingAsilomar Conference on Signals, Systems and Computers (ACSSC), 2022

Ngan Le

226

12 Dec 2022

Reconstructing Humpty Dumpty: Multi-feature Graph Autoencoder for Open Set Action RecognitionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

155

12 Dec 2022

Multimodal Prototype-Enhanced Network for Few-Shot Action RecognitionInternational Conference on Multimedia Retrieval (ICMR), 2022

Yong Liu

Yujiu Yang

276

09 Dec 2022

Leveraging Spatio-Temporal Dependency for Skeleton-Based Action RecognitionIEEE International Conference on Computer Vision (ICCV), 2022

226

09 Dec 2022

Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene SegmentationIEEE Access (IEEE Access), 2022

Wei Liu

190

09 Dec 2022

DroneAttention: Sparse Weighted Temporal Attention for Drone-Camera Based Activity RecognitionNeural Networks (NN), 2022

212

07 Dec 2022

Fine-tuned CLIP Models are Efficient Video LearnersComputer Vision and Pattern Recognition (CVPR), 2022

H. Rasheed

Muhammad Uzair Khattak

Muhammad Maaz

Salman Khan

Fahad Shahbaz Khan

CLIP VLM

404

225

06 Dec 2022

InternVideo: General Video Foundation Models via Generative and Discriminative Learning

Yi Wang

...

Yu Qiao

455

448

06 Dec 2022

VLG: General Video Recognition with Web Textual KnowledgeInternational Journal of Computer Vision (IJCV), 2022

237

03 Dec 2022

Masked Contrastive Pre-Training for Efficient Video-Text Retrieval

185

02 Dec 2022

Lightweight Structure-Aware Attention for Visual UnderstandingInternational Journal of Computer Vision (IJCV), 2022

200

29 Nov 2022

Post-Processing Temporal Action DetectionComputer Vision and Pattern Recognition (CVPR), 2022

166

27 Nov 2022

Towards Good Practices for Missing Modality Robust Action RecognitionAAAI Conference on Artificial Intelligence (AAAI), 2022

241

25 Nov 2022

Hand Guided High Resolution Feature Enhancement for Fine-Grained Atomic Action Segmentation within Complex Human Assemblies

124

24 Nov 2022

Video Test-Time Adaptation for Action RecognitionComputer Vision and Pattern Recognition (CVPR), 2022

261

24 Nov 2022

SVFormer: Semi-supervised Video Transformer for Action RecognitionComputer Vision and Pattern Recognition (CVPR), 2022

Zuxuan Wu

197

120

23 Nov 2022

Dynamic Appearance: A Video Representation for Action Recognition with Joint Training

Guoxi Huang

A. Bors

178

23 Nov 2022

Look More but Care Less in Video RecognitionNeural Information Processing Systems (NeurIPS), 2022

219

18 Nov 2022

UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer

Yi Wang

Yu Qiao

227

156

17 Nov 2022

Video Unsupervised Domain Adaptation with Deep Learning: A Comprehensive SurveyACM Computing Surveys (ACM CSUR), 2022

Yuecong Xu

Lihua Xie

232

17 Nov 2022

Language-Assisted Deep Learning for Autistic Behaviors RecognitionSmart Health (SH), 2022

Qian Chen

180

17 Nov 2022

A Unified Multimodal De- and Re-coupling Framework for RGB-D Motion RecognitionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Fan Wang

221

16 Nov 2022

Exploring State Change Capture of Heterogeneous Backbones @ Ego4D Hands and Objects Challenge 2022

184

16 Nov 2022

Dynamic Temporal Filtering in Video ModelsEuropean Conference on Computer Vision (ECCV), 2022

Fuchen Long

Zhaofan Qiu

Yingwei Pan

Ting Yao

Chong-Wah Ngo

Tao Mei

AI4TS

237

15 Nov 2022

EVA: Exploring the Limits of Masked Visual Representation Learning at ScaleComputer Vision and Pattern Recognition (CVPR), 2022

621

901

14 Nov 2022

Deep Unsupervised Key Frame Extraction for Efficient Video Classification

103

12 Nov 2022

Soft-Landing Strategy for Alleviating the Task Discrepancy Problem in Temporal Action Localization TasksComputer Vision and Pattern Recognition (CVPR), 2022

281

11 Nov 2022

SWTF: Sparse Weighted Temporal Fusion for Drone-Based Activity Recognition

175

10 Nov 2022

SimOn: A Simple Framework for Online Temporal Action Localization

161

08 Nov 2022

Facial Tic Detection in Untrimmed Videos of Tourette Syndrome PatientsInternational Conference on Pattern Recognition (ICPR), 2022

133

07 Nov 2022

Bringing Online Egocentric Action Recognition into the wildIEEE Robotics and Automation Letters (RA-L), 2022

225

06 Nov 2022

Event and Entity Extraction from Generated Video CaptionsInternational Cross-Domain Conference on Machine Learning and Knowledge Extraction (CD-MAKE), 2022

Johannes Scherer

A. Scherp

Deepayan Bhowmik

238

05 Nov 2022

Self-Supervised Learning for Speech Enhancement through SynthesisIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

194

04 Nov 2022

Video Event Extraction via Tracking Visual States of ArgumentsAAAI Conference on Artificial Intelligence (AAAI), 2022

Heng Ji

204

03 Nov 2022

Deep Learning Computer Vision Algorithms for Real-time UAVs On-board Camera Image Processing

A. Palmas

P. Andronico

225

02 Nov 2022

TAMFormer: Multi-Modal Transformer with Learned Attention Mask for Early Intent PredictionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Nada Osman

Guglielmo Camporese

Lamberto Ballan

127

26 Oct 2022