Temporal Segment Networks: Towards Good Practices for Deep Action Recognition

2 August 2016

Limin Wang

Yuanjun Xiong

Zhe Wang

Yu Qiao

Luc Van Gool

Papers citing "Temporal Segment Networks: Towards Good Practices for Deep Action Recognition"

50 / 1,449 papers shown

Refining Action Boundaries for One-stage DetectionAdvanced Video and Signal Based Surveillance (AVSS), 2022

Dima Damen

153

25 Oct 2022

GliTr: Glimpse Transformers with Spatiotemporal Consistency for Online Action PredictionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

Samrudhdhi B. Rangrej

Kevin J. Liang

Tal Hassner

James J. Clark

288

24 Oct 2022

Anticipative Feature Fusion Transformer for Multi-Modal Action AnticipationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

181

23 Oct 2022

Grounded Video Situation RecognitionNeural Information Processing Systems (NeurIPS), 2022

Zeeshan Khan

C. V. Jawahar

Makarand Tapaswi

192

19 Oct 2022

FedForgery: Generalized Face Forgery Detection with Residual Federated LearningIEEE Transactions on Information Forensics and Security (IEEE TIFS), 2022

Xinbo Gao

319

18 Oct 2022

Temporal and Contextual Transformer for Multi-Camera Editing of TV Shows

183

17 Oct 2022

Semantic Video Moments Retrieval at Scale: A New Task and a Baseline

Na Li

240

15 Oct 2022

MMTSA: Multimodal Temporal Segment Attention Network for Efficient Human Activity RecognitionProceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2022

214

14 Oct 2022

LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long Livestream VideosIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

Ding Zhao

149

12 Oct 2022

Students taught by multimodal teachers are superior action recognizers

Gorjan Radevski

Dusan Grujicic

Matthew Blaschko

Marie-Francine Moens

Tinne Tuytelaars

211

09 Oct 2022

Learning Fine-Grained Visual Understanding for Video Question Answering via Decoupling Spatial-Temporal ModelingBritish Machine Vision Conference (BMVC), 2022

Hsin-Ying Lee

Hung-Ting Su

312

08 Oct 2022

Multi-Scale Wavelet Transformer for Face Forgery DetectionAsian Conference on Computer Vision (ACCV), 2022

234

08 Oct 2022

Alignment-guided Temporal Attention for Video Action RecognitionNeural Information Processing Systems (NeurIPS), 2022

155

30 Sep 2022

Learning Transferable Spatiotemporal Representations from Natural Script KnowledgeComputer Vision and Pattern Recognition (CVPR), 2022

Ping Luo

213

30 Sep 2022

AdaFocusV3: On Unified Spatial-temporal Dynamic Video RecognitionEuropean Conference on Computer Vision (ECCV), 2022

Yulin Wang

Gao Huang

269

27 Sep 2022

EgoSpeed-Net: Forecasting Speed-Control in Driver Behavior from Egocentric Video Data

192

27 Sep 2022

Rethinking Resolution in the Context of Efficient Video RecognitionNeural Information Processing Systems (NeurIPS), 2022

Ping Luo

Xiaojuan Qi

220

26 Sep 2022

Multi-modal Video Chapter Generation

190

26 Sep 2022

Mitigating Representation Bias in Action Recognition: Algorithms and Benchmarks

115

20 Sep 2022

MSA-GCN:Multiscale Adaptive Graph Convolution Network for Gait Emotion RecognitionPattern Recognition (Pattern Recogn.), 2022

178

19 Sep 2022

MECCANO: A Multimodal Egocentric Dataset for Humans Behavior Understanding in the Industrial-like DomainComputer Vision and Image Understanding (CVIU), 2022

236

19 Sep 2022

Action-based Early Autism Diagnosis Using Contrastive Feature LearningMultimedia Systems (Multimed. Syst.), 2022

Asha Rani

Pankaj Yadav

Yashaswi Verma

210

12 Sep 2022

Graphing the Future: Activity and Next Active Object Prediction using Graph-based Activity RepresentationsInternational Symposium on Visual Computing (ISVC), 2022

Victoria Manousaki

K. Papoutsakis

Antonis Argyros

145

12 Sep 2022

Predicting the Next Action by Modeling the Abstract GoalInternational Conference on Pattern Recognition (ICPR), 2022

Debaditya Roy

Basura Fernando

EgoV

368

12 Sep 2022

MAiVAR: Multimodal Audio-Image and Video Action RecognizerVisual Communications and Image Processing (VCIP), 2022

Muhammad Bilal Shaikh

Douglas Chai

S. Islam

Naveed Akhtar

160

11 Sep 2022

An Empirical Study of End-to-End Video-Language Transformers with Masked Visual ModelingComputer Vision and Pattern Recognition (CVPR), 2022

Zicheng Liu

633

04 Sep 2022

Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action RecognitionEuropean Conference on Computer Vision (ECCV), 2022

218

03 Sep 2022

Attentive pooling for Group Activity Recognition

Yuan Xie

185

31 Aug 2022

A Circular Window-based Cascade Transformer for Online Action Detection

192

30 Aug 2022

Actor-identified Spatiotemporal Action Detection -- Detecting Who Is Doing What in Videos

Fan Yang

Norimichi Ukita

S. Sakti

Satoshi Nakamura

205

27 Aug 2022

Adaptive Perception Transformer for Temporal Action Localization

Yizheng Ouyang

Tianjin Zhang

Weibo Gu

Hongfa Wang

240

25 Aug 2022

Modality Mixer for Multi-modal Action RecognitionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

177

24 Aug 2022

Hierarchically Decomposed Graph Convolutional Networks for Skeleton-Based Action RecognitionIEEE International Conference on Computer Vision (ICCV), 2022

282

213

23 Aug 2022

Hierarchical Compositional Representations for Few-shot Action RecognitionComputer Vision and Image Understanding (CVIU), 2022

269

19 Aug 2022

Spatial Temporal Graph Attention Network for Skeleton-Based Action Recognition

175

18 Aug 2022

Progressive Cross-modal Knowledge Distillation for Human Action RecognitionACM Multimedia (ACM MM), 2022

Jianyuan Ni

A. Ngu

Yan Yan

HAI

204

17 Aug 2022

UAV-CROWD: Violent and non-violent crowd activity simulator from the perspective of UAV

119

13 Aug 2022

Sports Video Analysis on Large-Scale DataEuropean Conference on Computer Vision (ECCV), 2022

Dekun Wu

Henghui Zhao

Xingce Bao

Richard P. Wildes

146

09 Aug 2022

BabyNet: A Lightweight Network for Infant Reaching Action Recognition in Unconstrained Environments to Support Future Pediatric Rehabilitation ApplicationsIEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), 2021

Dannya Enriquez Barrundia

Elena Kokkoni

Konstantinos Karydis

167

09 Aug 2022

Video-based Human Action Recognition using Deep Learning: A Review

174

07 Aug 2022

Frozen CLIP Models are Efficient Video LearnersEuropean Conference on Computer Vision (ECCV), 2022

Yu Qiao

260

254

06 Aug 2022

Expanding Language-Image Pretrained Models for General Video RecognitionEuropean Conference on Computer Vision (ECCV), 2022

337

433

04 Aug 2022

Uncertainty-Driven Action Quality Assessment

Caixia Zhou

Yaping Huang

330

29 Jul 2022

Spatiotemporal Self-attention Modeling with Temporal Patch Shift for Action RecognitionEuropean Conference on Computer Vision (ECCV), 2022

Lei Zhang

161

27 Jul 2022

Bodily Behaviors in Social Interaction: Novel Annotations and State-of-the-Art EvaluationACM Multimedia (ACM MM), 2022

285

26 Jul 2022

P2ANet: A Dataset and Benchmark for Dense Action Detection from Table Tennis Match Broadcasting Videos

Jiang Bian

Haoyi Xiong

195

26 Jul 2022

Cross-Modal Causal Relational Reasoning for Event-Level Visual Question AnsweringIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Yang Liu

Guanbin Li

LRM

575

148

26 Jul 2022

MAR: Masked Autoencoders for Efficient Action RecognitionIEEE transactions on multimedia (IEEE TMM), 2022

248

24 Jul 2022

EgoEnv: Human-centric environment representations from egocentric videoNeural Information Processing Systems (NeurIPS), 2022

Tushar Nagarajan

Santhosh Kumar Ramakrishnan

Ruta Desai

James M. Hillis

Kristen Grauman

EgoV

311

22 Jul 2022

NSNet: Non-saliency Suppression Sampler for Efficient Video RecognitionEuropean Conference on Computer Vision (ECCV), 2022

Wanli Ouyang

230

21 Jul 2022