Temporal Segment Networks: Towards Good Practices for Deep Action Recognition

2 August 2016
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
ViT
arXiv: 1608.00859

Papers citing "Temporal Segment Networks: Towards Good Practices for Deep Action Recognition"

50 / 1,449 papers shown
EgoLM: Multi-Modal Language Model of Egocentric Motions
Computer Vision and Pattern Recognition (CVPR), 2024
Fangzhou Hong
Vladimir Guzov
Hyo Jin Kim
Yuting Ye
Richard Newcombe
Ziwei Liu
Lingni Ma
178
12
0
26 Sep 2024
Deep Learning for Video Anomaly Detection: A Review
Peng Wu
Chengyu Pan
Yuting Yan
Guansong Pang
Peng Wang
Yanning Zhang
VLM, AI4TS
221
32
0
09 Sep 2024
Self-Supervised Contrastive Learning for Videos using Differentiable Local Alignment
Keyne Oei
Amr Gomaa
Anna Maria Feit
João Belo
329
1
0
06 Sep 2024
GMFL-Net: A Global Multi-geometric Feature Learning Network for Repetitive Action Counting
Jun Li
Jinying Wu
Qiming Li
Feifei Guo
289
1
0
31 Aug 2024
Joint Temporal Pooling for Improving Skeleton-based Action Recognition
International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2023
Shanaka Ramesh Gunasekara
Wanqing Li
Jack Yang
P. Ogunbona
196
3
0
18 Aug 2024
Flatten: Video Action Recognition is an Image Classification task
Junlin Chen
Chengcheng Xu
Yangfan Xu
Zhiqiang Wang
Jun Yu Li
Zhiping Shi
249
2
0
17 Aug 2024
Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation Approach
European Conference on Computer Vision (ECCV), 2024
Shizhou Zhang
Wenlong Luo
De Cheng
Qingchun Yang
Lingyan Ran
Yinghui Xing
Yanning Zhang
VOS
209
20
0
14 Aug 2024
Dynamic and Compressive Adaptation of Transformers From Images to Videos
Guozhen Zhang
Jingyu Liu
Shengming Cao
Xiaotong Zhao
Kevin Zhao
Kai Ma
Limin Wang
ViT
459
2
0
13 Aug 2024
HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization
European Conference on Computer Vision (ECCV), 2024
Sakib Reza
Yuexi Zhang
Mohsen Moghaddam
Mario Sznaier
230
5
0
12 Aug 2024
Weakly Supervised Video Anomaly Detection and Localization with Spatio-Temporal Prompts
ACM Multimedia (MM), 2024
Peng Wu
Xuerong Zhou
Guansong Pang
Zhiwei Yang
Qingsen Yan
Peng Wang
Yanning Zhang
411
40
0
12 Aug 2024
FADE: A Dataset for Detecting Falling Objects around Buildings in Video
IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2024
Zhigang Tu
Zhengbo Zhang
Zitao Gao
Chunluan Zhou
J. Yuan
Bo Du
418
1
0
11 Aug 2024
A Methodological and Structural Review of Hand Gesture Recognition Across Diverse Data Modalities
IEEE Access, 2024
Jungpil Shin
Abu Saleh Musa Miah
Md. Humaun Kabir
M. Rahim
Abdullah Al Shiam
247
41
0
10 Aug 2024
MU-MAE: Multimodal Masked Autoencoders-Based One-Shot Learning
Conference on Multimedia Information Processing and Retrieval (MIPR), 2024
Rex Liu
Xin Liu
267
1
0
08 Aug 2024
Online Temporal Action Localization with Memory-Augmented Transformer
European Conference on Computer Vision (ECCV), 2024
Youngkil Song
Dongkeun Kim
Minsu Cho
Suha Kwak
241
3
0
06 Aug 2024
RICA2: Rubric-Informed, Calibrated Assessment of Actions
European Conference on Computer Vision (ECCV), 2024
Abrar Majeedi
Viswanatha Reddy Gajjala
Satya Sai Srinath Namburi Gnvv
Yin Li
CML
440
12
0
04 Aug 2024
Text-Guided Video Masked Autoencoder
European Conference on Computer Vision (ECCV), 2024
D. Fan
Jue Wang
Shuai Liao
Zhikang Zhang
Vimal Bhat
Xinyu Li
VGen
164
7
0
01 Aug 2024
Hyper-parameter tuning for text guided image editing
Shiwen Zhang
DiffM
226
3
0
31 Jul 2024
Start from Video-Music Retrieval: An Inter-Intra Modal Loss for Cross Modal Retrieval
Zeyu Chen
Pengfei Zhang
Kai Ye
Wei Dong
Xin Feng
Yana Zhang
230
1
0
28 Jul 2024
Is 3D Convolution with 5D Tensors Really Necessary for Video Analysis?
Habib Hajimolahoseini
Walid Ahmed
Austin Wen
Yang Liu
227
0
0
23 Jul 2024
SOAP: Enhancing Spatio-Temporal Relation and Motion Information Capturing for Few-Shot Action Recognition
ACM Multimedia (MM), 2024
Wenbo Huang
Jinghui Zhang
Xuwei Qian
Zhen Wu
Meng Wang
Lei Zhang
290
9
0
23 Jul 2024
A Comprehensive Review of Few-shot Action Recognition
Yuyang Wanyan
Xiaoshan Yang
Weiming Dong
Changsheng Xu
VLM
538
13
0
20 Jul 2024
Pose-guided multi-task video transformer for driver action recognition
Ricardo Pizarro
Roberto Valle
L. Bergasa
J. M. Buenaposada
Luis Baumela
ViT
195
1
0
18 Jul 2024
ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos
Hyolim Kang
Jeongseok Hyun
Joungbin An
Youngjae Yu
Seon Joo Kim
169
1
0
17 Jul 2024
Human-Centric Transformer for Domain Adaptive Action Recognition
Kun-Yu Lin
Jiaming Zhou
Wei-Shi Zheng
225
10
0
15 Jul 2024
Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding
Minghui Wu
Chenxu Zhao
Anyang Su
Donglin Di
Tianyu Fu
...
Min He
Ya Gao
Meng Ma
Kun Yan
Ping Wang
323
6
0
11 Jul 2024
Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization
Feixiang Zhou
Bryan M. Williams
Hossein Rahmani
209
3
0
10 Jul 2024
C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition
Rongchang Li
Zhenhua Feng
Tianyang Xu
Linze Li
Xiao-Jun Wu
Muhammad Awais
Sara Atito
Josef Kittler
CoGe
409
11
0
08 Jul 2024
DMSD-CDFSAR: Distillation from Mixed-Source Domain for Cross-Domain Few-shot Action Recognition
Fei-Yu Guo
YiKang Wang
Han Qi
Li Zhu
Jing Sun
371
6
0
08 Jul 2024
MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices
Jianwen Jiang
Gaojie Lin
Zhengkun Rong
Chao Liang
Yongming Zhu
Jiaqi Yang
Tianyun Zhong
3DH
401
13
0
08 Jul 2024
Computer Vision for Clinical Gait Analysis: A Gait Abnormality Video Dataset
Rahm Ranjan
David Ahmedt-Aristizabal
M. Armin
Juno Kim
245
13
0
05 Jul 2024
DyFADet: Dynamic Feature Aggregation for Temporal Action Detection
Le Yang
Ziwei Zheng
Yizeng Han
Hao-Ran Cheng
Shiji Song
Gao Huang
Fan Li
299
21
0
03 Jul 2024
SVFormer: A Direct Training Spiking Transformer for Efficient Video Action Recognition
Liutao Yu
Liwei Huang
Chenlin Zhou
Han Zhang
Zhengyu Ma
Huihui Zhou
Yonghong Tian
ViT
238
8
0
21 Jun 2024
Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation
Yuanhao Zhai
Kevin Lin
Zhengyuan Yang
Linjie Li
Jianfeng Wang
Chung-Ching Lin
David Doermann
Junsong Yuan
Lijuan Wang
VGen, DiffM
247
26
0
11 Jun 2024
SVASTIN: Sparse Video Adversarial Attack via Spatio-Temporal Invertible Neural Networks
Yi Pan
Jun-Jie Huang
Zihan Chen
Wentao Zhao
Ziyue Wang
196
5
0
04 Jun 2024
Object Aware Egocentric Online Action Detection
Joungbin An
Yunsu Park
Hyolim Kang
Seon Joo Kim
EgoV
210
1
0
03 Jun 2024
RNNs, CNNs and Transformers in Human Action Recognition: A Survey and a Hybrid Model
Khaled Alomar
Halil Ibrahim Aysel
Xiaohao Cai
MedIm, ViT
293
26
0
02 Jun 2024
Flow-Assisted Motion Learning Network for Weakly-Supervised Group Activity Recognition
Muhammad Adi Nugroho
Sangmin Woo
Sumin Lee
Jinyoung Park
Yooseung Wang
Donguk Kim
Changick Kim
174
3
0
28 May 2024
MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities
Hao Dong
Yue Zhao
Eleni Chatzi
Olga Fink
OODD
218
25
0
27 May 2024
Flow Snapshot Neurons in Action: Deep Neural Networks Generalize to Biological Motion Perception
Shuangpeng Han
Ziyu Wang
Mengmi Zhang
275
1
0
26 May 2024
From CNNs to Transformers in Multimodal Human Action Recognition: A Survey
Muhammad Bilal Shaikh
Syed Mohammed Shamsul Islam
Douglas Chai
Naveed Akhtar
347
30
0
22 May 2024
OpenGait: A Comprehensive Benchmark Study for Gait Recognition towards Better Practicality
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Chao Fan
Saihui Hou
Junhao Liang
Chuanfu Shen
Jingzhe Ma
Dongyang Jin
Yongzhen Huang
Shiqi Yu
316
18
0
15 May 2024
No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding
Yingjie Zhai
Wenshuo Li
Yehui Tang
Xinghao Chen
Yunhe Wang
ViT
225
2
0
14 May 2024
Deep video representation learning: a survey
Elham Ravanbakhsh
Yongqing Liang
J. Ramanujam
Xin Li
217
5
0
10 May 2024
A Survey on Backbones for Deep Video Action Recognition
Zixuan Tang
Youjun Zhao
Yuhang Wen
Mengyuan Liu
176
3
0
09 May 2024
Bidirectional Progressive Transformer for Interaction Intention Anticipation
European Conference on Computer Vision (ECCV), 2024
Zichen Zhang
Hongcheng Luo
Wei Zhai
Yang Cao
Yu Kang
325
8
0
09 May 2024
MVP-Shot: Multi-Velocity Progressive-Alignment Framework for Few-Shot Action Recognition
IEEE Transactions on Multimedia (IEEE TMM), 2024
Hongyu Qu
Rui Yan
Xiangbo Shu
Haoliang Gao
Peng Huang
Guo-Sen Xie
447
16
0
03 May 2024
Uncertainty-boosted Robust Video Activity Anticipation
Zhaobo Qi
Shuhui Wang
Weigang Zhang
Qingming Huang
292
10
0
29 Apr 2024
Movie101v2: Improved Movie Narration Benchmark
Zihao Yue
Yepeng Zhang
Ziheng Wang
Qin Jin
VGen
297
3
0
20 Apr 2024
STAT: Towards Generalizable Temporal Action Localization
Yangcen Liu
Ziyi Liu
Yuanhao Zhai
Wen Li
David Doermann
Junsong Yuan
237
3
0
20 Apr 2024
On the Content Bias in Fréchet Video Distance
Jason S. Hoffman
Aniruddha Mahapatra
Gaurav Parmar
Jun-Yan Zhu
Jia-Bin Huang
EGVM
256
32
0
18 Apr 2024