Temporal Segment Networks: Towards Good Practices for Deep Action Recognition

2 August 2016
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
    ViT

Papers citing "Temporal Segment Networks: Towards Good Practices for Deep Action Recognition"

50 / 1,449 papers shown
Are current long-term video understanding datasets long-term?
Ombretta Strafforello
Klamer Schutte
Jan van Gemert
207
10
0
22 Aug 2023
Temporal-Distributed Backdoor Attack Against Video Based Action Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2023
Xi Li
Songhe Wang
Rui Huang
Mahanth K. Gowda
G. Kesidis
AAML
391
7
0
21 Aug 2023
ResQ: Residual Quantization for Video Perception
IEEE International Conference on Computer Vision (ICCV), 2023
Davide Abati
H. Yahia
Markus Nagel
A. Habibian
MQ
221
3
0
18 Aug 2023
Boosting Few-shot Action Recognition with Graph-guided Hybrid Matching
IEEE International Conference on Computer Vision (ICCV), 2023
Jiazheng Xing
Mengmeng Wang
Yudi Ruan
Bofan Chen
Yaowei Guo
B. Mu
Guangwen Dai
Jingdong Wang
Yong-Jin Liu
207
34
0
18 Aug 2023
Unlimited Knowledge Distillation for Action Recognition in the Dark
Ruibing Jin
Guosheng Lin
Ruibing Jin
Jie Lin
Zhengguo Li
Xiaoli Li
Zhenghua Chen
155
2
0
18 Aug 2023
Progression-Guided Temporal Action Detection in Videos
Chongkai Lu
Man-Wai Mak
Ruimin Li
Z. Chi
Hong Fu
AI4TS
167
0
0
18 Aug 2023
Memory-and-Anticipation Transformer for Online Action Understanding
IEEE International Conference on Computer Vision (ICCV), 2023
Jiahao Wang
Guo Chen
Yifei Huang
Liming Wang
Tong Lu
OffRL
303
60
0
15 Aug 2023
ViGT: Proposal-free Video Grounding with Learnable Token in Transformer
Science China Information Sciences (Sci China Inf Sci), 2023
Kun Li
Dan Guo
Meng Wang
ViT
156
62
0
11 Aug 2023
Temporally-Adaptive Models for Efficient Video Understanding
Ziyuan Huang
Shiwei Zhang
Liang Pan
Zhiwu Qing
Yingya Zhang
Ziwei Liu
Marcelo H. Ang
205
17
0
10 Aug 2023
JEDI: Joint Expert Distillation in a Semi-Supervised Multi-Dataset Student-Teacher Scenario for Video Action Recognition
L. Bicsi
B. Alexe
Radu Tudor Ionescu
Marius Leordeanu
257
2
0
09 Aug 2023
View while Moving: Efficient Video Recognition in Long-untrimmed Videos
ACM Multimedia (ACM MM), 2023
Ye Tian
Meng Yang
Lanshan Zhang
Zhizhen Zhang
Yang Liu
Xiao-Zhu Xie
Xirong Que
Wendong Wang
261
10
0
09 Aug 2023
Long-Distance Gesture Recognition using Dynamic Neural Networks
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023
Shubhang Bhatnagar
S. Gopal
Narendra Ahuja
Liu Ren
191
6
0
09 Aug 2023
SSTFormer: Bridging Spiking Neural Network and Memory Support Transformer for Frame-Event based Recognition
IEEE Transactions on Cognitive and Developmental Systems (IEEE TCDS), 2023
Tianlin Li
Zong-Yao Wu
Yao Rong
Lin Zhu
Bowei Jiang
Jin Tang
Yonghong Tian
ViT
369
23
0
08 Aug 2023
ViLP: Knowledge Exploration using Vision, Language, and Pose Embeddings for Video Action Recognition
Indian Conference on Computer Vision, Graphics & Image Processing (ICVGIP), 2023
S. Chaudhuri
Saumik Bhattacharya
175
6
0
07 Aug 2023
M$^3$Net: Multi-view Encoding, Matching, and Fusion for Few-shot Fine-grained Action Recognition
ACM Multimedia (ACM MM), 2023
Hao Tang
Jun Liu
Shuanglin Yan
Rui Yan
Zechao Li
Jinhui Tang
278
74
0
06 Aug 2023
Multimodal Adaptation of CLIP for Few-Shot Action Recognition
Pattern Recognition (Pattern Recogn.), 2023
Jiazheng Xing
Mengmeng Wang
Xiaojun Hou
Guangwen Dai
Jingdong Wang
Yong-Jin Liu
VLM
181
1
0
03 Aug 2023
SkateboardAI: The Coolest Video Action Recognition for Skateboarding
AAAI Conference on Artificial Intelligence (AAAI), 2023
Hanxiao Chen
ViT
118
4
0
02 Aug 2023
MAiVAR-T: Multimodal Audio-image and Video Action Recognizer using Transformers
European Workshop on Visual Information Processing (EUVIP), 2023
Muhammad Bilal Shaikh
Douglas Chai
Syed Mohammed Shamsul Islam
Naveed Akhtar
293
7
0
01 Aug 2023
Sample Less, Learn More: Efficient Action Recognition via Frame Feature Restoration
ACM Multimedia (ACM MM), 2023
Harry Cheng
Yangyang Guo
Liqiang Nie
Zhiyong Cheng
Mohan S. Kankanhalli
220
9
0
27 Jul 2023
Unlocking the Emotional World of Visual Media: An Overview of the Science, Research, and Impact of Understanding Emotion
Proceedings of the IEEE (Proc. IEEE), 2023
James Z. Wang
Sicheng Zhao
Chenyan Wu
Reginald B. Adams
M. Newman
T. Shafir
Rachelle Tsachor
338
54
0
25 Jul 2023
Spatiotemporal Modeling Encounters 3D Medical Image Analysis: Slice-Shift UNet with Multi-View Fusion
International Conference on Machine Vision and Applications (ICMVA), 2023
C. Ugwu
S. Casarin
Oswald Lanz
179
0
0
24 Jul 2023
In Defense of Clip-based Video Relation Detection
IEEE Transactions on Image Processing (IEEE TIP), 2023
Meng Wei
Long Chen
Wei Ji
Xiaoyu Yue
Roger Zimmermann
179
7
0
18 Jul 2023
What Can Simple Arithmetic Operations Do for Temporal Modeling?
IEEE International Conference on Computer Vision (ICCV), 2023
Wenhao Wu
Yuxin Song
Zhun Sun
Jingdong Wang
Chang Xu
Wanli Ouyang
212
17
0
18 Jul 2023
Video-Mined Task Graphs for Keystep Recognition in Instructional Videos
Neural Information Processing Systems (NeurIPS), 2023
Kumar Ashutosh
Santhosh Kumar Ramakrishnan
Triantafyllos Afouras
Kristen Grauman
299
37
0
17 Jul 2023
SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-training
IEEE International Conference on Computer Vision (ICCV), 2023
Hongfei Yan
Zehua Wang
Yushen Wei
Zerui Li
Guanbin Li
Guanbin Li
279
64
0
17 Jul 2023
Multimodal Distillation for Egocentric Action Recognition
IEEE International Conference on Computer Vision (ICCV), 2023
Gorjan Radevski
Dusan Grujicic
Marie-Francine Moens
Matthew Blaschko
Tinne Tuytelaars
EgoV
331
35
0
14 Jul 2023
RVD: A Handheld Device-Based Fundus Video Dataset for Retinal Vessel Segmentation
Neural Information Processing Systems (NeurIPS), 2023
MD Wahiduzzaman Khan
Hong Sheng
Hu Zhang
Heming Du
Sen Wang
...
Jack Phu
A. Agar
Zichen Huang
M. Golzan
Xin Yu
160
10
0
13 Jul 2023
VS-TransGRU: A Novel Transformer-GRU-based Framework Enhanced by Visual-Semantic Fusion for Egocentric Action Anticipation
Congqi Cao
Ze Sun
Qinyi Lv
Lingtong Min
Yanning Zhang
ViT
159
8
0
08 Jul 2023
MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic Facial Expression Recognition
ACM Multimedia (ACM MM), 2023
Guoying Zhao
Zheng Lian
B. Liu
Jianhua Tao
238
73
0
05 Jul 2023
Task-Specific Alignment and Multiple Level Transformer for Few-Shot Action Recognition
Neurocomputing, 2023
Fei-Yu Guo
Li Zhu
Yiwang Wang
Jing Sun
ViT
232
10
0
05 Jul 2023
Streaming egocentric action anticipation: An evaluation scheme and approach
Computer Vision and Image Understanding (CVIU), 2023
Antonino Furnari
G. Farinella
EgoV
177
5
0
29 Jun 2023
Bullying10K: A Large-Scale Neuromorphic Dataset towards Privacy-Preserving Bullying Recognition
Neural Information Processing Systems (NeurIPS), 2023
Yiting Dong
Yang Li
Dongcheng Zhao
Guobin Shen
Yi Zeng
185
20
0
20 Jun 2023
E2E-LOAD: End-to-End Long-form Online Action Detection
IEEE International Conference on Computer Vision (ICCV), 2023
Shuyuan Cao
Weihua Luo
Bairui Wang
Wei Emma Zhang
Lin Ma
216
12
0
13 Jun 2023
Enhanced Multimodal Representation Learning with Cross-modal KD
Computer Vision and Pattern Recognition (CVPR), 2023
Mengxi Chen
Linyu Xing
Yu Wang
Ya Zhang
149
18
0
13 Jun 2023
Action Recognition with Multi-stream Motion Modeling and Mutual Information Maximization
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Yu-Huan Yang
Haipeng Chen
Zhenguang Liu
Y. Lyu
Beibei Zhang
Shuang Wu
Peng Kuang
Kui Ren
246
10
0
13 Jun 2023
Boosting Breast Ultrasound Video Classification by the Guidance of Keyframe Feature Centers
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2023
AnLan Sun
Zhao Zhang
Meng Lei
Yuting Dai
Dong Wang
Liwei Wang
153
12
0
12 Jun 2023
Optimizing ViViT Training: Time and Memory Reduction for Action Recognition
Shreyank N. Gowda
Anurag Arnab
Jonathan Huang
ViT
182
4
0
07 Jun 2023
Atrial Septal Defect Detection in Children Based on Ultrasound Video Using Multiple Instances Learning
Yiman Liu
Qingming Huang
Xiaoxiang Han
Tongtong Liang
Zhi-fang Zhang
...
Angelos Stefanidis
Jionglong Su
Jiangang Chen
Qingli Li
Yuqi Zhang
139
13
0
06 Jun 2023
Retrieval-Enhanced Visual Prompt Learning for Few-shot Classification
Jintao Rong
Hao Chen
Tianrun Chen
Linlin Ou
Xinyi Yu
Yifan Liu
VLM
VPVLM
194
8
0
04 Jun 2023
VideoComposer: Compositional Video Synthesis with Motion Controllability
Neural Information Processing Systems (NeurIPS), 2023
Xiang Wang
Hangjie Yuan
Shiwei Zhang
Dayou Chen
Jiuniu Wang
Yingya Zhang
Yujun Shen
Deli Zhao
Jingren Zhou
VGen
DiffM
478
459
0
03 Jun 2023
VIPriors 3: Visual Inductive Priors for Data-Efficient Deep Learning Challenges
Robert-Jan Bruintjes
A. Lengyel
Marcos Baptista-Rios
O. Kayhan
Davide Zambrano
Nergis Tomen
Jan van Gemert
188
9
0
31 May 2023
Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization
Computer Vision and Pattern Recognition (CVPR), 2023
Huantao Ren
Wenfei Yang
Tianzhu Zhang
Yongdong Zhang
360
42
0
29 May 2023
Action Sensitivity Learning for Temporal Action Localization
IEEE International Conference on Computer Vision (ICCV), 2023
Jiayi Shao
Xiaohan Wang
Ruijie Quan
Junjun Zheng
Jiang Yang
Yezhou Yang
331
41
0
25 May 2023
Cross-view Action Recognition Understanding From Exocentric to Egocentric Perspective
Neurocomputing, 2023
Thanh-Dat Truong
Khoa Luu
EgoV
389
15
0
25 May 2023
TVTSv2: Learning Out-of-the-box Spatiotemporal Visual Representations at Scale
Ziyun Zeng
Yixiao Ge
Zhan Tong
Xihui Liu
Shutao Xia
Ying Shan
275
13
0
23 May 2023
VideoLLM: Modeling Video Sequence with Large Language Models
Guo Chen
Yin-Dong Zheng
Jiahao Wang
Jilan Xu
Yifei Huang
...
Yi Wang
Yali Wang
Yu Qiao
Tong Lu
Limin Wang
MLLM
261
112
0
22 May 2023
Learning Higher-order Object Interactions for Keypoint-based Video Understanding
Yi Huang
Asim Kadav
Farley Lai
Deep Patel
H. Graf
125
1
0
16 May 2023
Exploring Few-Shot Adaptation for Activity Recognition on Diverse Domains
Kunyu Peng
Di Wen
David Schneider
Kailai Li
Kailun Yang
M. Sarfraz
Rainer Stiefelhagen
Alina Roitberg
335
3
0
15 May 2023
CEMFormer: Learning to Predict Driver Intentions from In-Cabin and External Cameras via Spatial-Temporal Transformers
Yunsheng Ma
Wenqian Ye
Xu Cao
Lingxi Li
Kyungtae Han
Rohit Gupta
Ziran Wang
171
18
0
13 May 2023
Few-shot Action Recognition via Intra- and Inter-Video Information Maximization
Huabin Liu
W. Lin
Yun Xu
Yuxi Li
Shuyuan Li
John See
225
9
0
10 May 2023