ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1608.00859
  4. Cited By
Temporal Segment Networks: Towards Good Practices for Deep Action
  Recognition

Temporal Segment Networks: Towards Good Practices for Deep Action Recognition

2 August 2016
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
    ViT
ArXiv (abs)PDFHTML

Papers citing "Temporal Segment Networks: Towards Good Practices for Deep Action Recognition"

50 / 1,449 papers shown
Spotting Temporally Precise, Fine-Grained Events in Video
Spotting Temporally Precise, Fine-Grained Events in VideoEuropean Conference on Computer Vision (ECCV), 2022
James Hong
Haotian Zhang
Michael Gharbi
Matthew Fisher
Kayvon Fatahalian
327
50
0
20 Jul 2022
ViGAT: Bottom-up event recognition and explanation in video using
  factorized graph attention network
ViGAT: Bottom-up event recognition and explanation in video using factorized graph attention networkIEEE Access (IEEE Access), 2022
Nikolaos Gkalelis
Dimitrios Daskalakis
Vasileios Mezaris
204
12
0
20 Jul 2022
Co-Located Human-Human Interaction Analysis using Nonverbal Cues: A
  Survey
Co-Located Human-Human Interaction Analysis using Nonverbal Cues: A SurveyACM Computing Surveys (ACM CSUR), 2022
Cigdem Beyan
Alessandro Vinciarelli
Alessio Del Bue
212
13
0
20 Jul 2022
Task-adaptive Spatial-Temporal Video Sampler for Few-shot Action
  Recognition
Task-adaptive Spatial-Temporal Video Sampler for Few-shot Action RecognitionACM Multimedia (ACM MM), 2022
Huabin Liu
Weixian Lv
John See
W. Lin
TTA
291
15
0
20 Jul 2022
Action Quality Assessment with Temporal Parsing Transformer
Action Quality Assessment with Temporal Parsing TransformerEuropean Conference on Computer Vision (ECCV), 2022
Yang Bai
Desen Zhou
Songyang Zhang
Jian Wang
Errui Ding
Yu Guan
Yang Long
Jingdong Wang
ViT
167
66
0
19 Jul 2022
Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based
  Action Recognition
Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based Action Recognition
Yansong Tang
Xingyu Liu
Xumin Yu
Danyang Zhang
Jiwen Lu
Jie Zhou
258
23
0
17 Jul 2022
Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision
  and Language Models
Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models
Rui Qian
Yeqing Li
Zheng Xu
Ming-Hsuan Yang
Serge Belongie
Huayu Chen
VLM
182
26
0
15 Jul 2022
Semi-Supervised Temporal Action Detection with Proposal-Free Masking
Semi-Supervised Temporal Action Detection with Proposal-Free MaskingEuropean Conference on Computer Vision (ECCV), 2022
Sauradip Nag
Xiatian Zhu
Yi-Zhe Song
Tao Xiang
148
21
0
14 Jul 2022
Forcing the Whole Video as Background: An Adversarial Learning Strategy
  for Weakly Temporal Action Localization
Forcing the Whole Video as Background: An Adversarial Learning Strategy for Weakly Temporal Action LocalizationACM Multimedia (ACM MM), 2022
Ziqiang Li
Yongxin Ge
Jiaruo Yu
Zhongming Chen
203
19
0
14 Jul 2022
Proposal-Free Temporal Action Detection via Global Segmentation Mask
  Learning
Proposal-Free Temporal Action Detection via Global Segmentation Mask LearningEuropean Conference on Computer Vision (ECCV), 2022
Sauradip Nag
Xiatian Zhu
Yi-Zhe Song
Tao Xiang
ViT
198
56
0
14 Jul 2022
Compound Prototype Matching for Few-shot Action Recognition
Compound Prototype Matching for Few-shot Action RecognitionEuropean Conference on Computer Vision (ECCV), 2022
Yifei Huang
Lijin Yang
Yoichi Sato
364
59
0
12 Jul 2022
Robotic Detection of a Human-Comprehensible Gestural Language for
  Underwater Multi-Human-Robot Collaboration
Robotic Detection of a Human-Comprehensible Gestural Language for Underwater Multi-Human-Robot CollaborationIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2022
Sadman Sakib Enan
Michael Fulton
Junaed Sattar
116
9
0
12 Jul 2022
Efficient Human Vision Inspired Action Recognition using Adaptive
  Spatiotemporal Sampling
Efficient Human Vision Inspired Action Recognition using Adaptive Spatiotemporal SamplingIEEE Transactions on Image Processing (IEEE TIP), 2022
Khoi-Nguyen C. Mac
Minh Do
Minh Vo
TTA
282
3
0
12 Jul 2022
LaT: Latent Translation with Cycle-Consistency for Video-Text Retrieval
LaT: Latent Translation with Cycle-Consistency for Video-Text Retrieval
Jinbin Bai
Chunhui Liu
Feiyue Ni
Haofan Wang
Mengying Hu
Xiaofeng Guo
Lele Cheng
182
14
0
11 Jul 2022
1st Place Solution to the EPIC-Kitchens Action Anticipation Challenge
  2022
1st Place Solution to the EPIC-Kitchens Action Anticipation Challenge 2022
Zeyu Jiang
Changxing Ding
EgoV
149
1
0
10 Jul 2022
Beyond Transfer Learning: Co-finetuning for Action Localisation
Beyond Transfer Learning: Co-finetuning for Action Localisation
Anurag Arnab
Xuehan Xiong
A. Gritsenko
Rob Romijnders
Josip Djolonga
Mostafa Dehghani
Chen Sun
Mario Lucic
Cordelia Schmid
266
10
0
08 Jul 2022
VidConv: A modernized 2D ConvNet for Efficient Video Recognition
VidConv: A modernized 2D ConvNet for Efficient Video Recognition
Chuong H. Nguyen
Su Huynh
Vinh Nguyen
Ngoc-Khanh Nguyen
ViT
181
3
0
08 Jul 2022
Video-based Smoky Vehicle Detection with A Coarse-to-Fine Framework
Video-based Smoky Vehicle Detection with A Coarse-to-Fine Framework
Xiaojiang Peng
Xiaomao Fan
Q. Wu
Jieyan Zhao
Pan Gao
51
3
0
08 Jul 2022
MVP: Robust Multi-View Practice for Driving Action Localization
MVP: Robust Multi-View Practice for Driving Action LocalizationInternational Conference on Information Systems and Computer Aided Education (ICISCAE), 2022
Jingjie Shang
Kunchang Li
Kaibin Tian
Haisheng Su
Yangguang Li
180
3
0
05 Jul 2022
TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of
  3D Human Motions and Texts
TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and TextsEuropean Conference on Computer Vision (ECCV), 2022
Chuan Guo
Xinxin Xuo
Sen Wang
Li Cheng
VGen
464
344
0
04 Jul 2022
Large-scale Robustness Analysis of Video Action Recognition Models
Large-scale Robustness Analysis of Video Action Recognition ModelsComputer Vision and Pattern Recognition (CVPR), 2022
Madeline Chantry Schiappa
Naman Biyani
Prudvi Kamtam
Shruti Vyas
Hamid Palangi
Vibhav Vineet
Yogesh S Rawat
AAML
280
36
0
04 Jul 2022
Continuous Sign Language Recognition via Temporal Super-Resolution
  Network
Continuous Sign Language Recognition via Temporal Super-Resolution NetworkThe Arabian journal for science and engineering (AJSE), 2022
Qidan Zhu
Jing Li
Fei Yuan
Quan Gan
SLR
137
12
0
03 Jul 2022
Skeleton-based Action Recognition via Adaptive Cross-Form Learning
Skeleton-based Action Recognition via Adaptive Cross-Form LearningACM Multimedia (ACM MM), 2022
Xuanhan Wang
Yan Dai
Lianli Gao
Jingkuan Song
209
32
0
30 Jun 2022
Multi-Scale Spatial Temporal Graph Convolutional Network for
  Skeleton-Based Action Recognition
Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action RecognitionAAAI Conference on Artificial Intelligence (AAAI), 2021
Zhan Chen
Sicheng Li
Bing Yang
Qinghan Li
Hong Liu
165
318
0
27 Jun 2022
Explore Spatio-temporal Aggregation for Insubstantial Object Detection:
  Benchmark Dataset and Baseline
Explore Spatio-temporal Aggregation for Insubstantial Object Detection: Benchmark Dataset and BaselineComputer Vision and Pattern Recognition (CVPR), 2022
Kailai Zhou
Yibo Wang
Tao Lv
Yunqian Li
Linsen Chen
Qiu Shen
Xun Cao
211
18
0
23 Jun 2022
Bi-Calibration Networks for Weakly-Supervised Video Representation
  Learning
Bi-Calibration Networks for Weakly-Supervised Video Representation LearningInternational Journal of Computer Vision (IJCV), 2022
Fuchen Long
Ting Yao
Zhaofan Qiu
Xinmei Tian
Jiebo Luo
Tao Mei
255
9
0
21 Jun 2022
Pyramid Region-based Slot Attention Network for Temporal Action Proposal
  Generation
Pyramid Region-based Slot Attention Network for Temporal Action Proposal GenerationBritish Machine Vision Conference (BMVC), 2022
Shuaicheng Li
Feng Zhang
Ruiwei Zhao
Rui Feng
Kunlin Yang
Lin-Na Liu
Jun Hou
ViT
194
6
0
21 Jun 2022
Self-Supervised Learning for Videos: A Survey
Self-Supervised Learning for Videos: A SurveyACM Computing Surveys (ACM CSUR), 2022
Madeline Chantry Schiappa
Yogesh S Rawat
M. Shah
SSL
480
168
0
18 Jun 2022
Scalable Temporal Localization of Sensitive Activities in Movies and TV
  Episodes
Scalable Temporal Localization of Sensitive Activities in Movies and TV Episodes
Xiang Hao
Jingxiang Chen
Shixing Chen
Ahmed Saad
Raffay Hamid
AI4TS
206
0
0
16 Jun 2022
Human Eyes Inspired Recurrent Neural Networks are More Robust Against
  Adversarial Noises
Human Eyes Inspired Recurrent Neural Networks are More Robust Against Adversarial NoisesNeural Computation (Neural Comput.), 2022
Minkyu Choi
Yizhen Zhang
Kuan Han
Xiaokai Wang
Zhongming Liu
AAMLGAN
144
6
0
15 Jun 2022
It's Time for Artistic Correspondence in Music and Video
It's Time for Artistic Correspondence in Music and VideoComputer Vision and Pattern Recognition (CVPR), 2022
Dídac Surís
Carl Vondrick
Bryan C. Russell
Justin Salamon
160
42
0
14 Jun 2022
Stand-Alone Inter-Frame Attention in Video Models
Stand-Alone Inter-Frame Attention in Video ModelsComputer Vision and Pattern Recognition (CVPR), 2022
Fuchen Long
Zhaofan Qiu
Yingwei Pan
Ting Yao
Jiebo Luo
Tao Mei
ViT
189
59
0
14 Jun 2022
Lost in Transmission: On the Impact of Networking Corruptions on Video
  Machine Learning Models
Lost in Transmission: On the Impact of Networking Corruptions on Video Machine Learning Models
Trenton Chang
Daniel Y. Fu
115
2
0
10 Jun 2022
GateHUB: Gated History Unit with Background Suppression for Online
  Action Detection
GateHUB: Gated History Unit with Background Suppression for Online Action DetectionComputer Vision and Pattern Recognition (CVPR), 2022
Junwen Chen
Gaurav Mittal
Ye Yu
Yu Kong
Mei Chen
237
53
0
09 Jun 2022
PrivHAR: Recognizing Human Actions From Privacy-preserving Lens
PrivHAR: Recognizing Human Actions From Privacy-preserving LensEuropean Conference on Computer Vision (ECCV), 2022
Carlos Hinojosa
M. Márquez
Henry Arguello
Ehsan Adeli
L. Fei-Fei
Juan Carlos Niebles
PICV
255
26
0
08 Jun 2022
Revealing Single Frame Bias for Video-and-Language Learning
Revealing Single Frame Bias for Video-and-Language LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Jie Lei
Tamara L. Berg
Joey Tianyi Zhou
239
142
0
07 Jun 2022
Revisiting the "Video" in Video-Language Understanding
Revisiting the "Video" in Video-Language UnderstandingComputer Vision and Pattern Recognition (CVPR), 2022
S. Buch
Cristobal Eyzaguirre
Adrien Gaidon
Jiajun Wu
L. Fei-Fei
Juan Carlos Niebles
216
202
0
03 Jun 2022
Future Transformer for Long-term Action Anticipation
Future Transformer for Long-term Action AnticipationComputer Vision and Pattern Recognition (CVPR), 2022
Dayoung Gong
Joonseok Lee
Manjin Kim
S. Ha
Minsu Cho
AI4TS
128
83
0
27 May 2022
PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences
PSTNet: Point Spatio-Temporal Convolution on Point Cloud SequencesInternational Conference on Learning Representations (ICLR), 2022
Hehe Fan
Xin Yu
Yuhang Ding
Yi Yang
Mohan Kankanhalli
3DPC
333
133
0
27 May 2022
Learning What and Where: Disentangling Location and Identity Tracking
  Without Supervision
Learning What and Where: Disentangling Location and Identity Tracking Without SupervisionInternational Conference on Learning Representations (ICLR), 2022
Manuel Traub
S. Otte
Tobias Menge
Matthias Karlbauer
Jannik Thummel
Martin Volker Butz
404
22
0
26 May 2022
Learning Muti-expert Distribution Calibration for Long-tailed Video
  Classification
Learning Muti-expert Distribution Calibration for Long-tailed Video ClassificationIEEE transactions on multimedia (IEEE TMM), 2022
Yufan Hu
Junyu Gao
Changsheng Xu
119
9
0
22 May 2022
Structured Attention Composition for Temporal Action Localization
Structured Attention Composition for Temporal Action LocalizationIEEE Transactions on Image Processing (IEEE TIP), 2022
Le Yang
Junwei Han
Tao Zhao
Nian Liu
Dingwen Zhang
202
18
0
20 May 2022
A CLIP-Hitchhiker's Guide to Long Video Retrieval
A CLIP-Hitchhiker's Guide to Long Video Retrieval
Max Bain
Arsha Nagrani
Gül Varol
Andrew Zisserman
CLIP
422
73
0
17 May 2022
Representation Learning for Compressed Video Action Recognition via Attentive Cross-modal Interaction with Motion Enhancement
Representation Learning for Compressed Video Action Recognition via Attentive Cross-modal Interaction with Motion EnhancementInternational Joint Conference on Artificial Intelligence (IJCAI), 2022
Bing Li
Jiaxin Chen
Dongming Zhang
Xiuguo Bao
Di Huang
160
19
0
07 May 2022
BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection
BasicTAD: an Astounding RGB-Only Baseline for Temporal Action DetectionComputer Vision and Image Understanding (CVIU), 2022
Mingdong Yang
Guo Chen
Yin-Dong Zheng
Tong Lu
Limin Wang
287
54
0
05 May 2022
Unsupervised Domain Adaptation Learning for Hierarchical Infant Pose
  Recognition with Synthetic Data
Unsupervised Domain Adaptation Learning for Hierarchical Infant Pose Recognition with Synthetic DataIEEE International Conference on Multimedia and Expo (ICME), 2022
Cheng-Yen Yang
Zhongyu Jiang
Shi Gu
Lei Li
Jang-Hee Yoo
3DH
112
6
0
04 May 2022
In Defense of Image Pre-Training for Spatiotemporal Recognition
In Defense of Image Pre-Training for Spatiotemporal RecognitionEuropean Conference on Computer Vision (ECCV), 2022
Xianhang Li
Huiyu Wang
Chen Wei
Jieru Mei
Alan Yuille
Yuyin Zhou
Cihang Xie
166
1
0
03 May 2022
Cross-modal Representation Learning for Zero-shot Action Recognition
Cross-modal Representation Learning for Zero-shot Action RecognitionComputer Vision and Pattern Recognition (CVPR), 2022
Chung-Ching Lin
Kevin Qinghong Lin
Linjie Li
Lijuan Wang
Zicheng Liu
ViT
152
29
0
03 May 2022
CenterCLIP: Token Clustering for Efficient Text-Video Retrieval
CenterCLIP: Token Clustering for Efficient Text-Video RetrievalAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2022
Shuai Zhao
Linchao Zhu
Xiaohan Wang
Yi Yang
VLMCLIP
203
152
0
02 May 2022
Tragedy Plus Time: Capturing Unintended Human Activities from
  Weakly-labeled Videos
Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos
Arnav Chakravarthy
Zhiyuan Fang
Yezhou Yang
154
2
0
28 Apr 2022
Previous
123...91011...272829
Next
Page 10 of 29
Pageof 29