ResearchTrend.AI
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition

2 August 2016
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
    ViT

Papers citing "Temporal Segment Networks: Towards Good Practices for Deep Action Recognition"

50 / 1,449 papers shown
Knowing What, Where and When to Look: Efficient Video Action Modeling with Attention
Juan-Manuel Perez-Rua
Brais Martínez
Xiatian Zhu
Antoine Toisoul
Victor Escorcia
Tao Xiang
228
21
0
02 Apr 2020
Weakly-Supervised Action Localization with Expectation-Maximization Multi-Instance Learning
European Conference on Computer Vision (ECCV), 2020
Zhekun Luo
Devin Guillory
Baifeng Shi
Wei Ke
Fang Wan
Trevor Darrell
Huijuan Xu
485
131
0
31 Mar 2020
Explaining Motion Relevance for Activity Recognition in Video Deep Learning Models
Liam Hiley
Alun D. Preece
Y. Hicks
Supriyo Chakraborty
Prudhvi K. Gurram
Richard J. Tomsett
FAtt
196
16
0
31 Mar 2020
Spatio-Temporal Graph for Video Captioning with Knowledge Distillation
Computer Vision and Pattern Recognition (CVPR), 2020
Boxiao Pan
Haoye Cai
De-An Huang
Kuan-Hui Lee
Adrien Gaidon
Ehsan Adeli
Juan Carlos Niebles
240
264
0
31 Mar 2020
Speech2Action: Cross-modal Supervision for Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2020
Arsha Nagrani
Chen Sun
David A. Ross
Rahul Sukthankar
Cordelia Schmid
Andrew Zisserman
166
59
0
30 Mar 2020
Learning Interactions and Relationships between Movie Characters
Computer Vision and Pattern Recognition (CVPR), 2020
Anna Kukleva
Makarand Tapaswi
Ivan Laptev
227
57
0
29 Mar 2020
Learning a Weakly-Supervised Video Actor-Action Segmentation Model with a Wise Selection
Computer Vision and Pattern Recognition (CVPR), 2020
Jie Chen
Zhiheng Li
Jiebo Luo
Chenliang Xu
166
15
0
29 Mar 2020
Omni-sourced Webly-supervised Learning for Video Recognition
European Conference on Computer Vision (ECCV), 2020
Haodong Duan
Yue Zhao
Yuanjun Xiong
Wentao Liu
Dahua Lin
VLM
204
101
0
29 Mar 2020
CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D Networks
AAAI Conference on Artificial Intelligence (AAAI), 2020
Qihang Yu
Yingwei Li
Jieru Mei
Yuyin Zhou
Alan Yuille
3DPC
215
3
0
28 Mar 2020
Actor-Transformers for Group Activity Recognition
Computer Vision and Pattern Recognition (CVPR), 2020
Kirill Gavrilyuk
Ryan Sanford
Mehrsan Javan
Cees G. M. Snoek
ViT
174
210
0
28 Mar 2020
Weakly-Supervised Action Localization by Generative Attention Modeling
Computer Vision and Pattern Recognition (CVPR), 2020
Baifeng Shi
Jingdong Sun
Yadong Mu
Jingdong Wang
WSOL
221
166
0
27 Mar 2020
Video Object Grounding using Semantic Roles in Language Description
Computer Vision and Pattern Recognition (CVPR), 2020
Arka Sadhu
Kan Chen
Ram Nevatia
231
50
0
24 Mar 2020
Comprehensive Instructional Video Analysis: The COIN Dataset and Performance Evaluation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Yansong Tang
Jiwen Lu
Jie Zhou
186
42
0
20 Mar 2020
STH: Spatio-Temporal Hybrid Convolution for Efficient Action Recognition
Xu Li
Jingwen Wang
Lin Ma
Kaihao Zhang
Fengzong Lian
Zhanhui Kang
Jinjun Wang
140
5
0
18 Mar 2020
SF-Net: Single-Frame Supervision for Temporal Action Localization
European Conference on Computer Vision (ECCV), 2020
Fan Ma
Linchao Zhu
Yi Yang
Shengxin Cindy Zha
Gourab Kundu
Matt Feiszli
Zheng Shou
377
160
0
15 Mar 2020
Energy-based Periodicity Mining with Deep Features for Action Repetition Counting in Unconstrained Videos
Jianqin Yin
Yanchun Wu
Huaping Liu
Yonghao Dang
Zhiyi Liu
Jun Liu
105
13
0
15 Mar 2020
Is There Tradeoff between Spatial and Temporal in Video Super-Resolution?
Haochen Zhang
Dong Liu
Zhiwei Xiong
SupR
117
2
0
13 Mar 2020
Top-1 Solution of Multi-Moments in Time Challenge 2019
Manyuan Zhang
Hao Shao
Guanglu Song
Yu Liu
Junjie Yan
125
3
0
12 Mar 2020
MoVi: A Large Multipurpose Motion and Video Dataset
PLoS ONE (PLOS ONE), 2020
Saeed Ghorbani
Kimia Mahdaviani
A. Thaler
Konrad Paul Kording
D. Cook
Gunnar Blohm
N. Troje
225
92
0
04 Mar 2020
Rethinking Zero-shot Video Classification: End-to-end Training for Realistic Applications
Computer Vision and Pattern Recognition (CVPR), 2020
Biagio Brattoli
Joseph Tighe
Fedor Zhdanov
Pietro Perona
Krzysztof Chalupka
VLM
552
136
0
03 Mar 2020
DriverMHG: A Multi-Modal Dataset for Dynamic Recognition of Driver Micro Hand Gestures and a Real-Time Recognition Framework
IEEE International Conference on Automatic Face & Gesture Recognition (FG), 2020
Okan Kopuklu
Thomas Ledwon
Yao Rong
Neslihan Köse
Gerhard Rigoll
208
26
0
02 Mar 2020
VideoSSL: Semi-Supervised Learning for Video Classification
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020
Longlong Jing
T. Parag
Zhe Wu
Yingli Tian
Hongcheng Wang
142
60
0
29 Feb 2020
Fine-Grained Instance-Level Sketch-Based Video Retrieval
Peng Xu
Kun Liu
Tao Xiang
Timothy M. Hospedales
Zhanyu Ma
Jun Guo
Yi-Zhe Song
250
37
0
21 Feb 2020
Spatiotemporal Relationship Reasoning for Pedestrian Intent Prediction
IEEE Robotics and Automation Letters (RA-L), 2020
Bingbin Liu
Ehsan Adeli
Zhangjie Cao
Kuan-Hui Lee
Abhijeet Shenoi
Adrien Gaidon
Juan Carlos Niebles
289
181
0
20 Feb 2020
Human Action Recognition using Local Two-Stream Convolution Neural Network Features and Support Vector Machines
David Torpey
Turgay Celik
54
8
0
19 Feb 2020
Knowledge Integration Networks for Action Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2020
Shiwen Zhang
Sheng Guo
Limin Wang
Weilin Huang
Matthew R. Scott
231
20
0
18 Feb 2020
V4D:4D Convolutional Neural Networks for Video-level Representation Learning
International Conference on Learning Representations (ICLR), 2020
Shiwen Zhang
Sheng Guo
Weilin Huang
Matthew R. Scott
Limin Wang
156
80
0
18 Feb 2020
A Survey on 3D Skeleton-Based Action Recognition Using Learning Method
Cyborg and Bionic Systems (CBS), 2020
Bin Ren
Mengyuan Liu
Runwei Ding
Hong Liu
293
178
0
14 Feb 2020
An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos
AAAI Conference on Artificial Intelligence (AAAI), 2020
Sicheng Zhao
Yunsheng Ma
Yang Gu
Jufeng Yang
Tengfei Xing
Pengfei Xu
Runbo Hu
Hua Chai
Kurt Keutzer
179
116
0
12 Feb 2020
Learning spatio-temporal representations with temporal squeeze pooling
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Guoxi Huang
A. Bors
ViT
157
14
0
11 Feb 2020
Self-Supervised Joint Encoding of Motion and Appearance for First Person Action Recognition
M. Planamente
A. Bottino
Barbara Caputo
EgoV
153
3
0
10 Feb 2020
Dynamic Inference: A New Approach Toward Efficient Video Action Recognition
Wenhao Wu
Dongliang He
Xiao Tan
Shifeng Chen
Yi Yang
Shilei Wen
172
37
0
09 Feb 2020
CTM: Collaborative Temporal Modeling for Action Recognition
Li-Yu Daisy Liu
Tao Wang
Jie Liu
Yang Guan
Qi Bu
Longfei Yang
TTA
97
0
0
08 Feb 2020
Symbiotic Attention with Privileged Information for Egocentric Action Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2020
Xiaohan Wang
Yu Wu
Linchao Zhu
Yi Yang
189
65
0
08 Feb 2020
iqiyi Submission to ActivityNet Challenge 2019 Kinetics-700 challenge: Hierarchical Group-wise Attention
Li-Yu Daisy Liu
Dongyang Cai
Jie Liu
Nan Ding
Tao Wang
89
0
0
07 Feb 2020
Learning Class Regularized Features for Action Recognition
Alexandros Stergiou
R. Poppe
R. Veltkamp
94
4
0
07 Feb 2020
An Information-rich Sampling Technique over Spatio-Temporal CNN for Classification of Human Actions in Videos
Multimedia Tools and Applications (MTA), 2020
S. H. Shabbeer Basha
Viswanath Pulabaigari
Snehasis Mukherjee
135
20
0
06 Feb 2020
Modality Compensation Network: Cross-Modal Adaptation for Action Recognition
IEEE Transactions on Image Processing (TIP), 2020
Sijie Song
Jiaying Liu
Yanghao Li
Zongming Guo
145
57
0
31 Jan 2020
Multi-Modal Domain Adaptation for Fine-Grained Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2020
Jonathan Munro
Dima Damen
EgoV
283
229
0
27 Jan 2020
Audiovisual SlowFast Networks for Video Recognition
Fanyi Xiao
Yong Jae Lee
Kristen Grauman
Jitendra Malik
Christoph Feichtenhofer
630
232
0
23 Jan 2020
Zero-Shot Activity Recognition with Videos
Evin Pınar Örnek
96
1
0
22 Jan 2020
Weakly Supervised Temporal Action Localization Using Deep Metric Learning
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020
Ashraful Islam
Richard J. Radke
160
48
0
21 Jan 2020
The benefits of synthetic data for action categorization
IEEE International Joint Conference on Neural Network (IJCNN), 2020
Mohamad Ballout
Mohammad Tuqan
Daniel C. Asmar
Elie A. Shammas
George E. Sakr
82
6
0
20 Jan 2020
Tree-Structured Policy based Progressive Reinforcement Learning for Temporally Language Grounding in Video
AAAI Conference on Artificial Intelligence (AAAI), 2020
Jie Wu
Guanbin Li
Si Liu
Liang Lin
OffRL
170
116
0
18 Jan 2020
Temporal Interlacing Network
AAAI Conference on Artificial Intelligence (AAAI), 2020
Hao Shao
Shengju Qian
Yu Liu
237
108
0
17 Jan 2020
Modality-Balanced Models for Visual Dialogue
AAAI Conference on Artificial Intelligence (AAAI), 2020
Hyounghun Kim
Hao Tan
Joey Tianyi Zhou
118
29
0
17 Jan 2020
Learning Spatiotemporal Features via Video and Text Pair Discrimination
Tianhao Li
Limin Wang
VGen
159
60
0
16 Jan 2020
Rethinking Motion Representation: Residual Frames with 3D ConvNets for Better Action Recognition
IEEE Transactions on Image Processing (TIP), 2020
Li Tao
Xueting Wang
T. Yamasaki
3DPC
149
26
0
16 Jan 2020
Recognizing Video Events with Varying Rhythms
Yikang Li
Tianshu Yu
Baoxin Li
88
4
0
14 Jan 2020
Video Coding for Machines: A Paradigm of Collaborative Compression and Intelligent Analytics
IEEE Transactions on Image Processing (TIP), 2020
Ling-yu Duan
Jiaying Liu
Wenhan Yang
Tiejun Huang
Wen Gao
329
226
0
10 Jan 2020