1608.00859
Cited By
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition
2 August 2016
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
ViT
Papers citing
"Temporal Segment Networks: Towards Good Practices for Deep Action Recognition"
50 / 1,449 papers shown
OWL (Observe, Watch, Listen): Audiovisual Temporal Context for Localizing Actions in Egocentric Videos
Merey Ramazanova
Victor Escorcia
Fabian Caba Heilbron
Chen Zhao
Guohao Li
223
4
0
10 Feb 2022
CZU-MHAD: A multimodal dataset for human action recognition utilizing a depth camera and 10 wearable inertial sensors
IEEE Sensors Journal (IEEE Sens. J.), 2022
Xin Chao
Zhenjie Hou
Yu Mo
194
26
0
07 Feb 2022
A Coding Framework and Benchmark towards Compressed Video Understanding
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yuan Tian
Guo Lu
Manwen Liao
Guangtao Zhai
Lixing Chen
Zhiyong Gao
175
4
0
06 Feb 2022
Should I take a walk? Estimating Energy Expenditure from Video Data
Kunyu Peng
Alina Roitberg
Kailun Yang
Kailai Li
Rainer Stiefelhagen
183
7
0
01 Feb 2022
Implicit Regularization in Hierarchical Tensor Factorization and Deep Convolutional Neural Networks
International Conference on Machine Learning (ICML), 2022
Noam Razin
Asaf Maman
Nadav Cohen
423
33
0
27 Jan 2022
Learning To Recognize Procedural Activities with Distant Supervision
Computer Vision and Pattern Recognition (CVPR), 2022
Xudong Lin
Fabio Petroni
Gedas Bertasius
Marcus Rohrbach
Shih-Fu Chang
Lorenzo Torresani
260
96
0
26 Jan 2022
UniFormer: Unifying Convolution and Self-attention for Visual Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Kunchang Li
Yali Wang
Junhao Zhang
Shiyang Feng
Guanglu Song
Yu Liu
Jiaming Song
Yu Qiao
ViT
546
532
0
24 Jan 2022
Rich Action-semantic Consistent Knowledge for Early Action Prediction
IEEE Transactions on Image Processing (IEEE TIP), 2022
Xiaoli Liu
Jianqin Yin
Dianming Guo
Huaping Liu
196
7
0
23 Jan 2022
MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
Computer Vision and Pattern Recognition (CVPR), 2022
Chao-Yuan Wu
Yanghao Li
K. Mangalam
Haoqi Fan
Bo Xiong
Jitendra Malik
Christoph Feichtenhofer
ViT
489
245
0
20 Jan 2022
Action Keypoint Network for Efficient Video Recognition
IEEE Transactions on Image Processing (IEEE TIP), 2022
Xu Chen
Yahong Han
Xiaohan Wang
Yifang Sun
Yi Yang
3DPC
260
9
0
17 Jan 2022
Argus++: Robust Real-time Activity Detection for Unconstrained Video Streams with Overlapping Cube Proposals
Lijun Yu
Yijun Qian
Wenhe Liu
Alexander G. Hauptmann
174
14
0
14 Jan 2022
Hand-Object Interaction Reasoning
Advanced Video and Signal Based Surveillance (AVSS), 2022
Jian Ma
Dima Damen
189
7
0
13 Jan 2022
UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning
International Conference on Learning Representations (ICLR), 2022
Kunchang Li
Yali Wang
Shiyang Feng
Guanglu Song
Yu Liu
Jiaming Song
Yu Qiao
ViT
489
320
0
12 Jan 2022
OCSampler: Compressing Videos to One Clip with Single-step Sampling
Computer Vision and Pattern Recognition (CVPR), 2022
Jintao Lin
Haodong Duan
Kai-xiang Chen
Dahua Lin
Limin Wang
181
31
0
12 Jan 2022
Representing Videos as Discriminative Sub-graphs for Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2021
Dong Li
Zhaofan Qiu
Yingwei Pan
Ting Yao
Houqiang Li
Tao Mei
229
33
0
11 Jan 2022
Boosting Video Representation Learning with Multi-Faceted Integration
Computer Vision and Pattern Recognition (CVPR), 2021
Zhaofan Qiu
Ting Yao
Chong-Wah Ngo
Xiaoping Zhang
Dong Wu
Tao Mei
179
9
0
11 Jan 2022
Condensing a Sequence to One Informative Frame for Video Recognition
IEEE International Conference on Computer Vision (ICCV), 2021
Zhaofan Qiu
Ting Yao
Y. Shu
Chong-Wah Ngo
Tao Mei
277
11
0
11 Jan 2022
Optimization Planning for 3D ConvNets
International Conference on Machine Learning (ICML), 2022
Zhaofan Qiu
Ting Yao
Chong-Wah Ngo
Tao Mei
3DPC
3DH
214
9
0
11 Jan 2022
AdaFocus V2: End-to-End Training of Spatial Dynamic Networks for Video Recognition
Computer Vision and Pattern Recognition (CVPR), 2021
Yulin Wang
Yang Yue
Yuanze Lin
Haojun Jiang
Zihang Lai
V. Kulikov
Nikita Orlov
Humphrey Shi
Gao Huang
237
63
0
28 Dec 2021
Fine-grained Multi-Modal Self-Supervised Learning
British Machine Vision Conference (BMVC), 2021
Duo Wang
S. Karout
SSL
117
7
0
22 Dec 2021
Expansion-Squeeze-Excitation Fusion Network for Elderly Activity Recognition
Xiangbo Shu
Jiawen Yang
Rui Yan
Yan Song
239
174
0
21 Dec 2021
Precondition and Effect Reasoning for Action Recognition
Computer Vision and Image Understanding (CVIU), 2021
Hongsang Yoo
Haopeng Li
Qiuhong Ke
Liangchen Liu
Rui Zhang
CML
190
5
0
19 Dec 2021
Adversarial Memory Networks for Action Prediction
Zhiqiang Tao
Yue Bai
Handong Zhao
Sheng Li
Yuanyuan Kong
Y. Fu
GAN
82
3
0
18 Dec 2021
Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2021
Yinghao Xu
Fangyun Wei
Xiao Sun
Ceyuan Yang
Yujun Shen
Bo Dai
Bolei Zhou
Stephen Lin
VLM
183
62
0
17 Dec 2021
Decoupling and Recoupling Spatiotemporal Representation for RGB-D-based Motion Recognition
Benjia Zhou
Pichao Wang
Jun Wan
Yanyan Liang
Fan Wang
Du Zhang
Zhen Lei
Hao Li
Rong Jin
219
35
0
16 Dec 2021
Temporal Action Proposal Generation with Background Constraint
Haosen Yang
Wenhao Wu
Lining Wang
Sheng Jin
Boyang Xia
Huanjin Yao
Hujie Huang
272
29
0
15 Dec 2021
Temporal Transformer Networks with Self-Supervision for Action Recognition
Yongkang Zhang
Jun Li
Guoming Wu
Hanjie Zhang
Zhiping Shi
Zhaoxun Liu
Zizhang Wu
ViT
259
8
0
14 Dec 2021
SVIP: Sequence VerIfication for Procedures in Videos
Yichen Qian
Weixin Luo
Dongze Lian
Xu Tang
P. Zhao
Shenghua Gao
ViT
341
23
0
13 Dec 2021
Rethinking the Two-Stage Framework for Grounded Situation Recognition
Meng Wei
Long Chen
Wei Ji
Xiaoyu Yue
Tat-Seng Chua
200
38
0
10 Dec 2021
Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection
Jiaqi Tang
Zhaoyang Liu
Chao Qian
Wayne Wu
Limin Wang
272
23
0
09 Dec 2021
MASTAF: A Model-Agnostic Spatio-Temporal Attention Fusion Network for Few-shot Video Classification
Rex Liu
Huan Zhang
Hamed Pirsiavash
Xin Liu
ViT
298
17
0
08 Dec 2021
Exploring Temporal Granularity in Self-Supervised Video Representation Learning
Rui Qian
Yeqing Li
Liangzhe Yuan
Boqing Gong
Ting Liu
Matthew A. Brown
Serge Belongie
Ming-Hsuan Yang
Hartwig Adam
Huayu Chen
AI4TS
200
7
0
08 Dec 2021
Prompting Visual-Language Models for Efficient Video Understanding
Chen Ju
Tengda Han
Kunhao Zheng
Ya Zhang
Weidi Xie
VPVLM
VLM
380
462
0
08 Dec 2021
Suppressing Static Visual Cues via Normalizing Flows for Self-Supervised Video Representation Learning
Manlin Zhang
Jinpeng Wang
A. J. Ma
173
9
0
07 Dec 2021
Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection
Shoubin Yu
Zhong-Hua Zhao
Haoshu Fang
Andong Deng
Haisheng Su
Dongliang Wang
Weihao Gan
Cewu Lu
Wei Wu
294
28
0
07 Dec 2021
Time-Equivariant Contrastive Video Representation Learning
Simon Jenni
Hailin Jin
SSL
AI4TS
347
61
0
07 Dec 2021
DCAN: Improving Temporal Action Detection via Dual Context Aggregation
Guo Chen
Yin-Dong Zheng
Limin Wang
Tong Lu
AI4TS
239
84
0
07 Dec 2021
E²(GO)MOTION: Motion Augmented Event Stream for Egocentric Action Recognition
Chiara Plizzari
M. Planamente
Gabriele Goletto
Marco Cannici
Emanuele Gusso
Matteo Matteucci
Barbara Caputo
EgoV
245
72
0
07 Dec 2021
Learning Connectivity with Graph Convolutional Networks for Skeleton-based Action Recognition
International Conference on Pattern Recognition (ICPR), 2021
H. Sahbi
GNN
212
38
0
06 Dec 2021
STSM: Spatio-Temporal Shift Module for Efficient Action Recognition
Zhaoqilin Yang
Gaoyun An
208
6
0
05 Dec 2021
Video-Text Pre-training with Learned Regions
Rui Yan
Mike Zheng Shou
Yixiao Ge
Alex Jinpeng Wang
Xudong Lin
Guanyu Cai
Jinhui Tang
261
27
0
02 Dec 2021
Stacked Temporal Attention: Improving First-person Action Recognition by Emphasizing Discriminative Clips
Lijin Yang
Yifei Huang
Yusuke Sugano
Yoichi Sato
232
6
0
02 Dec 2021
Graph Convolutional Module for Temporal Action Localization in Videos
Runhao Zeng
Wenbing Huang
Zhuliang Yu
Yu Rong
P. Zhao
Junzhou Huang
Chuang Gan
206
85
0
01 Dec 2021
UBoCo: Unsupervised Boundary Contrastive Learning for Generic Event Boundary Detection
Hyolim Kang
Jinwoo Kim
Taehyun Kim
Seon Joo Kim
212
31
0
29 Nov 2021
Learning from Temporal Gradient for Semi-supervised Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2021
Junfei Xiao
Longlong Jing
Lin Zhang
Ju He
Qi She
Zongwei Zhou
Alan Yuille
Yingwei Li
254
67
0
25 Nov 2021
PolyViT: Co-training Vision Transformers on Images, Videos and Audio
Valerii Likhosherstov
Anurag Arnab
K. Choromanski
Mario Lucic
Yi Tay
Adrian Weller
Mostafa Dehghani
ViT
195
83
0
25 Nov 2021
VIOLET: End-to-End Video-Language Transformers with Masked Visual-token Modeling
Tsu-Jui Fu
Linjie Li
Zhe Gan
Kevin Qinghong Lin
Wenjie Wang
Lijuan Wang
Zicheng Liu
VLM
405
240
0
24 Nov 2021
Two-stage Rule-induction Visual Reasoning on RPMs with an Application to Video Prediction
Wentao He
Jianfeng Ren
Ruibin Bai
Xudong Jiang
LRM
302
8
0
24 Nov 2021
Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis
Zhaobo Qi
Shuhui Wang
Chi Su
Li Su
Weigang Zhang
Qingming Huang
146
10
0
23 Nov 2021
Self-Regulated Learning for Egocentric Video Activity Anticipation
Zhaobo Qi
Shuhui Wang
Chi Su
Li Su
Qingming Huang
Q. Tian
EgoV
270
63
0
23 Nov 2021