Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1608.00859
Cited By
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition
2 August 2016
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Temporal Segment Networks: Towards Good Practices for Deep Action Recognition"
50 / 1,449 papers shown
Revisiting the Spatial and Temporal Modeling for Few-shot Action Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2023
Jiazheng Xing
Mengmeng Wang
Yong-Jin Liu
B. Mu
ViT
213
47
0
19 Jan 2023
Temporal Perceiving Video-Language Pre-training
Fan Ma
Xiaojie Jin
Heng Wang
Jingjia Huang
Linchao Zhu
Jiashi Feng
Yi Yang
VLM
206
17
0
18 Jan 2023
CNN-Based Action Recognition and Pose Estimation for Classifying Animal Behavior from Videos: A Survey
Michael Perez
Corey Toler-Franklin
MedIm
194
22
0
15 Jan 2023
ViTs for SITS: Vision Transformers for Satellite Image Time Series
Computer Vision and Pattern Recognition (CVPR), 2023
Michail Tarasiou
Erik Chavez
Stefanos Zafeiriou
ViT
283
86
0
12 Jan 2023
HierVL: Learning Hierarchical Video-Language Embeddings
Computer Vision and Pattern Recognition (CVPR), 2023
Kumar Ashutosh
Rohit Girdhar
Lorenzo Torresani
Kristen Grauman
VLM
AI4TS
440
72
0
05 Jan 2023
Look, Listen, and Attack: Backdoor Attacks Against Video Action Recognition
Hasan Hammoud
Shuming Liu
Mohammad Alkhrashi
Fahad Albalawi
Guohao Li
AAML
280
12
0
03 Jan 2023
Efficient Robustness Assessment via Adversarial Spatial-Temporal Focus on Videos
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Xingxing Wei
Songping Wang
Huanqian Yan
AAML
311
24
0
03 Jan 2023
Hierarchical Explanations for Video Action Recognition
Sadaf Gulshad
Teng Long
Nanne van Noord
FAtt
354
13
0
01 Jan 2023
Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
Computer Vision and Pattern Recognition (CVPR), 2022
Wenhao Wu
Xiaohan Wang
Haipeng Luo
Jingdong Wang
Yi Yang
Wanli Ouyang
395
80
0
31 Dec 2022
Representation Learning in Deep RL via Discrete Information Bottleneck
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Riashat Islam
Hongyu Zang
Manan Tomar
Aniket Didolkar
Md. Mofijul Islam
...
Tariq Iqbal
Xin-hui Li
Anirudh Goyal
N. Heess
Alex Lamb
SSL
OffRL
177
11
0
28 Dec 2022
Deep set conditioned latent representations for action recognition
VISIGRAPP (VISIGRAPP), 2022
Akash Singh
Tom De Schepper
Kevin Mets
P. Hellinckx
José Oramas
Steven Latré
BDL
168
2
0
21 Dec 2022
C2F-TCN: A Framework for Semi and Fully Supervised Temporal Action Segmentation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Dipika Singhania
R. Rahaman
Angela Yao
193
44
0
20 Dec 2022
A Survey on Human Action Recognition
Zhou Shuchang
226
0
0
20 Dec 2022
Egocentric Video Task Translation
Computer Vision and Pattern Recognition (CVPR), 2022
Zihui Xue
Yale Song
Kristen Grauman
Lorenzo Torresani
EgoV
267
18
0
13 Dec 2022
Contextual Explainable Video Representation: Human Perception-based Understanding
Asilomar Conference on Signals, Systems and Computers (ACSSC), 2022
Khoa T. Vo
Kashu Yamazaki
Phong H. Nguyen
Pha Nguyen
Khoa Luu
Ngan Le
226
11
0
12 Dec 2022
Reconstructing Humpty Dumpty: Multi-feature Graph Autoencoder for Open Set Action Recognition
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Dawei Du
Ameya Shringi
A. Hoogs
Christopher Funk
155
3
0
12 Dec 2022
Multimodal Prototype-Enhanced Network for Few-Shot Action Recognition
International Conference on Multimedia Retrieval (ICMR), 2022
Xin Ni
Yong Liu
Hao Wen
Yatai Ji
Jing Xiao
Yujiu Yang
276
20
0
09 Dec 2022
Leveraging Spatio-Temporal Dependency for Skeleton-Based Action Recognition
IEEE International Conference on Computer Vision (ICCV), 2022
Junghoon Lee
Minhyeok Lee
Suhwan Cho
Sungmin Woo
Sungjun Jang
Sangyoun Lee
226
23
0
09 Dec 2022
Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation
IEEE Access (IEEE Access), 2022
Jie Jiang
Zhimin Li
Jiangfeng Xiong
Rongwei Quan
Qinglin Lu
Wei Liu
190
3
0
09 Dec 2022
DroneAttention: Sparse Weighted Temporal Attention for Drone-Camera Based Activity Recognition
Neural Networks (NN), 2022
Santosh Kumar Yadav
Achleshwar Luthra
Esha Pahwa
K. Tiwari
Heena Rathore
Hari Mohan Pandey
Peter Corcoran
212
19
0
07 Dec 2022
Fine-tuned CLIP Models are Efficient Video Learners
Computer Vision and Pattern Recognition (CVPR), 2022
H. Rasheed
Muhammad Uzair Khattak
Muhammad Maaz
Salman Khan
Fahad Shahbaz Khan
CLIP
VLM
404
225
0
06 Dec 2022
InternVideo: General Video Foundation Models via Generative and Discriminative Learning
Yi Wang
Kunchang Li
Yizhuo Li
Yinan He
Bingkun Huang
...
Junting Pan
Jiashuo Yu
Yali Wang
Limin Wang
Yu Qiao
VLM
VGen
455
448
0
06 Dec 2022
VLG: General Video Recognition with Web Textual Knowledge
International Journal of Computer Vision (IJCV), 2022
Jintao Lin
Zhaoyang Liu
Wenhai Wang
Wayne Wu
Limin Wang
237
2
0
03 Dec 2022
Masked Contrastive Pre-Training for Efficient Video-Text Retrieval
Fangxun Shu
Biaolong Chen
Yue Liao
Shuwen Xiao
Wenyu Sun
Xiaobo Li
Yousong Zhu
Jinqiao Wang
Si Liu
CLIP
185
13
0
02 Dec 2022
Lightweight Structure-Aware Attention for Visual Understanding
International Journal of Computer Vision (IJCV), 2022
Heeseung Kwon
F. M. Castro
M. Marín-Jiménez
N. Guil
Alahari Karteek
200
3
0
29 Nov 2022
Post-Processing Temporal Action Detection
Computer Vision and Pattern Recognition (CVPR), 2022
Sauradip Nag
Xiatian Zhu
Yi-Zhe Song
Tao Xiang
166
10
0
27 Nov 2022
Towards Good Practices for Missing Modality Robust Action Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2022
Sangmin Woo
Sumin Lee
Yeonju Park
Muhammad Adi Nugroho
Changick Kim
241
70
0
25 Nov 2022
Hand Guided High Resolution Feature Enhancement for Fine-Grained Atomic Action Segmentation within Complex Human Assemblies
Matthew Kent Myers
Nick Wright
Stephen McGough
Nicholas Martin
124
2
0
24 Nov 2022
Video Test-Time Adaptation for Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2022
Wei Lin
M. Jehanzeb Mirza
Mateusz Koziñski
Horst Possegger
Hilde Kuehne
Horst Bischof
TTA
261
47
0
24 Nov 2022
SVFormer: Semi-supervised Video Transformer for Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2022
Zhen Xing
Jingdong Sun
Hang-Rui Hu
Yue Yu
Zuxuan Wu
Yu-Gang Jiang
ViT
197
120
0
23 Nov 2022
Dynamic Appearance: A Video Representation for Action Recognition with Joint Training
Guoxi Huang
A. Bors
178
1
0
23 Nov 2022
Look More but Care Less in Video Recognition
Neural Information Processing Systems (NeurIPS), 2022
Yitian Zhang
Yue Bai
Haiquan Wang
Yi Xu
Yun Fu
219
12
0
18 Nov 2022
UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer
Kunchang Li
Yali Wang
Yinan He
Yizhuo Li
Yi Wang
Limin Wang
Yu Qiao
ViT
227
156
0
17 Nov 2022
Video Unsupervised Domain Adaptation with Deep Learning: A Comprehensive Survey
ACM Computing Surveys (ACM CSUR), 2022
Yuecong Xu
Haozhi Cao
Zhenghua Chen
Xiaoli Li
Lihua Xie
Jianfei Yang
232
23
0
17 Nov 2022
Language-Assisted Deep Learning for Autistic Behaviors Recognition
Smart Health (SH), 2022
Andong Deng
Taojiannan Yang
Chong Chen
Qian Chen
Leslie C. Neely
Sakiko Oyama
180
14
0
17 Nov 2022
A Unified Multimodal De- and Re-coupling Framework for RGB-D Motion Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Benjia Zhou
Pichao Wang
Jun Wan
Yan-Ni Liang
Fan Wang
221
29
0
16 Nov 2022
Exploring State Change Capture of Heterogeneous Backbones @ Ego4D Hands and Objects Challenge 2022
Yin-Dong Zheng
Guo Chen
Jiahao Wang
Tong Lu
Liming Wang
184
1
0
16 Nov 2022
Dynamic Temporal Filtering in Video Models
European Conference on Computer Vision (ECCV), 2022
Fuchen Long
Zhaofan Qiu
Yingwei Pan
Ting Yao
Chong-Wah Ngo
Tao Mei
AI4TS
237
24
0
15 Nov 2022
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
Computer Vision and Pattern Recognition (CVPR), 2022
Yuxin Fang
Wen Wang
Binhui Xie
Quan-Sen Sun
Ledell Yu Wu
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM
CLIP
621
901
0
14 Nov 2022
Deep Unsupervised Key Frame Extraction for Efficient Video Classification
Hao Tang
L. Ding
Songsong Wu
Bin Ren
Andrii Zadaianchuk
Paolo Rota
103
45
0
12 Nov 2022
Soft-Landing Strategy for Alleviating the Task Discrepancy Problem in Temporal Action Localization Tasks
Computer Vision and Pattern Recognition (CVPR), 2022
Hyolim Kang
Hanjung Kim
Joungbin An
Minsu Cho
Seon Joo Kim
281
7
0
11 Nov 2022
SWTF: Sparse Weighted Temporal Fusion for Drone-Based Activity Recognition
Santosh Kumar Yadav
Esha Pahwa
Achleshwar Luthra
K. Tiwari
Hari Mohan Pandey
Peter Corcoran
175
4
0
10 Nov 2022
SimOn: A Simple Framework for Online Temporal Action Localization
Tuan N. Tang
Jungin Park
Kwonyoung Kim
Kwanghoon Sohn
161
5
0
08 Nov 2022
Facial Tic Detection in Untrimmed Videos of Tourette Syndrome Patients
International Conference on Pattern Recognition (ICPR), 2022
Yu-Ching Tang
Benjamín Béjar
J. K. Essoe
J. McGuire
René Vidal
133
5
0
07 Nov 2022
Bringing Online Egocentric Action Recognition into the wild
IEEE Robotics and Automation Letters (RA-L), 2022
Gabriele Goletto
M. Planamente
Barbara Caputo
Giuseppe Averta
EgoV
225
6
0
06 Nov 2022
Event and Entity Extraction from Generated Video Captions
International Cross-Domain Conference on Machine Learning and Knowledge Extraction (CD-MAKE), 2022
Johannes Scherer
A. Scherp
Deepayan Bhowmik
238
0
0
05 Nov 2022
Self-Supervised Learning for Speech Enhancement through Synthesis
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Bryce Irvin
Marko Stamenovic
M. Kegler
Li-Chia Yang
194
24
0
04 Nov 2022
Video Event Extraction via Tracking Visual States of Arguments
AAAI Conference on Artificial Intelligence (AAAI), 2022
Guang Yang
Pengfei Yu
Jiajie Zhang
Xudong Lin
Shih-Fu Chang
Heng Ji
204
13
0
03 Nov 2022
Deep Learning Computer Vision Algorithms for Real-time UAVs On-board Camera Image Processing
A. Palmas
P. Andronico
225
5
0
02 Nov 2022
TAMFormer: Multi-Modal Transformer with Learned Attention Mask for Early Intent Prediction
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Nada Osman
Guglielmo Camporese
Lamberto Ballan
127
15
0
26 Oct 2022
Previous
1
2
3
...
7
8
9
...
27
28
29
Next
Page 8 of 29
Page
of 29
Go