Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1608.00859
Cited By
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition
2 August 2016
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Temporal Segment Networks: Towards Good Practices for Deep Action Recognition"
50 / 1,449 papers shown
Are current long-term video understanding datasets long-term?
Ombretta Strafforello
Klamer Schutte
Jan van Gemert
207
10
0
22 Aug 2023
Temporal-Distributed Backdoor Attack Against Video Based Action Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2023
Xi Li
Songhe Wang
Rui Huang
Mahanth K. Gowda
G. Kesidis
AAML
391
7
0
21 Aug 2023
ResQ: Residual Quantization for Video Perception
IEEE International Conference on Computer Vision (ICCV), 2023
Davide Abati
H. Yahia
Markus Nagel
A. Habibian
MQ
221
3
0
18 Aug 2023
Boosting Few-shot Action Recognition with Graph-guided Hybrid Matching
IEEE International Conference on Computer Vision (ICCV), 2023
Jiazheng Xing
Mengmeng Wang
Yudi Ruan
Bofan Chen
Yaowei Guo
B. Mu
Guangwen Dai
Jingdong Wang
Yong-Jin Liu
207
34
0
18 Aug 2023
Unlimited Knowledge Distillation for Action Recognition in the Dark
Ruibing Jin
Guosheng Lin
Ruibing Jin
Jie Lin
Zhengguo Li
Xiaoli Li
Zhenghua Chen
155
2
0
18 Aug 2023
Progression-Guided Temporal Action Detection in Videos
Chongkai Lu
Man-Wai Mak
Ruimin Li
Z. Chi
Hong Fu
AI4TS
167
0
0
18 Aug 2023
Memory-and-Anticipation Transformer for Online Action Understanding
IEEE International Conference on Computer Vision (ICCV), 2023
Jiahao Wang
Guo Chen
Yifei Huang
Liming Wang
Tong Lu
OffRL
303
60
0
15 Aug 2023
ViGT: Proposal-free Video Grounding with Learnable Token in Transformer
Science China Information Sciences (Sci China Inf Sci), 2023
Kun Li
Dan Guo
Meng Wang
ViT
156
62
0
11 Aug 2023
Temporally-Adaptive Models for Efficient Video Understanding
Ziyuan Huang
Shiwei Zhang
Liang Pan
Zhiwu Qing
Yingya Zhang
Ziwei Liu
Marcelo H. Ang
205
17
0
10 Aug 2023
JEDI: Joint Expert Distillation in a Semi-Supervised Multi-Dataset Student-Teacher Scenario for Video Action Recognition
L. Bicsi
B. Alexe
Radu Tudor Ionescu
Marius Leordeanu
257
2
0
09 Aug 2023
View while Moving: Efficient Video Recognition in Long-untrimmed Videos
ACM Multimedia (ACM MM), 2023
Ye Tian
Meng Yang
Lanshan Zhang
Zhizhen Zhang
Yang Liu
Xiao-Zhu Xie
Xirong Que
Wendong Wang
261
10
0
09 Aug 2023
Long-Distance Gesture Recognition using Dynamic Neural Networks
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Shubhang Bhatnagar
S. Gopal
Narendra Ahuja
Liu Ren
191
6
0
09 Aug 2023
SSTFormer: Bridging Spiking Neural Network and Memory Support Transformer for Frame-Event based Recognition
IEEE Transactions on Cognitive and Developmental Systems (IEEE TCDS), 2023
Tianlin Li
Zong-Yao Wu
Yao Rong
Lin Zhu
Bowei Jiang
Jin Tang
Yonghong Tian
ViT
369
23
0
08 Aug 2023
ViLP: Knowledge Exploration using Vision, Language, and Pose Embeddings for Video Action Recognition
Indian Conference on Computer Vision, Graphics & Image Processing (ICVGIP), 2023
S. Chaudhuri
Saumik Bhattacharya
175
6
0
07 Aug 2023
M
3
^3
3
Net: Multi-view Encoding, Matching, and Fusion for Few-shot Fine-grained Action Recognition
ACM Multimedia (ACM MM), 2023
Hao Tang
Jun Liu
Shuanglin Yan
Rui Yan
Zechao Li
Jinhui Tang
278
74
0
06 Aug 2023
Multimodal Adaptation of CLIP for Few-Shot Action Recognition
Pattern Recognition (Pattern Recogn.), 2023
Jiazheng Xing
Mengmeng Wang
Xiaojun Hou
Guangwen Dai
Jingdong Wang
Yong-Jin Liu
VLM
181
1
0
03 Aug 2023
SkateboardAI: The Coolest Video Action Recognition for Skateboarding
AAAI Conference on Artificial Intelligence (AAAI), 2023
Hanxiao Chen
ViT
118
4
0
02 Aug 2023
MAiVAR-T: Multimodal Audio-image and Video Action Recognizer using Transformers
European Workshop on Visual Information Processing (EUVIP), 2023
Muhammad Bilal Shaikh
Douglas Chai
Syed Mohammed Shamsul Islam
Naveed Akhtar
293
7
0
01 Aug 2023
Sample Less, Learn More: Efficient Action Recognition via Frame Feature Restoration
ACM Multimedia (ACM MM), 2023
Harry Cheng
Yangyang Guo
Liqiang Nie
Zhiyong Cheng
Mohan S. Kankanhalli
220
9
0
27 Jul 2023
Unlocking the Emotional World of Visual Media: An Overview of the Science, Research, and Impact of Understanding Emotion
Proceedings of the IEEE (Proc. IEEE), 2023
James Z. Wang
Sicheng Zhao
Chenyan Wu
Reginald B. Adams
M. Newman
T. Shafir
Rachelle Tsachor
338
54
0
25 Jul 2023
Spatiotemporal Modeling Encounters 3D Medical Image Analysis: Slice-Shift UNet with Multi-View Fusion
International Conference on Machine Vision and Applications (ICMVA), 2023
C. Ugwu
S. Casarin
Oswald Lanz
179
0
0
24 Jul 2023
In Defense of Clip-based Video Relation Detection
IEEE Transactions on Image Processing (IEEE TIP), 2023
Meng Wei
Long Chen
Wei Ji
Xiaoyu Yue
Roger Zimmermann
179
7
0
18 Jul 2023
What Can Simple Arithmetic Operations Do for Temporal Modeling?
IEEE International Conference on Computer Vision (ICCV), 2023
Wenhao Wu
Yuxin Song
Zhun Sun
Jingdong Wang
Chang Xu
Wanli Ouyang
212
17
0
18 Jul 2023
Video-Mined Task Graphs for Keystep Recognition in Instructional Videos
Neural Information Processing Systems (NeurIPS), 2023
Kumar Ashutosh
Santhosh Kumar Ramakrishnan
Triantafyllos Afouras
Kristen Grauman
299
37
0
17 Jul 2023
SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-training
IEEE International Conference on Computer Vision (ICCV), 2023
Hongfei Yan
Zehua Wang
Yushen Wei
Zerui Li
Guanbin Li
Guanbin Li
279
64
0
17 Jul 2023
Multimodal Distillation for Egocentric Action Recognition
IEEE International Conference on Computer Vision (ICCV), 2023
Gorjan Radevski
Dusan Grujicic
Marie-Francine Moens
Matthew Blaschko
Tinne Tuytelaars
EgoV
331
35
0
14 Jul 2023
RVD: A Handheld Device-Based Fundus Video Dataset for Retinal Vessel Segmentation
Neural Information Processing Systems (NeurIPS), 2023
MD Wahiduzzaman Khan
Hong Sheng
Hu Zhang
Heming Du
Sen Wang
...
Jack Phu
A. Agar
Zichen Huang
M. Golzan
Xin Yu
160
10
0
13 Jul 2023
VS-TransGRU: A Novel Transformer-GRU-based Framework Enhanced by Visual-Semantic Fusion for Egocentric Action Anticipation
Congqi Cao
Ze Sun
Qinyi Lv
Lingtong Min
Yanning Zhang
ViT
159
8
0
08 Jul 2023
MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic Facial Expression Recognition
ACM Multimedia (ACM MM), 2023
Guoying Zhao
Zheng Lian
B. Liu
Jianhua Tao
238
73
0
05 Jul 2023
Task-Specific Alignment and Multiple Level Transformer for Few-Shot Action Recognition
Neurocomputing (Neurocomputing), 2023
Fei-Yu Guo
Li Zhu
Yiwang Wang
Jing Sun
ViT
232
10
0
05 Jul 2023
Streaming egocentric action anticipation: An evaluation scheme and approach
Computer Vision and Image Understanding (CVIU), 2023
Antonino Furnari
G. Farinella
EgoV
177
5
0
29 Jun 2023
Bullying10K: A Large-Scale Neuromorphic Dataset towards Privacy-Preserving Bullying Recognition
Neural Information Processing Systems (NeurIPS), 2023
Yiting Dong
Yang Li
Dongcheng Zhao
Guobin Shen
Yi Zeng
185
20
0
20 Jun 2023
E2E-LOAD: End-to-End Long-form Online Action Detection
IEEE International Conference on Computer Vision (ICCV), 2023
Shuyuan Cao
Weihua Luo
Bairui Wang
Wei Emma Zhang
Lin Ma
216
12
0
13 Jun 2023
Enhanced Multimodal Representation Learning with Cross-modal KD
Computer Vision and Pattern Recognition (CVPR), 2023
Mengxi Chen
Linyu Xing
Yu Wang
Ya Zhang
149
18
0
13 Jun 2023
Action Recognition with Multi-stream Motion Modeling and Mutual Information Maximization
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Yu-Huan Yang
Haipeng Chen
Zhenguang Liu
Y. Lyu
Beibei Zhang
Shuang Wu
Peng Kuang
Kui Ren
246
10
0
13 Jun 2023
Boosting Breast Ultrasound Video Classification by the Guidance of Keyframe Feature Centers
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2023
AnLan Sun
Zhao Zhang
Meng Lei
Yuting Dai
Dong Wang
Liwei Wang
153
12
0
12 Jun 2023
Optimizing ViViT Training: Time and Memory Reduction for Action Recognition
Shreyank N. Gowda
Anurag Arnab
Jonathan Huang
ViT
182
4
0
07 Jun 2023
Atrial Septal Defect Detection in Children Based on Ultrasound Video Using Multiple Instances Learning
Yiman Liu
Qingming Huang
Xiaoxiang Han
Tongtong Liang
Zhi-fang Zhang
...
Angelos Stefanidis
Jionglong Su
Jiangang Chen
Qingli Li
Yuqi Zhang
139
13
0
06 Jun 2023
Retrieval-Enhanced Visual Prompt Learning for Few-shot Classification
Jintao Rong
Hao Chen
Tianrun Chen
Linlin Ou
Xinyi Yu
Yifan Liu
VLM
VPVLM
194
8
0
04 Jun 2023
VideoComposer: Compositional Video Synthesis with Motion Controllability
Neural Information Processing Systems (NeurIPS), 2023
Xiang Wang
Hangjie Yuan
Shiwei Zhang
Dayou Chen
Jiuniu Wang
Yingya Zhang
Yujun Shen
Deli Zhao
Jingren Zhou
VGen
DiffM
478
459
0
03 Jun 2023
VIPriors 3: Visual Inductive Priors for Data-Efficient Deep Learning Challenges
Robert-Jan Bruintjes
A. Lengyel
Marcos Baptista-Rios
O. Kayhan
Davide Zambrano
Nergis Tomen
Jan van Gemert
188
9
0
31 May 2023
Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization
Computer Vision and Pattern Recognition (CVPR), 2023
Huantao Ren
Wenfei Yang
Tianzhu Zhang
Yongdong Zhang
360
42
0
29 May 2023
Action Sensitivity Learning for Temporal Action Localization
IEEE International Conference on Computer Vision (ICCV), 2023
Jiayi Shao
Xiaohan Wang
Ruijie Quan
Junjun Zheng
Jiang Yang
Yezhou Yang
331
41
0
25 May 2023
Cross-view Action Recognition Understanding From Exocentric to Egocentric Perspective
Neurocomputing (Neurocomputing), 2023
Thanh-Dat Truong
Khoa Luu
EgoV
389
15
0
25 May 2023
TVTSv2: Learning Out-of-the-box Spatiotemporal Visual Representations at Scale
Ziyun Zeng
Yixiao Ge
Zhan Tong
Xihui Liu
Shutao Xia
Ying Shan
275
13
0
23 May 2023
VideoLLM: Modeling Video Sequence with Large Language Models
Guo Chen
Yin-Dong Zheng
Jiahao Wang
Jilan Xu
Yifei Huang
...
Yi Wang
Yali Wang
Yu Qiao
Tong Lu
Limin Wang
MLLM
261
112
0
22 May 2023
Learning Higher-order Object Interactions for Keypoint-based Video Understanding
Yi Huang
Asim Kadav
Farley Lai
Deep Patel
H. Graf
125
1
0
16 May 2023
Exploring Few-Shot Adaptation for Activity Recognition on Diverse Domains
Kunyu Peng
Di Wen
David Schneider
Kailai Li
Kailun Yang
M. Sarfraz
Rainer Stiefelhagen
Alina Roitberg
335
3
0
15 May 2023
CEMFormer: Learning to Predict Driver Intentions from In-Cabin and External Cameras via Spatial-Temporal Transformers
Yunsheng Ma
Wenqian Ye
Xu Cao
Lingxi Li
Kyungtae Han
Rohit Gupta
Ziran Wang
171
18
0
13 May 2023
Few-shot Action Recognition via Intra- and Inter-Video Information Maximization
Huabin Liu
W. Lin
Yun Xu
Yuxi Li
Shuyuan Li
John See
225
9
0
10 May 2023
Previous
1
2
3
...
5
6
7
...
27
28
29
Next