Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1608.00859
Cited By
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition
2 August 2016
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Temporal Segment Networks: Towards Good Practices for Deep Action Recognition"
50 / 1,449 papers shown
Benchmarks for Physical Reasoning AI
Andrew Melnik
Robin Schiewer
Moritz Lange
Andrei Muresanu
Mozhgan Saeidi
Animesh Garg
Helge J. Ritter
355
9
0
17 Dec 2023
Video-based Surgical Skill Assessment using Tree-based Gaussian Process Classifier
Arefeh Rezaei
M. J. Ahmadi
Amir Molaei
H. Taghirad
207
1
0
15 Dec 2023
EZ-CLIP: Efficient Zeroshot Video Action Recognition
Shahzad Ahmad
S. Chanda
Yogesh S Rawat
VLM
278
11
0
13 Dec 2023
LMDrive: Closed-Loop End-to-End Driving with Large Language Models
Computer Vision and Pattern Recognition (CVPR), 2023
Hao Shao
Yuxuan Hu
Letian Wang
Steven L. Waslander
Yu Liu
Jiaming Song
ELM
310
238
0
12 Dec 2023
Attention Based Encoder Decoder Model for Video Captioning in Nepali (2023)
Kabita Parajuli
S. R. Joshi
278
0
0
12 Dec 2023
X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-modal Knowledge Transfer
AAAI Conference on Artificial Intelligence (AAAI), 2023
Linglin Jing
Ying Xue
Xu Yan
Chaoda Zheng
Dong Wang
Ruimao Zhang
Zhigang Wang
Hui Fang
Bin Zhao
Zhen Li
ViT
3DPC
220
11
0
12 Dec 2023
Early Action Recognition with Action Prototypes
G. Camporese
Alessandro Bergamo
Xunyu Lin
Joseph Tighe
Davide Modolo
EgoV
134
0
0
11 Dec 2023
Sense, Predict, Adapt, Repeat: A Blueprint for Design of New Adaptive AI-Centric Sensing Systems
S. Hor
Amin Arbabian
209
2
0
11 Dec 2023
A Decoupled Spatio-Temporal Framework for Skeleton-based Action Segmentation
Yunheng Li
Zhongyu Li
Shanghua Gao
Qilong Wang
Qibin Hou
Ming-Ming Cheng
202
10
0
10 Dec 2023
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
Zhiwu Qing
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yujie Wei
Yingya Zhang
Changxin Gao
Nong Sang
VGen
DiffM
211
55
0
07 Dec 2023
Low-power, Continuous Remote Behavioral Localization with Event Cameras
Friedhelm Hamann
Suman Ghosh
Ignacio Juarez Martinez
Tom Hart
Alex Kacelnik
Guillermo Gallego
195
12
0
06 Dec 2023
Towards More Practical Group Activity Detection: A New Benchmark and Model
European Conference on Computer Vision (ECCV), 2023
Dongkeun Kim
Youngkil Song
Minsu Cho
Suha Kwak
201
10
0
05 Dec 2023
Adapting Short-Term Transformers for Action Detection in Untrimmed Videos
Computer Vision and Pattern Recognition (CVPR), 2023
Min Yang
Huan Gao
Ping Guo
Limin Wang
ViT
283
17
0
04 Dec 2023
Generating Action-conditioned Prompts for Open-vocabulary Video Action Recognition
ACM Multimedia (ACM MM), 2023
Chengyou Jia
Minnan Luo
Xiaojun Chang
Zhuohang Dang
Mingfei Han
Mengmeng Wang
Guangwen Dai
Sizhe Dang
Jingdong Wang
VLM
210
14
0
04 Dec 2023
Consistency Prototype Module and Motion Compensation for Few-Shot Action Recognition (CLIP-CP
M
2
\mathbf{M^2}
M
2
C)
Fei-Yu Guo
Li Zhu
YiKang Wang
Han Qi
274
8
0
02 Dec 2023
OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition
Computer Vision and Pattern Recognition (CVPR), 2023
Tom Tongjia Chen
Hongshan Yu
Zhengeng Yang
Zechuan Li
Wei Sun
Chen Chen
386
13
0
30 Nov 2023
Source-free Video Domain Adaptation by Learning from Noisy Labels
Pattern Recognition (Pattern Recogn.), 2023
A. Dasgupta
C. V. Jawahar
Karteek Alahari
TTA
VLM
498
13
0
30 Nov 2023
GeoDeformer: Geometric Deformable Transformer for Action Recognition
Jinhui Ye
Jiaming Zhou
Hui Xiong
Junwei Liang
ViT
111
1
0
29 Nov 2023
MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
Computer Vision and Pattern Recognition (CVPR), 2023
Kunchang Li
Yali Wang
Yinan He
Yizhuo Li
Yi Wang
...
Jilan Xu
Guo Chen
Ping Luo
Limin Wang
Yu Qiao
VLM
MLLM
668
857
0
28 Nov 2023
Towards Weakly Supervised End-to-end Learning for Long-video Action Recognition
Jiaming Zhou
Hanjun Li
Kun-Yu Lin
Junwei Liang
328
2
0
28 Nov 2023
Centre Stage: Centricity-based Audio-Visual Temporal Action Detection
Hanyuan Wang
Majid Mirmehdi
Dima Damen
Toby Perrett
187
3
0
28 Nov 2023
REACT: Recognize Every Action Everywhere All At Once
Machine Vision and Applications (MVA), 2023
N. V. R. Chappa
Pha Nguyen
P. Dobbs
Khoa Luu
213
6
0
27 Nov 2023
Align before Adapt: Leveraging Entity-to-Region Alignments for Generalizable Video Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2023
Yifei Chen
Dapeng Chen
Ruijin Liu
Sai Zhou
Wenyuan Xue
Wei Peng
295
15
0
27 Nov 2023
Introducing SSBD+ Dataset with a Convolutional Pipeline for detecting Self-Stimulatory Behaviours in Children using raw videos
International Conference on e-Health Networking, Applications and Services (HealthCom), 2023
Vaibhavi Lokegaonkar
Vijay Jaisankar
Pon Deepika
Madhav Rao
T. Srikanth
Sarbani Mallick
Manjit Sodhi
91
3
0
25 Nov 2023
Modality Mixer Exploiting Complementary Information for Multi-modal Action Recognition
Sumin Lee
Sangmin Woo
Muhammad Adi Nugroho
Changick Kim
254
0
0
21 Nov 2023
Unsupervised Video Summarization
Asian Conference on Computer Vision (ACCV), 2023
Hanqing Li
Diego Klabjan
J. Utke
158
2
0
07 Nov 2023
Dense Video Captioning: A Survey of Techniques, Datasets and Evaluation Protocols
ACM Computing Surveys (ACM Comput. Surv.), 2023
Iqra Qasim
Alexander Horsch
Dilip K. Prasad
255
14
0
05 Nov 2023
P-Age: Pexels Dataset for Robust Spatio-Temporal Apparent Age Classification
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Abid Ali
Ashish Marisetty
François Brémond
185
6
0
04 Nov 2023
Beyond still images: Temporal features and input variance resilience
Scientific Reports (Sci Rep), 2023
AmirHosein Fadaei
M. Dehaqani
271
0
0
01 Nov 2023
Diversifying Spatial-Temporal Perception for Video Domain Generalization
Neural Information Processing Systems (NeurIPS), 2023
Kun-Yu Lin
Jia-Run Du
Yipeng Gao
Jiaming Zhou
Wei-Shi Zheng
232
22
0
27 Oct 2023
Few-shot Action Recognition with Captioning Foundation Models
Xiang Wang
Shiwei Zhang
Hangjie Yuan
Yingya Zhang
Changxin Gao
Deli Zhao
Nong Sang
VLM
334
9
0
16 Oct 2023
Boundary Discretization and Reliable Classification Network for Temporal Action Detection
IEEE transactions on multimedia (IEEE TMM), 2023
Zhenying Fang
Jun Yu
Richang Hong
332
4
0
10 Oct 2023
Exploiting Facial Relationships and Feature Aggregation for Multi-Face Forgery Detection
IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2023
Chenhao Lin
Fangbin Yi
Hang Wang
Qian Li
Jingyi Deng
Chao Shen
CVBM
128
12
0
07 Oct 2023
Multi-Resolution Audio-Visual Feature Fusion for Temporal Action Localization
Edward Fish
Jon Weinbren
Andrew Gilbert
166
1
0
05 Oct 2023
A Grammatical Compositional Model for Video Action Detection
Zhijun Zhang
Xu Zou
Jiahuan Zhou
Sheng Zhong
Ying Wu
249
0
0
04 Oct 2023
ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video
European Conference on Computer Vision (ECCV), 2023
Xinhao Li
Yuhan Zhu
Limin Wang
VLM
324
18
0
02 Oct 2023
A Survey on Deep Learning Techniques for Action Anticipation
Zeyun Zhong
Manuel Martin
Michael Voit
Juergen Gall
Jürgen Beyerer
304
15
0
29 Sep 2023
Training a Large Video Model on a Single Machine in a Day
Yue Zhao
Philipp Krahenbuhl
VLM
273
23
0
28 Sep 2023
CPR-Coach: Recognizing Composite Error Actions based on Single-class Training
Computer Vision and Pattern Recognition (CVPR), 2023
Shunli Wang
Qing Yu
Shuai Wang
Dingkang Yang
Liuzhen Su
Xiao Zhao
Haopeng Kuang
Pei Zhang
Peng Zhai
Lihua Zhang
349
6
0
21 Sep 2023
SkeleTR: Towrads Skeleton-based Action Recognition in the Wild
Haodong Duan
Mingze Xu
Bing Shuai
Davide Modolo
Zhuowen Tu
Joseph Tighe
Alessandro Bergamo
ViT
249
1
0
20 Sep 2023
Collaborative Three-Stream Transformers for Video Captioning
Computer Vision and Image Understanding (CVIU), 2023
Hao Wang
Libo Zhang
Hengrui Fan
Tiejian Luo
196
8
0
18 Sep 2023
Selective Volume Mixup for Video Action Recognition
Yi Tan
Zhaofan Qiu
Y. Hao
Ting Yao
Xiangnan He
Tao Mei
ViT
213
4
0
18 Sep 2023
Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
IEEE International Conference on Computer Vision (ICCV), 2023
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Yingya Zhang
Changxin Gao
Deli Zhao
Nong Sang
218
31
0
14 Sep 2023
Predicting Routine Object Usage for Proactive Robot Assistance
Conference on Robot Learning (CoRL), 2023
Maithili Patel
Aswin Prakash
Sonia Chernova
AI4TS
258
11
0
12 Sep 2023
ATM: Action Temporality Modeling for Video Question Answering
ACM Multimedia (ACM MM), 2023
Junwen Chen
Jie Zhu
Yu Kong
216
3
0
05 Sep 2023
Towards Contrastive Learning in Music Video Domain
Karel Veldkamp
Mariya Hendriksen
Zoltán Szlávik
Alexander Keijser
SSL
213
3
0
01 Sep 2023
Uncovering the Unseen: Discover Hidden Intentions by Micro-Behavior Graph Reasoning
ACM Multimedia (ACM MM), 2023
Zhuo Zhou
Wenxuan Liu
Danni Xu
Zheng Wang
Jian Zhao
209
9
0
29 Aug 2023
UMMAFormer: A Universal Multimodal-adaptive Transformer Framework for Temporal Forgery Localization
ACM Multimedia (ACM MM), 2023
Rui Zhang
Hongxia Wang
Ming-han Du
Hanqing Liu
Yangqiaoyu Zhou
Q. Zeng
255
46
0
28 Aug 2023
Improving Video Violence Recognition with Human Interaction Learning on 3D Skeleton Point Clouds
Qingxin Xiao
Guosheng Lin
Qingyao Wu
3DH
3DPC
197
5
0
26 Aug 2023
Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video Recognition
IEEE International Conference on Computer Vision (ICCV), 2023
Qitong Wang
Long Zhao
Liangzhe Yuan
Ting Liu
Xi Peng
370
21
0
22 Aug 2023
Previous
1
2
3
4
5
6
...
27
28
29
Next