Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1608.00859
Cited By
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition
2 August 2016
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Temporal Segment Networks: Towards Good Practices for Deep Action Recognition"
50 / 1,449 papers shown
SlowFastVAD: Video Anomaly Detection via Integrating Simple Detector and RAG-Enhanced Vision-Language Model
Zongcan Ding
Han Zhang
Peng Wu
Guansong Pang
Zhiwei Yang
Peng Wang
Yujiao Shi
235
3
0
14 Apr 2025
H-MoRe: Learning Human-centric Motion Representation for Action Analysis
Computer Vision and Pattern Recognition (CVPR), 2025
Zhanbo Huang
Xiaoming Liu
Yu Kong
3DH
285
4
0
14 Apr 2025
SocialGesture: Delving into Multi-person Gesture Understanding
Computer Vision and Pattern Recognition (CVPR), 2025
Xu Cao
Pranav Virupaksha
Wenqi Jia
Bolin Lai
Fiona Ryan
Sangmin Lee
James M. Rehg
SLR
230
5
0
03 Apr 2025
Is Temporal Prompting All We Need For Limited Labeled Action Recognition?
Shreyank N. Gowda
Boyan Gao
Xiao Gu
Xiaobo Jin
VLM
351
0
0
02 Apr 2025
FDDet: Frequency-Decoupling for Boundary Refinement in Temporal Action Detection
International Conference on Intelligent Computing (ICIC), 2025
Xinnan Zhu
Yicheng Zhu
Tixin Chen
Wentao Wu
Yuanjie Dang
295
1
0
01 Apr 2025
Sample-level Adaptive Knowledge Distillation for Action Recognition
Ping Li
Chenhao Ping
Wenxiao Wang
Mingli Song
331
1
0
01 Apr 2025
OwlSight: A Robust Illumination Adaptation Framework for Dark Video Human Action Recognition
Shihao Cheng
Jinlu Zhang
Yue Liu
Zhigang Tu
VLM
218
1
0
30 Mar 2025
BEAR: A Video Dataset For Fine-grained Behaviors Recognition Oriented with Action and Environment Factors
Chengyang Hu
Yuduo Chen
Lizhuang Ma
178
0
0
26 Mar 2025
Video-ColBERT: Contextualized Late Interaction for Text-to-Video Retrieval
Computer Vision and Pattern Recognition (CVPR), 2025
Arun V. Reddy
Alexander Martin
Eugene Yang
Andrew Yates
Kate Sanders
Kenton W. Murray
Reno Kriz
Celso M. De Melo
Benjamin Van Durme
Rama Chellappa
356
9
0
24 Mar 2025
Context-Enhanced Memory-Refined Transformer for Online Action Detection
Computer Vision and Pattern Recognition (CVPR), 2025
Zhanzhong Pang
Fadime Sener
Angela Yao
OffRL
352
5
0
24 Mar 2025
Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks
Computer Vision and Pattern Recognition (CVPR), 2025
Nina Shvetsova
Arsha Nagrani
Bernt Schiele
Hilde Kuehne
Christian Rupprecht
268
1
0
24 Mar 2025
STOP: Integrated Spatial-Temporal Dynamic Prompting for Video Understanding
Computer Vision and Pattern Recognition (CVPR), 2025
Zichen Liu
Kunlun Xu
Fuchun Sun
Xu Zou
Yuxin Peng
Jiahuan Zhou
VLM
AI4TS
429
10
0
20 Mar 2025
Quantum EigenGame for excited state calculation
David Quiroga
Jason Han
Anastasios Kyrillidis
280
5
0
17 Mar 2025
Towards Scalable Modeling of Compressed Videos for Efficient Action Recognition
Shristi Das Biswas
Efstathia Soufleri
Arani Roy
Kaushik Roy
331
1
0
17 Mar 2025
Elderly Activity Recognition in the Wild: Results from the EAR Challenge
Anh-Kiet Duong
179
0
0
10 Mar 2025
OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection
Shuming Liu
Chen Zhao
Fatimah Zohra
Mattia Soldan
Alejandro Pardo
...
Juan Carlos León Alcázar
A. Cioppa
Silvio Giancola
Carlos Hinojosa
Bernard Ghanem
297
6
0
27 Feb 2025
Automatic Temporal Segmentation for Post-Stroke Rehabilitation: A Keypoint Detection and Temporal Segmentation Approach for Small Datasets
Jisoo Lee
Tamim Ahmed
Thanassis Rikakis
Pavan Turaga
231
0
0
27 Feb 2025
Online Meta-learning for AutoML in Real-time (OnMAR)
Mia Gerber
Anna Sergeevna Bosman
J. D. Villiers
OffRL
256
0
0
27 Feb 2025
EgoSpeak: Learning When to Speak for Egocentric Conversational Agents in the Wild
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Junhyeok Kim
Min Soo Kim
Jiwan Chung
Jungbin Cho
Jisoo Kim
Sungwoong Kim
Gyeongbo Sim
Youngjae Yu
EgoV
160
3
0
17 Feb 2025
Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis
Amir Hosein Fadaei
M. Dehaqani
327
0
0
11 Feb 2025
Conformal Predictions for Human Action Recognition with Vision-Language Models
Bary Tim
Fuchs Clément
Macq Benoît
VLM
356
0
0
10 Feb 2025
Seeing in the Dark: A Teacher-Student Framework for Dark Video Action Recognition via Knowledge Distillation and Contrastive Learning
Sharana Dharshikgan Suresh Dass
H. Barua
Ganesh Krishnasamy
Raveendran Paramesran
Raphael C.-W. Phan
396
0
0
06 Feb 2025
Collaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics
AAAI Conference on Artificial Intelligence (AAAI), 2025
Tze Ho Elden Tse
Runyang Feng
Linfang Zheng
Jiho Park
Yixing Gao
Jihie Kim
A. Leonardis
H. Chang
464
2
0
13 Jan 2025
Optimizing Multitask Industrial Processes with Predictive Action Guidance
IEEE Transactions on Automation Science and Engineering (T-ASE), 2025
Naval Kishore Mehta
Arvind
Shyam Sunder Prasad
Sumeet Saurav
Sanjay Singh
156
0
0
10 Jan 2025
An Efficient Adaptive Compression Method for Human Perception and Machine Vision Tasks
Lei Liu
Zhenghao Chen
Zhihao Hu
Dong Xu
269
5
0
08 Jan 2025
High-Performance Inference Graph Convolutional Networks for Skeleton-Based Action Recognition
Ziao Li
Junyi Wang
Bangli Liu
Haibin Cai
Mohamad Saada
Guhong Nie
3DH
319
0
0
08 Jan 2025
Future Aspects in Human Action Recognition: Exploring Emerging Techniques and Ethical Influences
Antonios Gasteratos
Stavros N. Moutsis
Konstantinos A. Tsintotas
Yiannis Aloimonos
194
0
0
17 Dec 2024
Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Yulin Wang
Haoji Zhang
Yang Yue
Shiji Song
Chao Deng
Junlan Feng
Gao Huang
286
12
0
15 Dec 2024
A Decade of Deep Learning: A Survey on The Magnificent Seven
Dilshod Azizov
Muhammad Arslan Manzoor
Velibor Bojkovic
Yingxu Wang
Liang Luo
...
Liang Li
Houcheng Su
Yu Zhong
Wei Liu
Shangsong Liang
OOD
AI4TS
MedIm
300
0
0
13 Dec 2024
EdgeOAR: Real-time Online Action Recognition On Edge Devices
Wei Luo
Deyu Zhang
Ying Tang
Fan Wu
Yaoxue Zhang
240
0
0
02 Dec 2024
When Spatial meets Temporal in Action Recognition
Wei Xu
Lei Wang
Yuxiao Chen
Tom Gedeon
Piotr Koniusz
301
3
0
22 Nov 2024
Privacy-Preserving Video Anomaly Detection: A Survey
Jing Liu
Zehua Wang
Xiaoguang Zhu
Jielin Li
Hao Yang
Liangyu Teng
Juncen Guo
Yan Wang
Jinjie Wei
Jing-nan Liu
524
11
0
21 Nov 2024
Video-to-Task Learning via Motion-Guided Attention for Few-Shot Action Recognition
Hanyu Guo
Wanchuan Yu
Suzhou Que
Kaiwen Du
Yan Yan
Hanzi Wang
425
2
0
18 Nov 2024
OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing
European Conference on Computer Vision (ECCV), 2024
Pranav Gupta
Rishubh Singh
Pradeep Shenoy
Ravikiran Sarvadevabhatla
221
1
0
05 Nov 2024
Lost in Context: The Influence of Context on Feature Attribution Methods for Object Recognition
Indian Conference on Computer Vision, Graphics & Image Processing (ICVGIP), 2024
Sayanta Adhikari
Rishav Kumar
Konda Reddy Mopuri
Rajalakshmi Pachamuthu
240
0
0
05 Nov 2024
Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge
Neural Information Processing Systems (NeurIPS), 2024
Weihua Du
Qiushi Lyu
Jiaming Shan
Zhenting Qi
Hongxin Zhang
...
Andi Peng
Tianmin Shu
Kwonjoon Lee
Behzad Dariush
Chuang Gan
477
9
0
04 Nov 2024
STAA: Spatio-Temporal Attention Attribution for Real-Time Interpreting Transformer-based Video Models
Zerui Wang
Yan Liu
313
6
0
01 Nov 2024
Recovering Complete Actions for Cross-dataset Skeleton Action Recognition
Neural Information Processing Systems (NeurIPS), 2024
Hanchao Liu
Yujiang Li
Tai-Jiang Mu
Shi-Min Hu
240
2
0
31 Oct 2024
Multi-Level Feature Distillation of Joint Teachers Trained on Distinct Image Datasets
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Adrian Iordache
B. Alexe
Radu Tudor Ionescu
306
2
0
29 Oct 2024
Enhancing Action Recognition by Leveraging the Hierarchical Structure of Actions and Textual Context
Computer Vision and Image Understanding (CVIU), 2024
Manuel Benavent-Lledo
David Mulero-Pérez
David Ortiz-Perez
José García Rodríguez
Antonis Argyros
319
3
0
28 Oct 2024
MM-WLAuslan: Multi-View Multi-Modal Word-Level Australian Sign Language Recognition Dataset
Neural Information Processing Systems (NeurIPS), 2024
Xin Shen
Heming Du
Hongwei Sheng
Shuyun Wang
Hui Chen
...
Xiaobiao Du
Shuyun Wang
Ruihan Lu
Qingzheng Xu
Xin Yu
SLR
188
13
0
25 Oct 2024
Making Every Frame Matter: Continuous Activity Recognition in Streaming Video via Adaptive Video Context Modeling
Hao Wu
Donglin Bai
Shiqi Jiang
Qianxi Zhang
Yue Yang
Ting Cao
Fengyuan Xu
Yunxin Liu
Fengyuan Xu
588
0
0
19 Oct 2024
Pseudo Dataset Generation for Out-of-Domain Multi-Camera View Recommendation
Visual Communications and Image Processing (VCIP), 2024
Kuan-Ying Lee
Qian Zhou
Klara Nahrstedt
265
1
0
17 Oct 2024
On-the-fly Modulation for Balanced Multimodal Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Yake Wei
D. Hu
Henghui Du
Ji-Rong Wen
240
28
0
15 Oct 2024
VidCompress: Memory-Enhanced Temporal Compression for Video Understanding in Large Language Models
Xiaohan Lan
Yitian Yuan
Zequn Jie
Lin Ma
VLM
181
4
0
15 Oct 2024
Movie Trailer Genre Classification Using Multimodal Pretrained Features
Expert systems with applications (ESWA), 2024
Serkan Sulun
Paula Viana
M. Davies
CLIP
214
8
0
11 Oct 2024
Fourier-based Action Recognition for Wildlife Behavior Quantification with Event Cameras
Advanced Intelligent Systems (AIS), 2024
Friedhelm Hamann
Suman Ghosh
Ignacio Juarez Martinez
Tom Hart
Alex Kacelnik
Guillermo Gallego
211
4
0
09 Oct 2024
Cefdet: Cognitive Effectiveness Network Based on Fuzzy Inference for Action Detection
ACM Multimedia (MM), 2024
Zhe Luo
Weina Fu
Shuai Liu
Saeed Anwar
Muhammad Saqib
Sambit Bakshi
Khan Muhammad
251
4
0
08 Oct 2024
Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
Haibo Wang
Zhiyang Xu
Yu Cheng
Shizhe Diao
Jiuxiang Gu
Yixin Cao
Qifan Wang
Weifeng Ge
Lifu Huang
262
55
0
04 Oct 2024
Loose Social-Interaction Recognition in Real-world Therapy Scenarios
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Abid Ali
Rui Dai
Ashish Marisetty
Guillaume Astruc
Monique Thonnat
J. Odobez
Susanne Thümmler
Francois Bremond
277
1
0
30 Sep 2024
Previous
1
2
3
4
5
...
27
28
29
Next