1608.00859
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition
2 August 2016
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
ViT
Papers citing
"Temporal Segment Networks: Towards Good Practices for Deep Action Recognition"
50 / 1,449 papers shown
EgoLM: Multi-Modal Language Model of Egocentric Motions
Computer Vision and Pattern Recognition (CVPR), 2024
Fangzhou Hong
Vladimir Guzov
Hyo Jin Kim
Yuting Ye
Richard Newcombe
Ziwei Liu
Lingni Ma
166
12
0
26 Sep 2024
Deep Learning for Video Anomaly Detection: A Review
Peng Wu
Chengyu Pan
Yuting Yan
Guansong Pang
Peng Wang
Yanning Zhang
VLM
AI4TS
192
30
0
09 Sep 2024
Self-Supervised Contrastive Learning for Videos using Differentiable Local Alignment
Keyne Oei
Amr Gomaa
Anna Maria Feit
João Belo
302
1
0
06 Sep 2024
GMFL-Net: A Global Multi-geometric Feature Learning Network for Repetitive Action Counting
Jun Li
Jinying Wu
Qiming Li
Feifei Guo
271
1
0
31 Aug 2024
Joint Temporal Pooling for Improving Skeleton-based Action Recognition
International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2023
Shanaka Ramesh Gunasekara
Wanqing Li
Jack Yang
P. Ogunbona
179
3
0
18 Aug 2024
Flatten: Video Action Recognition is an Image Classification task
Junlin Chen
Chengcheng Xu
Yangfan Xu
Zhiqiang Wang
Jun Yu Li
Zhiping Shi
224
2
0
17 Aug 2024
Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation Approach
European Conference on Computer Vision (ECCV), 2024
Shizhou Zhang
Wenlong Luo
De Cheng
Qingchun Yang
Lingyan Ran
Yinghui Xing
Yanning Zhang
VOS
197
17
0
14 Aug 2024
Dynamic and Compressive Adaptation of Transformers From Images to Videos
Guozhen Zhang
Jingyu Liu
Shengming Cao
Xiaotong Zhao
Kevin Zhao
Kai Ma
Limin Wang
ViT
438
2
0
13 Aug 2024
HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization
European Conference on Computer Vision (ECCV), 2024
Sakib Reza
Yuexi Zhang
Mohsen Moghaddam
Mario Sznaier
207
5
0
12 Aug 2024
Weakly Supervised Video Anomaly Detection and Localization with Spatio-Temporal Prompts
ACM Multimedia (MM), 2024
Peng Wu
Xuerong Zhou
Guansong Pang
Zhiwei Yang
Qingsen Yan
Peng Wang
Yanning Zhang
394
39
0
12 Aug 2024
FADE: A Dataset for Detecting Falling Objects around Buildings in Video
IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2024
Zhigang Tu
Zhengbo Zhang
Zitao Gao
Chunluan Zhou
J. Yuan
Bo Du
411
1
0
11 Aug 2024
A Methodological and Structural Review of Hand Gesture Recognition Across Diverse Data Modalities
IEEE Access, 2024
Jungpil Shin
Abu Saleh Musa Miah
Md. Humaun Kabir
M. Rahim
Abdullah Al Shiam
234
40
0
10 Aug 2024
MU-MAE: Multimodal Masked Autoencoders-Based One-Shot Learning
Conference on Multimedia Information Processing and Retrieval (MIPR), 2024
Rex Liu
Xin Liu
247
1
0
08 Aug 2024
Online Temporal Action Localization with Memory-Augmented Transformer
European Conference on Computer Vision (ECCV), 2024
Youngkil Song
Dongkeun Kim
Minsu Cho
Suha Kwak
214
3
0
06 Aug 2024
RICA2: Rubric-Informed, Calibrated Assessment of Actions
European Conference on Computer Vision (ECCV), 2024
Abrar Majeedi
Viswanatha Reddy Gajjala
Satya Sai Srinath Namburi Gnvv
Yin Li
CML
424
11
0
04 Aug 2024
Text-Guided Video Masked Autoencoder
European Conference on Computer Vision (ECCV), 2024
D. Fan
Jue Wang
Shuai Liao
Zhikang Zhang
Vimal Bhat
Xinyu Li
VGen
149
7
0
01 Aug 2024
Hyper-parameter tuning for text guided image editing
Shiwen Zhang
DiffM
216
3
0
31 Jul 2024
Start from Video-Music Retrieval: An Inter-Intra Modal Loss for Cross Modal Retrieval
Zeyu Chen
Pengfei Zhang
Kai Ye
Wei Dong
Xin Feng
Yana Zhang
199
1
0
28 Jul 2024
Is 3D Convolution with 5D Tensors Really Necessary for Video Analysis?
Habib Hajimolahoseini
Walid Ahmed
Austin Wen
Yang Liu
214
0
0
23 Jul 2024
SOAP: Enhancing Spatio-Temporal Relation and Motion Information Capturing for Few-Shot Action Recognition
ACM Multimedia (MM), 2024
Wenbo Huang
Jinghui Zhang
Xuwei Qian
Zhen Wu
Meng Wang
Lei Zhang
249
7
0
23 Jul 2024
A Comprehensive Review of Few-shot Action Recognition
Yuyang Wanyan
Xiaoshan Yang
Weiming Dong
Changsheng Xu
VLM
498
13
0
20 Jul 2024
Pose-guided multi-task video transformer for driver action recognition
Ricardo Pizarro
Roberto Valle
L. Bergasa
J. M. Buenaposada
Luis Baumela
ViT
182
0
0
18 Jul 2024
ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos
Hyolim Kang
Jeongseok Hyun
Joungbin An
Youngjae Yu
Seon Joo Kim
148
1
0
17 Jul 2024
Human-Centric Transformer for Domain Adaptive Action Recognition
Kun-Yu Lin
Jiaming Zhou
Wei-Shi Zheng
217
9
0
15 Jul 2024
Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding
Minghui Wu
Chenxu Zhao
Anyang Su
Donglin Di
Tianyu Fu
...
Min He
Ya Gao
Meng Ma
Kun Yan
Ping Wang
265
6
0
11 Jul 2024
Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization
Feixiang Zhou
Bryan M. Williams
Hossein Rahmani
193
3
0
10 Jul 2024
C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition
Rongchang Li
Zhenhua Feng
Tianyang Xu
Linze Li
Xiao-Jun Wu
Muhammad Awais
Sara Atito
Josef Kittler
CoGe
356
11
0
08 Jul 2024
DMSD-CDFSAR: Distillation from Mixed-Source Domain for Cross-Domain Few-shot Action Recognition
Fei-Yu Guo
YiKang Wang
Han Qi
Li Zhu
Jing Sun
362
5
0
08 Jul 2024
MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices
Jianwen Jiang
Gaojie Lin
Zhengkun Rong
Chao Liang
Yongming Zhu
Jiaqi Yang
Tianyun Zhong
3DH
373
13
0
08 Jul 2024
Computer Vision for Clinical Gait Analysis: A Gait Abnormality Video Dataset
Rahm Ranjan
David Ahmedt-Aristizabal
M. Armin
Juno Kim
231
13
0
05 Jul 2024
DyFADet: Dynamic Feature Aggregation for Temporal Action Detection
Le Yang
Ziwei Zheng
Yizeng Han
Hao-Ran Cheng
Shiji Song
Gao Huang
Fan Li
288
21
0
03 Jul 2024
SVFormer: A Direct Training Spiking Transformer for Efficient Video Action Recognition
Liutao Yu
Liwei Huang
Chenlin Zhou
Han Zhang
Zhengyu Ma
Huihui Zhou
Yonghong Tian
ViT
215
7
0
21 Jun 2024
Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation
Yuanhao Zhai
Kevin Lin
Zhengyuan Yang
Linjie Li
Jianfeng Wang
Chung-Ching Lin
David Doermann
Junsong Yuan
Lijuan Wang
VGen
DiffM
224
25
0
11 Jun 2024
SVASTIN: Sparse Video Adversarial Attack via Spatio-Temporal Invertible Neural Networks
Yi Pan
Jun-Jie Huang
Zihan Chen
Wentao Zhao
Ziyue Wang
182
4
0
04 Jun 2024
Object Aware Egocentric Online Action Detection
Joungbin An
Yunsu Park
Hyolim Kang
Seon Joo Kim
EgoV
196
1
0
03 Jun 2024
RNNs, CNNs and Transformers in Human Action Recognition: A Survey and a Hybrid Model
Khaled Alomar
Halil Ibrahim Aysel
Xiaohao Cai
MedIm
ViT
288
24
0
02 Jun 2024
Flow-Assisted Motion Learning Network for Weakly-Supervised Group Activity Recognition
Muhammad Adi Nugroho
Sangmin Woo
Sumin Lee
Jinyoung Park
Yooseung Wang
Donguk Kim
Changick Kim
159
3
0
28 May 2024
MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities
Hao Dong
Yue Zhao
Eleni Chatzi
Olga Fink
OODD
194
25
0
27 May 2024
Flow Snapshot Neurons in Action: Deep Neural Networks Generalize to Biological Motion Perception
Shuangpeng Han
Ziyu Wang
Mengmi Zhang
261
1
0
26 May 2024
From CNNs to Transformers in Multimodal Human Action Recognition: A Survey
Muhammad Bilal Shaikh
Syed Mohammed Shamsul Islam
Douglas Chai
Naveed Akhtar
310
29
0
22 May 2024
OpenGait: A Comprehensive Benchmark Study for Gait Recognition towards Better Practicality
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Chao Fan
Saihui Hou
Junhao Liang
Chuanfu Shen
Jingzhe Ma
Dongyang Jin
Yongzhen Huang
Shiqi Yu
297
14
0
15 May 2024
No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding
Yingjie Zhai
Wenshuo Li
Yehui Tang
Xinghao Chen
Yunhe Wang
ViT
211
2
0
14 May 2024
Deep video representation learning: a survey
Elham Ravanbakhsh
Yongqing Liang
J. Ramanujam
Xin Li
183
5
0
10 May 2024
A Survey on Backbones for Deep Video Action Recognition
Zixuan Tang
Youjun Zhao
Yuhang Wen
Mengyuan Liu
152
3
0
09 May 2024
Bidirectional Progressive Transformer for Interaction Intention Anticipation
European Conference on Computer Vision (ECCV), 2024
Zichen Zhang
Hongcheng Luo
Wei Zhai
Yang Cao
Yu Kang
308
8
0
09 May 2024
MVP-Shot: Multi-Velocity Progressive-Alignment Framework for Few-Shot Action Recognition
IEEE Transactions on Multimedia (IEEE TMM), 2024
Hongyu Qu
Rui Yan
Xiangbo Shu
Haoliang Gao
Peng Huang
Guo-Sen Xie
398
15
0
03 May 2024
Uncertainty-boosted Robust Video Activity Anticipation
Zhaobo Qi
Shuhui Wang
Weigang Zhang
Qingming Huang
280
10
0
29 Apr 2024
Movie101v2: Improved Movie Narration Benchmark
Zihao Yue
Yepeng Zhang
Ziheng Wang
Qin Jin
VGen
277
3
0
20 Apr 2024
STAT: Towards Generalizable Temporal Action Localization
Yangcen Liu
Ziyi Liu
Yuanhao Zhai
Wen Li
David Doermann
Junsong Yuan
225
3
0
20 Apr 2024
On the Content Bias in Fréchet Video Distance
Jason S. Hoffman
Aniruddha Mahapatra
Gaurav Parmar
Jun-Yan Zhu
Jia-Bin Huang
EGVM
224
32
0
18 Apr 2024