Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1904.09288
Cited By
STEP: Spatio-Temporal Progressive Learning for Video Action Detection
19 April 2019
Xitong Yang
Xiaodong Yang
Ming-Yu Liu
Fanyi Xiao
L. Davis
Jan Kautz
Re-assign community
ArXiv
PDF
HTML
Papers citing
"STEP: Spatio-Temporal Progressive Learning for Video Action Detection"
21 / 21 papers shown
Title
Action tube generation by person query matching for spatio-temporal action detection
Kazuki Omi
Jion Oshima
Toru Tamaki
60
0
0
17 Mar 2025
Semi-supervised Active Learning for Video Action Detection
Aayush Singh
A. J. Rana
Akash Kumar
Shruti Vyas
Y. S. Rawat
28
7
0
12 Dec 2023
A Survey on Deep Learning-based Spatio-temporal Action Detection
Peng Wang
Fanwei Zeng
Yu Qian
26
5
0
03 Aug 2023
DOAD: Decoupled One Stage Action Detection Network
Shuning Chang
Pichao Wang
Fan Wang
Jiashi Feng
Mike Zheng Show
13
4
0
01 Apr 2023
MINOTAUR: Multi-task Video Grounding From Multimodal Queries
Raghav Goyal
E. Mavroudi
Xitong Yang
Sainbayar Sukhbaatar
Leonid Sigal
Matt Feiszli
Lorenzo Torresani
Du Tran
12
7
0
16 Feb 2023
Context Understanding in Computer Vision: A Survey
Xuan Wang
Zhigang Zhu
13
45
0
10 Feb 2023
RADNet: A Deep Neural Network Model for Robust Perception in Moving Autonomous Systems
B. Mudassar
Sho Ko
Maojingjing Li
Priyabrata Saha
Saibal Mukhopadhyay
11
2
0
30 Apr 2022
E^2TAD: An Energy-Efficient Tracking-based Action Detector
Xin Hu
Zhenyu Wu
Haoyuan Miao
Siqi Fan
Taiyu Long
...
Pengcheng Pi
Yi Wu
Zhou Ren
Zhangyang Wang
G. Hua
19
2
0
09 Apr 2022
Point3D: tracking actions as moving points with 3D CNNs
Shentong Mo
Jingfei Xia
Xiaoqing Ellen Tan
Bhiksha Raj
3DPC
18
5
0
20 Mar 2022
Hierarchical Deep Residual Reasoning for Temporal Moment Localization
Ziyang Ma
Xianjing Han
Xuemeng Song
Yiran Cui
Liqiang Nie
13
9
0
31 Oct 2021
Deep Learning-based Action Detection in Untrimmed Videos: A Survey
Elahe Vahdani
Yingli Tian
38
60
0
30 Sep 2021
Towards Long-Form Video Understanding
Chaoxia Wu
Philipp Krahenbuhl
VLM
ViT
36
165
0
21 Jun 2021
MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions
Yixuan Li
Lei Chen
Runyu He
Zhenzhi Wang
Gangshan Wu
Limin Wang
19
97
0
16 May 2021
Self-Supervised Pillar Motion Learning for Autonomous Driving
Chenxu Luo
Xiaodong Yang
Alan Yuille
SSL
3DPC
20
66
0
18 Apr 2021
ROAD: The ROad event Awareness Dataset for Autonomous Driving
Gurkirt Singh
Stephen Akrigg
Manuele Di Maio
Valentina Fontana
Reza Javanmard Alitappeh
...
Salman Khan
S. Grazioso
Andrew Bradley
G. Gironimo
Fabio Cuzzolin
27
89
0
23 Feb 2021
Multi-shot Temporal Event Localization: a Benchmark
Xiaolong Liu
Yao Hu
S. Bai
Fei Ding
X. Bai
Philip H. S. Torr
36
81
0
17 Dec 2020
We don't Need Thousand Proposals
:
\colon
:
Single Shot Actor-Action Detection in Videos
A. J. Rana
Y. S. Rawat
ViT
13
11
0
22 Nov 2020
Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization
Junting Pan
Siyu Chen
Zheng Shou
Yu Liu
Jing Shao
Hongsheng Li
3DPC
15
150
0
14 Jun 2020
X3D: Expanding Architectures for Efficient Video Recognition
Christoph Feichtenhofer
66
998
0
09 Apr 2020
Actions as Moving Points
Yixuan Li
Zixu Wang
Limin Wang
Gangshan Wu
14
104
0
14 Jan 2020
You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization
Okan Kopuklu
Xiangyu Wei
Gerhard Rigoll
20
143
0
15 Nov 2019
1