Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1812.05038
Cited By
v1
v2 (latest)
Long-Term Feature Banks for Detailed Video Understanding
12 December 2018
Chao-Yuan Wu
Christoph Feichtenhofer
Haoqi Fan
Kaiming He
Philipp Krahenbuhl
Ross B. Girshick
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Long-Term Feature Banks for Detailed Video Understanding"
50 / 313 papers shown
Title
Towards Weakly Supervised End-to-end Learning for Long-video Action Recognition
Jiaming Zhou
Hanjun Li
Kun-Yu Lin
Junwei Liang
279
2
0
28 Nov 2023
Query by Activity Video in the Wild
International Conference on Information Photonics (ICIP), 2023
Tao Hu
William Thong
Pascal Mettes
Cees G. M. Snoek
196
0
0
23 Nov 2023
Event Causality Is Key to Computational Story Understanding
Yidan Sun
Qin Chao
Boyang Albert Li
294
11
0
16 Nov 2023
Beyond still images: Temporal features and input variance resilience
Scientific Reports (Sci Rep), 2023
AmirHosein Fadaei
M. Dehaqani
258
0
0
01 Nov 2023
Object-centric Video Representation for Long-term Action Anticipation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Ce Zhang
Changcheng Fu
Shijie Wang
Nakul Agarwal
Kwonjoon Lee
Chiho Choi
Chen Sun
251
29
0
31 Oct 2023
ChimpACT: A Longitudinal Dataset for Understanding Chimpanzee Behaviors
Neural Information Processing Systems (NeurIPS), 2023
Xiaoxuan Ma
Stephan P. Kaufhold
Jiajun Su
Wentao Zhu
Jack Terwilliger
Andres Meza
Yixin Zhu
Federico Rossano
Yizhou Wang
206
26
0
25 Oct 2023
Flow Dynamics Correction for Action Recognition
Lei Wang
Piotr Koniusz
213
15
0
16 Oct 2023
A Grammatical Compositional Model for Video Action Detection
Zhijun Zhang
Xu Zou
Jiahuan Zhou
Sheng Zhong
Ying Wu
220
0
0
04 Oct 2023
A Survey on Deep Learning Techniques for Action Anticipation
Zeyun Zhong
Manuel Martin
Michael Voit
Juergen Gall
Jürgen Beyerer
289
14
0
29 Sep 2023
SkeleTR: Towrads Skeleton-based Action Recognition in the Wild
Haodong Duan
Mingze Xu
Bing Shuai
Davide Modolo
Zhuowen Tu
Joseph Tighe
Alessandro Bergamo
ViT
217
1
0
20 Sep 2023
JOADAA: joint online action detection and action anticipation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Mohammed Guermal
François Brémond
Rui Dai
Abid Ali
169
11
0
12 Sep 2023
Object-Centric Multiple Object Tracking
IEEE International Conference on Computer Vision (ICCV), 2023
Zixu Zhao
Jiaze Wang
Max Horn
Yizhuo Ding
Tong He
...
Bernt Schiele
Yanwei Fu
Francesco Locatello
Zheng Zhang
Tianjun Xiao
VOT
OCL
314
9
0
01 Sep 2023
MOFO: MOtion FOcused Self-Supervision for Video Understanding
Mona Ahmadian
Frank Guerin
Andrew Gilbert
267
4
0
23 Aug 2023
Video BagNet: short temporal receptive fields increase robustness in long-term action recognition
Ombretta Strafforello
X. Liu
Klamer Schutte
Jan van Gemert
129
3
0
22 Aug 2023
View while Moving: Efficient Video Recognition in Long-untrimmed Videos
ACM Multimedia (ACM MM), 2023
Ye Tian
Meng Yang
Lanshan Zhang
Zhizhen Zhang
Yang Liu
Xiao-Zhu Xie
Xirong Que
Wendong Wang
247
10
0
09 Aug 2023
A Survey on Deep Learning-based Spatio-temporal Action Detection
Peng Wang
Fanwei Zeng
Yu Qian
216
8
0
03 Aug 2023
Relation-Aware Distribution Representation Network for Person Clustering with Multiple Modalities
IEEE transactions on multimedia (IEEE TMM), 2023
Kaijian Liu
Weizhen He
Ziyue Li
Zhishuai Li
Mengwei He
Feng Zhu
Rui Zhao
3DH
130
3
0
01 Aug 2023
MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
Computer Vision and Pattern Recognition (CVPR), 2023
Enxin Song
Wenhao Chai
Guanhong Wang
Yucheng Zhang
Haoyang Zhou
...
Tianbo Ye
Yanting Zhang
Yang Lu
Lei Li
Gaoang Wang
VLM
MLLM
575
450
0
31 Jul 2023
TUNeS: A Temporal U-Net with Self-Attention for Video-based Surgical Phase Recognition
IEEE Transactions on Biomedical Engineering (IEEE Trans. Biomed. Eng.), 2023
Isabel Funke
Dominik Rivoir
Stefanie Krell
Stefanie Speidel
350
11
0
19 Jul 2023
What Can Simple Arithmetic Operations Do for Temporal Modeling?
IEEE International Conference on Computer Vision (ICCV), 2023
Wenhao Wu
Yuxin Song
Zhun Sun
Jingdong Wang
Chang Xu
Wanli Ouyang
193
16
0
18 Jul 2023
Human-to-Human Interaction Detection
International Conference on Neural Information Processing (ICONIP), 2023
Zhenhua Wang
Kaining Ying
Jiajun Meng
J. Ning
305
3
0
02 Jul 2023
How can objects help action recognition?
Computer Vision and Pattern Recognition (CVPR), 2023
Xingyi Zhou
Anurag Arnab
Chen Sun
Cordelia Schmid
202
25
0
20 Jun 2023
Of Mice and Mates: Automated Classification and Modelling of Mouse Behaviour in Groups using a Single Model across Cages
International Journal of Computer Vision (IJCV), 2023
Michael P. J. Camilleri
R. Bains
Christopher K. I. Williams
152
3
0
05 Jun 2023
Metrics Matter in Surgical Phase Recognition
Isabel Funke
Dominik Rivoir
Stefanie Speidel
139
16
0
23 May 2023
Modelling Spatio-Temporal Interactions for Compositional Action Recognition
Ramanathan Rajendiran
Debaditya Roy
Basura Fernando
212
1
0
04 May 2023
End-to-End Spatio-Temporal Action Localisation with Video Transformers
Computer Vision and Pattern Recognition (CVPR), 2023
A. Gritsenko
Xuehan Xiong
Josip Djolonga
Mostafa Dehghani
Chen Sun
Mario Lucic
Cordelia Schmid
Anurag Arnab
ViT
235
20
0
24 Apr 2023
MRSN: Multi-Relation Support Network for Video Action Detection
IEEE International Conference on Multimedia and Expo (ICME), 2023
Yin-Dong Zheng
Guo Chen
Minglei Yuan
Tong Lu
248
9
0
24 Apr 2023
Efficient Video Action Detection with Token Dropout and Context Refinement
IEEE International Conference on Computer Vision (ICCV), 2023
Lei Chen
Zhan Tong
Yibing Song
Gangshan Wu
Limin Wang
289
25
0
17 Apr 2023
Verbs in Action: Improving verb understanding in video-language models
IEEE International Conference on Computer Vision (ICCV), 2023
Liliane Momeni
Mathilde Caron
Arsha Nagrani
Andrew Zisserman
Cordelia Schmid
349
87
0
13 Apr 2023
Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection
Wei-Jhe Huang
Jheng-Hsien Yeh
Min-Hung Chen
Gueter Josmy Faure
S. Lai
285
5
0
10 Apr 2023
Boundary-Denoising for Video Activity Localization
International Conference on Learning Representations (ICLR), 2023
Mengmeng Xu
Mattia Soldan
Jialin Gao
Shuming Liu
Juan-Manuel Perez-Rua
Guohao Li
172
15
0
06 Apr 2023
VicTR: Video-conditioned Text Representations for Activity Recognition
Computer Vision and Pattern Recognition (CVPR), 2023
Kumara Kahatapitiya
Anurag Arnab
Arsha Nagrani
Michael S. Ryoo
316
36
0
05 Apr 2023
DOAD: Decoupled One Stage Action Detection Network
Shuning Chang
Pichao Wang
Fan Wang
Jiashi Feng
Mike Zheng Show
182
6
0
01 Apr 2023
Streaming Video Model
Computer Vision and Pattern Recognition (CVPR), 2023
Yucheng Zhao
Chong Luo
Chuanxin Tang
DongDong Chen
Noel Codella
Zhengjun Zha
225
18
0
30 Mar 2023
CycleACR: Cycle Modeling of Actor-Context Relations for Video Action Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Lei Chen
Zhan Tong
Yibing Song
Gangshan Wu
Limin Wang
175
3
0
28 Mar 2023
Open Set Action Recognition via Multi-Label Evidential Learning
Computer Vision and Pattern Recognition (CVPR), 2023
Chen Zhao
Dawei Du
A. Hoogs
Christopher Funk
EDL
155
35
0
27 Feb 2023
YOWOv2: A Stronger yet Efficient Multi-level Detection Framework for Real-time Spatio-temporal Action Detection
International Conference on Intelligent Robotics and Applications (ICIRA), 2023
Jianhua Yang
Kun Dai
ObjD
223
21
0
14 Feb 2023
Program Generation from Diverse Video Demonstrations
British Machine Vision Conference (BMVC), 2023
Anthony Manchin
Jamie Sherrah
Qi Wu
Anton Van Den Hengel
VGen
71
0
0
01 Feb 2023
Video Semantic Segmentation with Inter-Frame Feature Fusion and Inner-Frame Feature Refinement
Jiafan Zhuang
Zilei Wang
Junjie Li
VOS
229
3
0
10 Jan 2023
HierVL: Learning Hierarchical Video-Language Embeddings
Computer Vision and Pattern Recognition (CVPR), 2023
Kumar Ashutosh
Rohit Girdhar
Lorenzo Torresani
Kristen Grauman
VLM
AI4TS
398
69
0
05 Jan 2023
Deep set conditioned latent representations for action recognition
VISIGRAPP (VISIGRAPP), 2022
Akash Singh
Tom De Schepper
Kevin Mets
P. Hellinckx
José Oramas
Steven Latré
BDL
151
2
0
21 Dec 2022
A Survey on Human Action Recognition
Zhou Shuchang
194
0
0
20 Dec 2022
Weakly Supervised Video Anomaly Detection Based on Cross-Batch Clustering Guidance
IEEE International Conference on Multimedia and Expo (ICME), 2022
Congqi Cao
Xin Zhang
Shizhou Zhang
Peng Wang
Yanning Zhang
109
8
0
16 Dec 2022
EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with Visual Queries
IEEE International Conference on Computer Vision (ICCV), 2022
Jinjie Mai
Abdullah Hamdi
Silvio Giancola
Chen Zhao
Guohao Li
EgoV
246
21
0
14 Dec 2022
Ego Vehicle Speed Estimation using 3D Convolution with Masked Attention
Athul M. Mathew
Thariq Khalid
121
3
0
11 Dec 2022
Exploiting Completeness and Uncertainty of Pseudo Labels for Weakly Supervised Video Anomaly Detection
Computer Vision and Pattern Recognition (CVPR), 2022
Chen Zhang
Guorong Li
Yuankai Qi
Shuhui Wang
Laiyun Qing
Qingming Huang
Ming-Hsuan Yang
196
90
0
08 Dec 2022
Spatio-Temporal Crop Aggregation for Video Representation Learning
IEEE International Conference on Computer Vision (ICCV), 2022
Sepehr Sameni
Simon Jenni
Paolo Favaro
267
4
0
30 Nov 2022
Re^2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization
Computer Vision and Pattern Recognition (CVPR), 2022
Chen Zhao
Shuming Liu
K. Mangalam
Guohao Li
266
27
0
25 Nov 2022
Multi-Task Learning of Object State Changes from Uncurated Videos
Tomávs Souvcek
Jean-Baptiste Alayrac
Antoine Miech
Ivan Laptev
Josef Sivic
189
13
0
24 Nov 2022
Discovering A Variety of Objects in Spatio-Temporal Human-Object Interactions
Yong-Lu Li
Hongwei Fan
Zuoyu Qiu
Yiming Dou
Liang Xu
...
Peiyang Guo
Haisheng Su
Dongliang Wang
Wei Wu
Cewu Lu
169
7
0
14 Nov 2022
Previous
1
2
3
4
5
6
7
Next