ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.05038
  4. Cited By
Long-Term Feature Banks for Detailed Video Understanding
v1v2 (latest)

Long-Term Feature Banks for Detailed Video Understanding

12 December 2018
Chao-Yuan Wu
Christoph Feichtenhofer
Haoqi Fan
Kaiming He
Philipp Krahenbuhl
Ross B. Girshick
ArXiv (abs)PDFHTML

Papers citing "Long-Term Feature Banks for Detailed Video Understanding"

50 / 313 papers shown
Title
Towards Weakly Supervised End-to-end Learning for Long-video Action
  Recognition
Towards Weakly Supervised End-to-end Learning for Long-video Action Recognition
Jiaming Zhou
Hanjun Li
Kun-Yu Lin
Junwei Liang
279
2
0
28 Nov 2023
Query by Activity Video in the Wild
Query by Activity Video in the WildInternational Conference on Information Photonics (ICIP), 2023
Tao Hu
William Thong
Pascal Mettes
Cees G. M. Snoek
196
0
0
23 Nov 2023
Event Causality Is Key to Computational Story Understanding
Event Causality Is Key to Computational Story Understanding
Yidan Sun
Qin Chao
Boyang Albert Li
294
11
0
16 Nov 2023
Beyond still images: Temporal features and input variance resilience
Beyond still images: Temporal features and input variance resilienceScientific Reports (Sci Rep), 2023
AmirHosein Fadaei
M. Dehaqani
258
0
0
01 Nov 2023
Object-centric Video Representation for Long-term Action Anticipation
Object-centric Video Representation for Long-term Action AnticipationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Ce Zhang
Changcheng Fu
Shijie Wang
Nakul Agarwal
Kwonjoon Lee
Chiho Choi
Chen Sun
251
29
0
31 Oct 2023
ChimpACT: A Longitudinal Dataset for Understanding Chimpanzee Behaviors
ChimpACT: A Longitudinal Dataset for Understanding Chimpanzee BehaviorsNeural Information Processing Systems (NeurIPS), 2023
Xiaoxuan Ma
Stephan P. Kaufhold
Jiajun Su
Wentao Zhu
Jack Terwilliger
Andres Meza
Yixin Zhu
Federico Rossano
Yizhou Wang
206
26
0
25 Oct 2023
Flow Dynamics Correction for Action Recognition
Flow Dynamics Correction for Action Recognition
Lei Wang
Piotr Koniusz
213
15
0
16 Oct 2023
A Grammatical Compositional Model for Video Action Detection
A Grammatical Compositional Model for Video Action Detection
Zhijun Zhang
Xu Zou
Jiahuan Zhou
Sheng Zhong
Ying Wu
220
0
0
04 Oct 2023
A Survey on Deep Learning Techniques for Action Anticipation
A Survey on Deep Learning Techniques for Action Anticipation
Zeyun Zhong
Manuel Martin
Michael Voit
Juergen Gall
Jürgen Beyerer
289
14
0
29 Sep 2023
SkeleTR: Towrads Skeleton-based Action Recognition in the Wild
SkeleTR: Towrads Skeleton-based Action Recognition in the Wild
Haodong Duan
Mingze Xu
Bing Shuai
Davide Modolo
Zhuowen Tu
Joseph Tighe
Alessandro Bergamo
ViT
217
1
0
20 Sep 2023
JOADAA: joint online action detection and action anticipation
JOADAA: joint online action detection and action anticipationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Mohammed Guermal
François Brémond
Rui Dai
Abid Ali
169
11
0
12 Sep 2023
Object-Centric Multiple Object Tracking
Object-Centric Multiple Object TrackingIEEE International Conference on Computer Vision (ICCV), 2023
Zixu Zhao
Jiaze Wang
Max Horn
Yizhuo Ding
Tong He
...
Bernt Schiele
Yanwei Fu
Francesco Locatello
Zheng Zhang
Tianjun Xiao
VOTOCL
314
9
0
01 Sep 2023
MOFO: MOtion FOcused Self-Supervision for Video Understanding
MOFO: MOtion FOcused Self-Supervision for Video Understanding
Mona Ahmadian
Frank Guerin
Andrew Gilbert
267
4
0
23 Aug 2023
Video BagNet: short temporal receptive fields increase robustness in
  long-term action recognition
Video BagNet: short temporal receptive fields increase robustness in long-term action recognition
Ombretta Strafforello
X. Liu
Klamer Schutte
Jan van Gemert
129
3
0
22 Aug 2023
View while Moving: Efficient Video Recognition in Long-untrimmed Videos
View while Moving: Efficient Video Recognition in Long-untrimmed VideosACM Multimedia (ACM MM), 2023
Ye Tian
Meng Yang
Lanshan Zhang
Zhizhen Zhang
Yang Liu
Xiao-Zhu Xie
Xirong Que
Wendong Wang
247
10
0
09 Aug 2023
A Survey on Deep Learning-based Spatio-temporal Action Detection
A Survey on Deep Learning-based Spatio-temporal Action Detection
Peng Wang
Fanwei Zeng
Yu Qian
216
8
0
03 Aug 2023
Relation-Aware Distribution Representation Network for Person Clustering
  with Multiple Modalities
Relation-Aware Distribution Representation Network for Person Clustering with Multiple ModalitiesIEEE transactions on multimedia (IEEE TMM), 2023
Kaijian Liu
Weizhen He
Ziyue Li
Zhishuai Li
Mengwei He
Feng Zhu
Rui Zhao
3DH
130
3
0
01 Aug 2023
MovieChat: From Dense Token to Sparse Memory for Long Video
  Understanding
MovieChat: From Dense Token to Sparse Memory for Long Video UnderstandingComputer Vision and Pattern Recognition (CVPR), 2023
Enxin Song
Wenhao Chai
Guanhong Wang
Yucheng Zhang
Haoyang Zhou
...
Tianbo Ye
Yanting Zhang
Yang Lu
Lei Li
Gaoang Wang
VLMMLLM
575
450
0
31 Jul 2023
TUNeS: A Temporal U-Net with Self-Attention for Video-based Surgical
  Phase Recognition
TUNeS: A Temporal U-Net with Self-Attention for Video-based Surgical Phase RecognitionIEEE Transactions on Biomedical Engineering (IEEE Trans. Biomed. Eng.), 2023
Isabel Funke
Dominik Rivoir
Stefanie Krell
Stefanie Speidel
350
11
0
19 Jul 2023
What Can Simple Arithmetic Operations Do for Temporal Modeling?
What Can Simple Arithmetic Operations Do for Temporal Modeling?IEEE International Conference on Computer Vision (ICCV), 2023
Wenhao Wu
Yuxin Song
Zhun Sun
Jingdong Wang
Chang Xu
Wanli Ouyang
193
16
0
18 Jul 2023
Human-to-Human Interaction Detection
Human-to-Human Interaction DetectionInternational Conference on Neural Information Processing (ICONIP), 2023
Zhenhua Wang
Kaining Ying
Jiajun Meng
J. Ning
305
3
0
02 Jul 2023
How can objects help action recognition?
How can objects help action recognition?Computer Vision and Pattern Recognition (CVPR), 2023
Xingyi Zhou
Anurag Arnab
Chen Sun
Cordelia Schmid
202
25
0
20 Jun 2023
Of Mice and Mates: Automated Classification and Modelling of Mouse
  Behaviour in Groups using a Single Model across Cages
Of Mice and Mates: Automated Classification and Modelling of Mouse Behaviour in Groups using a Single Model across CagesInternational Journal of Computer Vision (IJCV), 2023
Michael P. J. Camilleri
R. Bains
Christopher K. I. Williams
152
3
0
05 Jun 2023
Metrics Matter in Surgical Phase Recognition
Metrics Matter in Surgical Phase Recognition
Isabel Funke
Dominik Rivoir
Stefanie Speidel
139
16
0
23 May 2023
Modelling Spatio-Temporal Interactions for Compositional Action
  Recognition
Modelling Spatio-Temporal Interactions for Compositional Action Recognition
Ramanathan Rajendiran
Debaditya Roy
Basura Fernando
212
1
0
04 May 2023
End-to-End Spatio-Temporal Action Localisation with Video Transformers
End-to-End Spatio-Temporal Action Localisation with Video TransformersComputer Vision and Pattern Recognition (CVPR), 2023
A. Gritsenko
Xuehan Xiong
Josip Djolonga
Mostafa Dehghani
Chen Sun
Mario Lucic
Cordelia Schmid
Anurag Arnab
ViT
235
20
0
24 Apr 2023
MRSN: Multi-Relation Support Network for Video Action Detection
MRSN: Multi-Relation Support Network for Video Action DetectionIEEE International Conference on Multimedia and Expo (ICME), 2023
Yin-Dong Zheng
Guo Chen
Minglei Yuan
Tong Lu
248
9
0
24 Apr 2023
Efficient Video Action Detection with Token Dropout and Context
  Refinement
Efficient Video Action Detection with Token Dropout and Context RefinementIEEE International Conference on Computer Vision (ICCV), 2023
Lei Chen
Zhan Tong
Yibing Song
Gangshan Wu
Limin Wang
289
25
0
17 Apr 2023
Verbs in Action: Improving verb understanding in video-language models
Verbs in Action: Improving verb understanding in video-language modelsIEEE International Conference on Computer Vision (ICCV), 2023
Liliane Momeni
Mathilde Caron
Arsha Nagrani
Andrew Zisserman
Cordelia Schmid
349
87
0
13 Apr 2023
Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action
  Detection
Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection
Wei-Jhe Huang
Jheng-Hsien Yeh
Min-Hung Chen
Gueter Josmy Faure
S. Lai
285
5
0
10 Apr 2023
Boundary-Denoising for Video Activity Localization
Boundary-Denoising for Video Activity LocalizationInternational Conference on Learning Representations (ICLR), 2023
Mengmeng Xu
Mattia Soldan
Jialin Gao
Shuming Liu
Juan-Manuel Perez-Rua
Guohao Li
172
15
0
06 Apr 2023
VicTR: Video-conditioned Text Representations for Activity Recognition
VicTR: Video-conditioned Text Representations for Activity RecognitionComputer Vision and Pattern Recognition (CVPR), 2023
Kumara Kahatapitiya
Anurag Arnab
Arsha Nagrani
Michael S. Ryoo
316
36
0
05 Apr 2023
DOAD: Decoupled One Stage Action Detection Network
DOAD: Decoupled One Stage Action Detection Network
Shuning Chang
Pichao Wang
Fan Wang
Jiashi Feng
Mike Zheng Show
182
6
0
01 Apr 2023
Streaming Video Model
Streaming Video ModelComputer Vision and Pattern Recognition (CVPR), 2023
Yucheng Zhao
Chong Luo
Chuanxin Tang
DongDong Chen
Noel Codella
Zhengjun Zha
225
18
0
30 Mar 2023
CycleACR: Cycle Modeling of Actor-Context Relations for Video Action
  Detection
CycleACR: Cycle Modeling of Actor-Context Relations for Video Action DetectionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Lei Chen
Zhan Tong
Yibing Song
Gangshan Wu
Limin Wang
175
3
0
28 Mar 2023
Open Set Action Recognition via Multi-Label Evidential Learning
Open Set Action Recognition via Multi-Label Evidential LearningComputer Vision and Pattern Recognition (CVPR), 2023
Chen Zhao
Dawei Du
A. Hoogs
Christopher Funk
EDL
155
35
0
27 Feb 2023
YOWOv2: A Stronger yet Efficient Multi-level Detection Framework for
  Real-time Spatio-temporal Action Detection
YOWOv2: A Stronger yet Efficient Multi-level Detection Framework for Real-time Spatio-temporal Action DetectionInternational Conference on Intelligent Robotics and Applications (ICIRA), 2023
Jianhua Yang
Kun Dai
ObjD
223
21
0
14 Feb 2023
Program Generation from Diverse Video Demonstrations
Program Generation from Diverse Video DemonstrationsBritish Machine Vision Conference (BMVC), 2023
Anthony Manchin
Jamie Sherrah
Qi Wu
Anton Van Den Hengel
VGen
71
0
0
01 Feb 2023
Video Semantic Segmentation with Inter-Frame Feature Fusion and
  Inner-Frame Feature Refinement
Video Semantic Segmentation with Inter-Frame Feature Fusion and Inner-Frame Feature Refinement
Jiafan Zhuang
Zilei Wang
Junjie Li
VOS
229
3
0
10 Jan 2023
HierVL: Learning Hierarchical Video-Language Embeddings
HierVL: Learning Hierarchical Video-Language EmbeddingsComputer Vision and Pattern Recognition (CVPR), 2023
Kumar Ashutosh
Rohit Girdhar
Lorenzo Torresani
Kristen Grauman
VLMAI4TS
398
69
0
05 Jan 2023
Deep set conditioned latent representations for action recognition
Deep set conditioned latent representations for action recognitionVISIGRAPP (VISIGRAPP), 2022
Akash Singh
Tom De Schepper
Kevin Mets
P. Hellinckx
José Oramas
Steven Latré
BDL
151
2
0
21 Dec 2022
A Survey on Human Action Recognition
A Survey on Human Action Recognition
Zhou Shuchang
194
0
0
20 Dec 2022
Weakly Supervised Video Anomaly Detection Based on Cross-Batch
  Clustering Guidance
Weakly Supervised Video Anomaly Detection Based on Cross-Batch Clustering GuidanceIEEE International Conference on Multimedia and Expo (ICME), 2022
Congqi Cao
Xin Zhang
Shizhou Zhang
Peng Wang
Yanning Zhang
109
8
0
16 Dec 2022
EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with
  Visual Queries
EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with Visual QueriesIEEE International Conference on Computer Vision (ICCV), 2022
Jinjie Mai
Abdullah Hamdi
Silvio Giancola
Chen Zhao
Guohao Li
EgoV
246
21
0
14 Dec 2022
Ego Vehicle Speed Estimation using 3D Convolution with Masked Attention
Ego Vehicle Speed Estimation using 3D Convolution with Masked Attention
Athul M. Mathew
Thariq Khalid
121
3
0
11 Dec 2022
Exploiting Completeness and Uncertainty of Pseudo Labels for Weakly
  Supervised Video Anomaly Detection
Exploiting Completeness and Uncertainty of Pseudo Labels for Weakly Supervised Video Anomaly DetectionComputer Vision and Pattern Recognition (CVPR), 2022
Chen Zhang
Guorong Li
Yuankai Qi
Shuhui Wang
Laiyun Qing
Qingming Huang
Ming-Hsuan Yang
196
90
0
08 Dec 2022
Spatio-Temporal Crop Aggregation for Video Representation Learning
Spatio-Temporal Crop Aggregation for Video Representation LearningIEEE International Conference on Computer Vision (ICCV), 2022
Sepehr Sameni
Simon Jenni
Paolo Favaro
267
4
0
30 Nov 2022
Re^2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal
  Action Localization
Re^2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action LocalizationComputer Vision and Pattern Recognition (CVPR), 2022
Chen Zhao
Shuming Liu
K. Mangalam
Guohao Li
266
27
0
25 Nov 2022
Multi-Task Learning of Object State Changes from Uncurated Videos
Multi-Task Learning of Object State Changes from Uncurated Videos
Tomávs Souvcek
Jean-Baptiste Alayrac
Antoine Miech
Ivan Laptev
Josef Sivic
189
13
0
24 Nov 2022
Discovering A Variety of Objects in Spatio-Temporal Human-Object
  Interactions
Discovering A Variety of Objects in Spatio-Temporal Human-Object Interactions
Yong-Lu Li
Hongwei Fan
Zuoyu Qiu
Yiming Dou
Liang Xu
...
Peiyang Guo
Haisheng Su
Dongliang Wang
Wei Wu
Cewu Lu
169
7
0
14 Nov 2022
Previous
1234567
Next