ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.05038
  4. Cited By
Long-Term Feature Banks for Detailed Video Understanding

Long-Term Feature Banks for Detailed Video Understanding

12 December 2018
Chao-Yuan Wu
Christoph Feichtenhofer
Haoqi Fan
Kaiming He
Philipp Krahenbuhl
Ross B. Girshick
ArXivPDFHTML

Papers citing "Long-Term Feature Banks for Detailed Video Understanding"

50 / 306 papers shown
Title
Active Speakers in Context
Active Speakers in Context
Juan Carlos León Alcázar
Fabian Caba Heilbron
Long Mai
Federico Perazzi
Joon-Young Lee
Pablo Arbelaez
Bernard Ghanem
19
61
0
20 May 2020
Human in Events: A Large-Scale Benchmark for Human-centric Video
  Analysis in Complex Events
Human in Events: A Large-Scale Benchmark for Human-centric Video Analysis in Complex Events
Weiyao Lin
Huabin Liu
Shizhan Liu
Yuxi Li
Rui Qian
Tao Wang
Ning Xu
H. Xiong
Guojun Qi
N. Sebe
20
14
0
09 May 2020
Condensed Movies: Story Based Retrieval with Contextual Embeddings
Condensed Movies: Story Based Retrieval with Contextual Embeddings
Max Bain
Arsha Nagrani
A. Brown
Andrew Zisserman
28
100
0
08 May 2020
Cross-media Structured Common Space for Multimedia Event Extraction
Cross-media Structured Common Space for Multimedia Event Extraction
Manling Li
Alireza Zareian
Qi Zeng
Spencer Whitehead
Di Lu
Heng Ji
Shih-Fu Chang
10
102
0
05 May 2020
Rolling-Unrolling LSTMs for Action Anticipation from First-Person Video
Rolling-Unrolling LSTMs for Action Anticipation from First-Person Video
Antonino Furnari
G. Farinella
EgoV
10
139
0
04 May 2020
Asynchronous Interaction Aggregation for Action Detection
Asynchronous Interaction Aggregation for Action Detection
Jiajun Tang
Jinchao Xia
Xinzhi Mu
Bo Pang
Cewu Lu
20
119
0
16 Apr 2020
X3D: Expanding Architectures for Efficient Video Recognition
X3D: Expanding Architectures for Efficient Video Recognition
Christoph Feichtenhofer
66
998
0
09 Apr 2020
Knowing What, Where and When to Look: Efficient Video Action Modeling
  with Attention
Knowing What, Where and When to Look: Efficient Video Action Modeling with Attention
Juan-Manuel Perez-Rua
Brais Martínez
Xiatian Zhu
Antoine Toisoul
Victor Escorcia
Tao Xiang
32
19
0
02 Apr 2020
PaStaNet: Toward Human Activity Knowledge Engine
PaStaNet: Toward Human Activity Knowledge Engine
Yong-Lu Li
Liang Xu
Xinpeng Liu
Xijie Huang
Yue Xu
Shiyi Wang
Haoshu Fang
Ze Ma
Mingyang Chen
Cewu Lu
12
151
0
02 Apr 2020
Long Short-Term Relation Networks for Video Action Detection
Long Short-Term Relation Networks for Video Action Detection
Dong Li
Ting Yao
Zhaofan Qiu
Houqiang Li
Tao Mei
12
22
0
31 Mar 2020
Spatio-Temporal Graph for Video Captioning with Knowledge Distillation
Spatio-Temporal Graph for Video Captioning with Knowledge Distillation
Boxiao Pan
Haoye Cai
De-An Huang
Kuan-Hui Lee
Adrien Gaidon
Ehsan Adeli
Juan Carlos Niebles
23
235
0
31 Mar 2020
RetinaTrack: Online Single Stage Joint Detection and Tracking
RetinaTrack: Online Single Stage Joint Detection and Tracking
Zhichao Lu
V. Rathod
Ronny Votel
Jonathan Huang
VOT
28
188
0
30 Mar 2020
Learning Interactions and Relationships between Movie Characters
Learning Interactions and Relationships between Movie Characters
Anna Kukleva
Makarand Tapaswi
Ivan Laptev
36
51
0
29 Mar 2020
Memory Enhanced Global-Local Aggregation for Video Object Detection
Memory Enhanced Global-Local Aggregation for Video Object Detection
Yihong Chen
Yue Cao
Han Hu
Liwei Wang
112
261
0
26 Mar 2020
Temporal Extension Module for Skeleton-Based Action Recognition
Temporal Extension Module for Skeleton-Based Action Recognition
Yuya Obinata
Takuma Yamamoto
22
34
0
19 Mar 2020
PIC: Permutation Invariant Convolution for Recognizing Long-range
  Activities
PIC: Permutation Invariant Convolution for Recognizing Long-range Activities
Noureldien Hussein
E. Gavves
A. Smeulders
VLM
18
13
0
18 Mar 2020
Beyond the Camera: Neural Networks in World Coordinates
Beyond the Camera: Neural Networks in World Coordinates
Gunnar A. Sigurdsson
Abhinav Gupta
Cordelia Schmid
Alahari Karteek
6
2
0
12 Mar 2020
Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid
  Network
Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network
Jialin Gao
Zhixiang Shi
Jiani Li
Guanshuo Wang
Yufeng Yuan
Shiming Ge
Xiaoping Zhou
6
73
0
09 Mar 2020
Hierarchical Conditional Relation Networks for Video Question Answering
Hierarchical Conditional Relation Networks for Video Question Answering
T. Le
Vuong Le
Svetha Venkatesh
T. Tran
14
258
0
25 Feb 2020
MAST: A Memory-Augmented Self-supervised Tracker
MAST: A Memory-Augmented Self-supervised Tracker
Zihang Lai
Erika Lu
Weidi Xie
VOS
16
184
0
18 Feb 2020
Symbiotic Attention with Privileged Information for Egocentric Action
  Recognition
Symbiotic Attention with Privileged Information for Egocentric Action Recognition
Xiaohan Wang
Yu Wu
Linchao Zhu
Yi Yang
22
63
0
08 Feb 2020
Audiovisual SlowFast Networks for Video Recognition
Audiovisual SlowFast Networks for Video Recognition
Fanyi Xiao
Yong Jae Lee
Kristen Grauman
Jitendra Malik
Christoph Feichtenhofer
194
205
0
23 Jan 2020
A Comprehensive Study on Temporal Modeling for Online Action Detection
A Comprehensive Study on Temporal Modeling for Online Action Detection
Wen Wang
Xiaojiang Peng
Yu Qiao
Jian Cheng
24
2
0
21 Jan 2020
Self-supervising Action Recognition by Statistical Moment and Subspace
  Descriptors
Self-supervising Action Recognition by Statistical Moment and Subspace Descriptors
Lei Wang
Piotr Koniusz
24
50
0
14 Jan 2020
EGO-TOPO: Environment Affordances from Egocentric Video
EGO-TOPO: Environment Affordances from Egocentric Video
Tushar Nagarajan
Yanghao Li
Christoph Feichtenhofer
Kristen Grauman
EgoV
17
123
0
14 Jan 2020
Something-Else: Compositional Action Recognition with Spatial-Temporal
  Interaction Networks
Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks
Joanna Materzynska
Tete Xiao
Roei Herzig
Huijuan Xu
Xiaolong Wang
Trevor Darrell
CoGe
16
173
0
20 Dec 2019
Action Genome: Actions as Composition of Spatio-temporal Scene Graphs
Action Genome: Actions as Composition of Spatio-temporal Scene Graphs
Jingwei Ji
Ranjay Krishna
Li Fei-Fei
Juan Carlos Niebles
39
335
0
15 Dec 2019
Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action
  Recognition
Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition
Jinwoo Choi
Chen Gao
Joseph C.E. Messou
Jia-Bin Huang
16
177
0
11 Dec 2019
Listen to Look: Action Recognition by Previewing Audio
Listen to Look: Action Recognition by Previewing Audio
Ruohan Gao
Tae-Hyun Oh
Kristen Grauman
Lorenzo Torresani
VLM
27
251
0
10 Dec 2019
Video action detection by learning graph-based spatio-temporal
  interactions
Video action detection by learning graph-based spatio-temporal interactions
Matteo Tomei
Lorenzo Baraldi
Simone Calderara
Simone Bronzin
Rita Cucchiara
16
9
0
09 Dec 2019
VideoDG: Generalizing Temporal Relations in Videos to Novel Domains
VideoDG: Generalizing Temporal Relations in Videos to Novel Domains
Zhiyu Yao
Yunbo Wang
Jianmin Wang
Philip S. Yu
Mingsheng Long
OOD
ViT
24
23
0
08 Dec 2019
Context R-CNN: Long Term Temporal Context for Per-Camera Object
  Detection
Context R-CNN: Long Term Temporal Context for Per-Camera Object Detection
Sara Beery
Guanhang Wu
V. Rathod
Ronny Votel
Jonathan Huang
ObjD
14
112
0
07 Dec 2019
A Multigrid Method for Efficiently Training Video Models
A Multigrid Method for Efficiently Training Video Models
Chaoxia Wu
Ross B. Girshick
Kaiming He
Christoph Feichtenhofer
Philipp Krahenbuhl
16
94
0
02 Dec 2019
Gate-Shift Networks for Video Action Recognition
Gate-Shift Networks for Video Action Recognition
Swathikiran Sudhakaran
Sergio Escalera
O. Lanz
3DPC
6
155
0
01 Dec 2019
Zero-Shot Imitating Collaborative Manipulation Plans from YouTube
  Cooking Videos
Zero-Shot Imitating Collaborative Manipulation Plans from YouTube Cooking Videos
Hejia Zhang
Jie Zhong
S. Nikolaidis
LM&Ro
106
1
0
25 Nov 2019
Multi-Label Classification with Label Graph Superimposing
Multi-Label Classification with Label Graph Superimposing
Ya Wang
Dongliang He
Fu Li
Xiang Long
Zhichao Zhou
Jinwen Ma
Shilei Wen
12
165
0
21 Nov 2019
You Only Watch Once: A Unified CNN Architecture for Real-Time
  Spatiotemporal Action Localization
You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization
Okan Kopuklu
Xiangyu Wei
Gerhard Rigoll
20
143
0
15 Nov 2019
Combinational Class Activation Maps for Weakly Supervised Object
  Localization
Combinational Class Activation Maps for Weakly Supervised Object Localization
Seunghan Yang
Yoonhyung Kim
Youngeun Kim
Changick Kim
WSOL
6
77
0
12 Oct 2019
Grouped Spatial-Temporal Aggregation for Efficient Action Recognition
Grouped Spatial-Temporal Aggregation for Efficient Action Recognition
Chenxu Luo
Alan Yuille
122
150
0
28 Sep 2019
Multitask Learning to Improve Egocentric Action Recognition
Multitask Learning to Improve Egocentric Action Recognition
G. Kapidis
R. Poppe
E. V. Dam
L. Noldus
R. Veltkamp
EgoV
14
36
0
15 Sep 2019
Three Branches: Detecting Actions With Richer Features
Three Branches: Detecting Actions With Richer Features
Jinchao Xia
Jiajun Tang
Cewu Lu
14
8
0
13 Aug 2019
An Evaluation of Action Recognition Models on EPIC-Kitchens
An Evaluation of Action Recognition Models on EPIC-Kitchens
Will Price
Dima Damen
EgoV
6
13
0
02 Aug 2019
Learning to Adapt Invariance in Memory for Person Re-identification
Learning to Adapt Invariance in Memory for Person Re-identification
Zhun Zhong
Liang Zheng
Zhiming Luo
Shaozi Li
Yezhou Yang
OOD
4
176
0
01 Aug 2019
Submission to ActivityNet Challenge 2019: Task B Spatio-temporal Action
  Localization
Submission to ActivityNet Challenge 2019: Task B Spatio-temporal Action Localization
Chunfei Ma
Joonhyang Choi
Byeongwon Lee
Seungji Yang
9
0
0
25 Jul 2019
Domain-Specific Priors and Meta Learning for Few-Shot First-Person
  Action Recognition
Domain-Specific Priors and Meta Learning for Few-Shot First-Person Action Recognition
Huseyin Coskun
Zeeshan Zia
Bugra Tekin
Federica Bogo
Nassir Navab
Federico Tombari
H. Sawhney
14
27
0
22 Jul 2019
Deformable Tube Network for Action Detection in Videos
Deformable Tube Network for Action Detection in Videos
Wei Li
Zehuan Yuan
Dashan Guo
Lei Huang
Xiangzhong Fang
Changhu Wang
ViT
MedIm
12
5
0
03 Jul 2019
Baidu-UTS Submission to the EPIC-Kitchens Action Recognition Challenge
  2019
Baidu-UTS Submission to the EPIC-Kitchens Action Recognition Challenge 2019
Xiaohan Wang
Yu Wu
Linchao Zhu
Yi Yang
14
19
0
22 Jun 2019
Learning Video Representations using Contrastive Bidirectional
  Transformer
Learning Video Representations using Contrastive Bidirectional Transformer
Chen Sun
Fabien Baradel
Kevin Patrick Murphy
Cordelia Schmid
SSL
ViT
13
133
0
13 Jun 2019
AssembleNet: Searching for Multi-Stream Neural Connectivity in Video
  Architectures
AssembleNet: Searching for Multi-Stream Neural Connectivity in Video Architectures
Michael S. Ryoo
A. Piergiovanni
Mingxing Tan
A. Angelova
14
102
0
30 May 2019
VideoGraph: Recognizing Minutes-Long Human Activities in Videos
VideoGraph: Recognizing Minutes-Long Human Activities in Videos
Noureldien Hussein
E. Gavves
A. Smeulders
12
77
0
13 May 2019
Previous
1234567
Next