Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2207.10075
Cited By
Is an Object-Centric Video Representation Beneficial for Transfer?
20 July 2022
Chuhan Zhang
Ankush Gupta
Andrew Zisserman
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Is an Object-Centric Video Representation Beneficial for Transfer?"
25 / 25 papers shown
Title
Memory-efficient Streaming VideoLLMs for Real-time Procedural Video Understanding
Dibyadip Chatterjee
Edoardo Remelli
Yale Song
Bugra Tekin
Abhay Mittal
...
Shreyas Hampali
Eric Sauser
Shugao Ma
Angela Yao
Fadime Sener
VLM
30
0
0
10 Apr 2025
PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and Planning
Angel Villar-Corrales
Sven Behnke
67
2
0
11 Feb 2025
SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining
Ruiqi Xian
Xiyang Wu
Tianrui Guan
Xijun Wang
Boqing Gong
Dinesh Manocha
ViT
19
0
0
26 Sep 2024
Rethinking Image-to-Video Adaptation: An Object-centric Perspective
Rui Qian
Shuangrui Ding
Dahua Lin
OCL
36
1
0
09 Jul 2024
Simultaneous Detection and Interaction Reasoning for Object-Centric Action Recognition
Xunsong Li
Pengzhan Sun
Yangcen Liu
Lixin Duan
Wen Li
27
1
0
18 Apr 2024
Reasoning-Enhanced Object-Centric Learning for Videos
Jian Li
Pu Ren
Yang Liu
Hao-Lun Sun
OCL
LRM
28
2
0
22 Mar 2024
GPT4Ego: Unleashing the Potential of Pre-trained Models for Zero-Shot Egocentric Action Recognition
Guangzhao Dai
Xiangbo Shu
Wenhao Wu
Rui Yan
Jiachao Zhang
VLM
11
5
0
18 Jan 2024
TACO: Benchmarking Generalizable Bimanual Tool-ACtion-Object Understanding
Yun-Hai Liu
Haolin Yang
Xu Si
Ling Liu
Zipeng Li
Yuxiang Zhang
Yebin Liu
Li Yi
52
22
0
16 Jan 2024
ST(OR)2: Spatio-Temporal Object Level Reasoning for Activity Recognition in the Operating Room
Idris Hamoud
Muhammad Abdullah Jamal
V. Srivastav
Didier Mutter
N. Padoy
Omid Mohareri
11
2
0
19 Dec 2023
Encoding Surgical Videos as Latent Spatiotemporal Graphs for Object and Anatomy-Driven Reasoning
Aditya Murali
Deepak Alapatt
Pietro Mascagni
Armine Vardazaryan
Alain Garcia
Nariaki Okamoto
Didier Mutter
N. Padoy
MedIm
20
4
0
11 Dec 2023
Object-centric Video Representation for Long-term Action Anticipation
Ce Zhang
Changcheng Fu
Shijie Wang
Nakul Agarwal
Kwonjoon Lee
Chiho Choi
Chen Sun
10
14
0
31 Oct 2023
RobustCLEVR: A Benchmark and Framework for Evaluating Robustness in Object-centric Learning
Nathan G. Drenkow
Mathias Unberath
16
3
0
28 Aug 2023
MOFO: MOtion FOcused Self-Supervision for Video Understanding
Mona Ahmadian
Frank Guerin
Andrew Gilbert
10
2
0
23 Aug 2023
Opening the Vocabulary of Egocentric Actions
Dibyadip Chatterjee
Fadime Sener
Shugao Ma
Angela Yao
VLM
14
6
0
22 Aug 2023
Helping Hands: An Object-Aware Ego-Centric Video Recognition Model
Chuhan Zhang
Ankush Gupta
Andrew Zisserman
VLM
10
19
0
15 Aug 2023
Does Visual Pretraining Help End-to-End Reasoning?
Chen Sun
Calvin Luo
Xingyi Zhou
Anurag Arnab
Cordelia Schmid
OCL
LRM
ViT
20
3
0
17 Jul 2023
Multimodal Distillation for Egocentric Action Recognition
Gorjan Radevski
Dusan Grujicic
Marie-Francine Moens
Matthew Blaschko
Tinne Tuytelaars
EgoV
13
22
0
14 Jul 2023
How can objects help action recognition?
Xingyi Zhou
Anurag Arnab
Chen Sun
Cordelia Schmid
19
14
0
20 Jun 2023
Modelling Spatio-Temporal Interactions for Compositional Action Recognition
Ramanathan Rajendiran
Debaditya Roy
Basura Fernando
29
1
0
04 May 2023
Interaction Region Visual Transformer for Egocentric Action Anticipation
Debaditya Roy
Ramanathan Rajendiran
Basura Fernando
28
15
0
25 Nov 2022
Students taught by multimodal teachers are superior action recognizers
Gorjan Radevski
Dusan Grujicic
Matthew Blaschko
Marie-Francine Moens
Tinne Tuytelaars
11
1
0
09 Oct 2022
ByteTrack: Multi-Object Tracking by Associating Every Detection Box
Yifu Zhang
Pei Sun
Yi-Xin Jiang
Dongdong Yu
Fucheng Weng
Zehuan Yuan
Ping Luo
Wenyu Liu
Xinggang Wang
VOT
94
1,289
0
13 Oct 2021
Graph-Based Global Reasoning Networks
Yunpeng Chen
Marcus Rohrbach
Zhicheng Yan
Shuicheng Yan
Jiashi Feng
Yannis Kalantidis
GNN
NAI
244
453
0
30 Nov 2018
Image Generation from Scene Graphs
Justin Johnson
Agrim Gupta
Li Fei-Fei
GNN
208
809
0
04 Apr 2018
Interaction Networks for Learning about Objects, Relations and Physics
Peter W. Battaglia
Razvan Pascanu
Matthew Lai
Danilo Jimenez Rezende
Koray Kavukcuoglu
AI4CE
OCL
PINN
GNN
252
1,394
0
01 Dec 2016
1