2403.15245
Reasoning-Enhanced Object-Centric Learning for Videos
Knowledge Discovery and Data Mining (KDD), 2024
22 March 2024
Jian Li
Pu Ren
Yang Liu
Hao Sun
OCL
LRM
Papers citing
"Reasoning-Enhanced Object-Centric Learning for Videos"
48 / 48 papers shown
Title
Smoothing Slot Attention Iterations and Recurrences
Rongzhen Zhao
Wenyan Yang
Juho Kannala
Joni Pajarinen
AI4TS
113
1
0
07 Aug 2025
Predicting Video Slot Attention Queries from Random Slot-Feature Pairs
Rongzhen Zhao
Jian Li
Juho Kannala
Joni Pajarinen
185
1
0
02 Aug 2025
Slot-VAE: Object-Centric Scene Generation with Slot Attention
International Conference on Machine Learning (ICML), 2023
Yanbo Wang
Letao Liu
Justin Dauwels
OCL
BDL
DRL
213
22
0
12 Jun 2023
Intrinsic Physical Concepts Discovery with Object-Centric Predictive Models
Computer Vision and Pattern Recognition (CVPR), 2023
Qu Tang
Xiangyu Zhu
Zhen Lei
Zhaoxiang Zhang
OCL
209
8
0
03 Mar 2023
Object-Centric Video Prediction via Decoupling of Object Dynamics and Interactions
IEEE International Conference on Image Processing (ICIP), 2023
Angel Villar-Corrales
Ismail Wahdan
Sven Behnke
OCL
286
13
0
23 Feb 2023
SlotFormer: Unsupervised Visual Dynamics Simulation with Object-Centric Models
International Conference on Learning Representations (ICLR), 2022
Ziyi Wu
Nikita Dvornik
Klaus Greff
Thomas Kipf
Animesh Garg
OCL
BDL
298
112
0
12 Oct 2022
Is an Object-Centric Video Representation Beneficial for Transfer?
Asian Conference on Computer Vision (ACCV), 2022
Chuhan Zhang
Ankush Gupta
Andrew Zisserman
ViT
274
30
0
20 Jul 2022
Segmenting Moving Objects via an Object-Centric Layered Representation
Neural Information Processing Systems (NeurIPS), 2022
Jun Xie
Weidi Xie
Andrew Zisserman
VOS
OCL
219
61
0
05 Jul 2022
SAVi++: Towards End-to-End Object-Centric Learning from Real-World Videos
Neural Information Processing Systems (NeurIPS), 2022
Gamaleldin F. Elsayed
Aravindh Mahendran
Sjoerd van Steenkiste
Klaus Greff
Michael C. Mozer
Thomas Kipf
VOS
OCL
313
167
0
15 Jun 2022
Simple Unsupervised Object-Centric Learning for Complex and Naturalistic Videos
Neural Information Processing Systems (NeurIPS), 2022
Gautam Singh
Yi-Fu Wu
Sungjin Ahn
OCL
347
145
0
27 May 2022
Visuomotor Control in Multi-Object Scenes Using Object-Aware Representations
IEEE International Conference on Robotics and Automation (ICRA), 2022
Negin Heravi
Ayzaan Wahid
Corey Lynch
Peter R. Florence
Travis Armstrong
Jonathan Tompson
P. Sermanet
Jeannette Bohg
Debidatta Dwibedi
SSL
203
21
0
12 May 2022
Visual Attention Methods in Deep Learning: An In-Depth Survey
Information Fusion (Inf. Fusion), 2022
Mohammed Hassanin
Saeed Anwar
Ibrahim Radwan
Fahad Shahbaz Khan
Lin Wang
253
229
0
16 Apr 2022
When Physics Meets Machine Learning: A Survey of Physics-Informed Machine Learning
Chuizheng Meng
Sungyong Seo
Defu Cao
Sam Griesemer
Yan Liu
PINN
AI4CE
250
106
0
31 Mar 2022
Kubric: A scalable dataset generator
Computer Vision and Pattern Recognition (CVPR), 2022
Klaus Greff
Francois Belletti
Lucas Beyer
Carl Doersch
Yilun Du
...
Ziyu Wang
Tianhao Wu
K. M. Yi
Fangcheng Zhong
Andrea Tagliasacchi
220
342
0
07 Mar 2022
Learning Multi-Object Dynamics with Compositional Neural Radiance Fields
Conference on Robot Learning (CoRL), 2022
Danny Driess
Zhiao Huang
Yunzhu Li
Russ Tedrake
Marc Toussaint
OCL
AI4CE
354
90
0
24 Feb 2022
Conditional Object-Centric Learning from Video
Thomas Kipf
Gamaleldin F. Elsayed
Aravindh Mahendran
Austin Stone
S. Sabour
G. Heigold
Rico Jonschkowski
Alexey Dosovitskiy
Klaus Greff
OCL
303
252
0
24 Nov 2021
Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language
Neural Information Processing Systems (NeurIPS), 2021
Mingyu Ding
Zhenfang Chen
Tao Du
Ping Luo
J. Tenenbaum
Chuang Gan
VGen
PINN
OCL
186
79
0
28 Oct 2021
Illiterate DALL-E Learns to Compose
Gautam Singh
Fei Deng
Sungjin Ahn
CoGe
OCL
302
161
0
17 Oct 2021
Generalization and Robustness Implications in Object-Centric Learning
Andrea Dittadi
Samuele Papa
Michele De Vita
Bernhard Schölkopf
Ole Winther
Francesco Locatello
OCL
OOD
250
80
0
01 Jul 2021
SIMONe: View-Invariant, Temporally-Abstracted Object Representations via Unsupervised Video Decomposition
Neural Information Processing Systems (NeurIPS), 2021
Rishabh Kabra
Daniel Zoran
Goker Erdogan
Loic Matthey
Antonia Creswell
M. Botvinick
Alexander Lerchner
Christopher P. Burgess
OCL
252
85
0
07 Jun 2021
Self-supervised Video Object Segmentation by Motion Grouping
IEEE International Conference on Computer Vision (ICCV), 2021
Charig Yang
Hala Lamdouar
Erika Lu
Andrew Zisserman
Weidi Xie
VOS
OCL
186
180
0
15 Apr 2021
Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning
International Conference on Learning Representations (ICLR), 2021
Zhenfang Chen
Jiayuan Mao
Jiajun Wu
Kwan-Yee K. Wong
J. Tenenbaum
Chuang Gan
VGen
192
100
0
30 Mar 2021
Is Space-Time Attention All You Need for Video Understanding?
International Conference on Machine Learning (ICML), 2021
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
874
2,552
0
09 Feb 2021
STAN: Spatio-Temporal Attention Network for Next Location Recommendation
The Web Conference (WWW), 2021
Yingtao Luo
Qiang Liu
Zhaocheng Liu
HAI
AI4TS
160
371
0
08 Feb 2021
Slot Self-Attentive Dialogue State Tracking
The Web Conference (WWW), 2021
Fanghua Ye
Jarana Manotumruksa
Qiang Zhang
Shenghui Li
Emine Yilmaz
278
65
0
22 Jan 2021
Improving Generative Imagination in Object-Centric World Models
International Conference on Machine Learning (ICML), 2020
Zhixuan Lin
Yi-Fu Wu
Skand Peri
Bofeng Fu
Jindong Jiang
Sungjin Ahn
OCL
191
90
0
05 Oct 2020
Unsupervised object-centric video generation and decomposition in 3D
Neural Information Processing Systems (NeurIPS), 2020
Paul Henderson
Christoph H. Lampert
OCL
231
37
0
07 Jul 2020
Object-Centric Learning with Slot Attention
Francesco Locatello
Dirk Weissenborn
Thomas Unterthiner
Aravindh Mahendran
G. Heigold
Jakob Uszkoreit
Alexey Dosovitskiy
Thomas Kipf
OCL
467
988
0
26 Jun 2020
Learning Physical Graph Representations from Visual Scenes
Daniel M. Bear
Chaofei Fan
Damian Mrowca
Yunzhu Li
S. Alter
...
Jeremy Schwartz
Li Fei-Fei
Jiajun Wu
J. Tenenbaum
Daniel L. K. Yamins
SSL
GNN
SSeg
AI4CE
208
85
0
22 Jun 2020
End-to-End Slot Alignment and Recognition for Cross-Lingual NLU
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Weijia Xu
Batool Haider
Saab Mansour
204
162
0
29 Apr 2020
MA-DST: Multi-Attention Based Scalable Dialog State Tracking
AAAI Conference on Artificial Intelligence (AAAI), 2020
Adarsh Kumar
Peter Ku
Anuj Kumar Goyal
A. Metallinou
Dilek Z. Hakkani-Tür
148
61
0
07 Feb 2020
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Neural Information Processing Systems (NeurIPS), 2019
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
...
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
ODL
936
47,917
0
03 Dec 2019
Contrastive Learning of Structured World Models
International Conference on Learning Representations (ICLR), 2019
Thomas Kipf
Elise van der Pol
Max Welling
OCL
DRL
330
301
0
27 Nov 2019
Entity Abstraction in Visual Model-Based Reinforcement Learning
Conference on Robot Learning (CoRL), 2019
Rishi Veerapaneni
John D. Co-Reyes
Michael Chang
Michael Janner
Chelsea Finn
Jiajun Wu
J. Tenenbaum
Sergey Levine
OCL
OffRL
393
196
0
28 Oct 2019
R-SQAIR: Relational Sequential Attend, Infer, Repeat
Aleksandar Stanić
Jürgen Schmidhuber
156
31
0
11 Oct 2019
SCALOR: Generative World Models with Scalable Object Representations
International Conference on Learning Representations (ICLR), 2019
Jindong Jiang
Sepehr Janghorbani
Gerard de Melo
Sungjin Ahn
OCL
DRL
367
143
0
06 Oct 2019
MONet: Unsupervised Scene Decomposition and Representation
Christopher P. Burgess
Loic Matthey
Nicholas Watters
Rishabh Kabra
I. Higgins
M. Botvinick
Alexander Lerchner
OCL
274
565
0
22 Jan 2019
Spatial Broadcast Decoder: A Simple Architecture for Learning Disentangled Representations in VAEs
Nicholas Watters
Loic Matthey
Christopher P. Burgess
Alexander Lerchner
CoGe
335
181
0
21 Jan 2019
Reasoning About Physical Interactions with Object-Oriented Prediction and Planning
Michael Janner
Sergey Levine
William T. Freeman
J. Tenenbaum
Chelsea Finn
Jiajun Wu
OCL
210
133
0
28 Dec 2018
Sequential Attend, Infer, Repeat: Generative Modelling of Moving Objects
Adam R. Kosiorek
Hyunjik Kim
Ingmar Posner
Yee Whye Teh
BDL
287
265
0
05 Jun 2018
The Unreasonable Effectiveness of Deep Features as a Perceptual Metric
Richard Y. Zhang
Phillip Isola
Alexei A. Efros
Eli Shechtman
Oliver Wang
EGVM
1.0K
14,846
0
11 Jan 2018
Attention Is All You Need
Neural Information Processing Systems (NeurIPS), 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
2.4K
157,232
0
12 Jun 2017
Visual Interaction Networks
Neural Information Processing Systems (NeurIPS), 2017
Nicholas Watters
Andrea Tacchetti
T. Weber
Razvan Pascanu
Peter W. Battaglia
Daniel Zoran
PINN
3DH
235
287
0
05 Jun 2017
A Compositional Object-Based Approach to Learning Physical Dynamics
Michael Chang
T. Ullman
Antonio Torralba
J. Tenenbaum
AI4CE
OCL
534
451
0
01 Dec 2016
Interaction Networks for Learning about Objects, Relations and Physics
Peter W. Battaglia
Razvan Pascanu
Matthew Lai
Danilo Jimenez Rezende
Koray Kavukcuoglu
AI4CE
OCL
PINN
GNN
870
1,474
0
01 Dec 2016
SGDR: Stochastic Gradient Descent with Warm Restarts
I. Loshchilov
Frank Hutter
ODL
844
9,396
0
13 Aug 2016
Denoising Criterion for Variational Auto-Encoding Framework
Daniel Jiwoong Im
Sungjin Ahn
Roland Memisevic
Yoshua Bengio
197
209
0
19 Nov 2015
Adam: A Method for Stochastic Optimization
International Conference on Learning Representations (ICLR), 2014
Diederik P. Kingma
Jimmy Ba
ODL
4.4K
160,138
0
22 Dec 2014