2206.06922
Cited By
Object Scene Representation Transformer
14 June 2022
Mehdi S. M. Sajjadi
Daniel Duckworth
Aravindh Mahendran
Sjoerd van Steenkiste
Filip Pavetić
Mario Lučić
Leonidas J. Guibas
Klaus Greff
Thomas Kipf
ViT
OCL
Papers citing
"Object Scene Representation Transformer"
50 / 71 papers shown
Title
Object-Centric Pretraining via Target Encoder Bootstrapping
Nikola Đukić
Tim Lebailly
Tinne Tuytelaars
OCL
63
0
0
19 Mar 2025
Dynamic Scene Understanding through Object-Centric Voxelization and Neural Rendering
Yanpeng Zhao
Yiwei Hao
Siyu Gao
Yunbo Wang
Xiaokang Yang
OCL
111
1
0
17 Feb 2025
Slot-BERT: Self-supervised Object Discovery in Surgical Video
Guiqiu Liao
M. Jogan
Marcel Hussing
Kenta Nakahashi
Kazuhiro Yasufuku
Amin Madani
Eric Eaton
Daniel A. Hashimoto
41
0
0
21 Jan 2025
Moving Off-the-Grid: Scene-Grounded Video Representations
Sjoerd van Steenkiste
Daniel Zoran
Yi Yang
Yulia Rubanova
Rishabh Kabra
...
Thomas Keck
João Carreira
Alexey Dosovitskiy
Mehdi S. M. Sajjadi
Thomas Kipf
26
3
0
08 Nov 2024
Large Spatial Model: End-to-end Unposed Images to Semantic 3D
Zhiwen Fan
Jian Zhang
Wenyan Cong
Peihao Wang
Renjie Li
...
Z. Wang
Danfei Xu
B. Ivanovic
Marco Pavone
Yue Wang
3DV
36
11
0
24 Oct 2024
Learning Global Object-Centric Representations via Disentangled Slot Attention
Tonglin Chen
Yinxuan Huang
Zhimeng Shen
Jinghao Huang
Bin Li
Xiangyang Xue
OCL
22
0
0
24 Oct 2024
Zero-Shot Object-Centric Representation Learning
Aniket Didolkar
Andrii Zadaianchuk
Anirudh Goyal
Mike Mozer
Yoshua Bengio
Georg Martius
Maximilian Seitzer
VLM
OCL
26
4
0
17 Aug 2024
SlotLifter: Slot-guided Feature Lifting for Learning Object-centric Radiance Fields
Yu Liu
Baoxiong Jia
Yixin Chen
Siyuan Huang
OCL
31
4
0
13 Aug 2024
Exploring the Effectiveness of Object-Centric Representations in Visual Question Answering: Comparative Insights with Foundation Models
Amir Mohammad Karimi Mamaghan
Samuele Papa
Karl Henrik Johansson
Stefan Bauer
Andrea Dittadi
OCL
35
5
0
22 Jul 2024
Tokenize the World into Object-level Knowledge to Address Long-tail Events in Autonomous Driving
Ran Tian
Boyi Li
Xinshuo Weng
Yuxiao Chen
Edward Schmerling
Yue Wang
B. Ivanovic
Marco Pavone
26
13
0
01 Jul 2024
Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models
Ziyi Wu
Yulia Rubanova
Rishabh Kabra
Drew A. Hudson
Igor Gilitschenski
Yusuf Aytar
Sjoerd van Steenkiste
Kelsey R. Allen
Thomas Kipf
VGen
DiffM
31
10
0
13 Jun 2024
Recurrent Complex-Weighted Autoencoders for Unsupervised Object Discovery
Anand Gopalakrishnan
Aleksandar Stanić
Jürgen Schmidhuber
M. C. Mozer
37
5
0
27 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
60
38
0
23 May 2024
Learning Planning Abstractions from Language
Weiyu Liu
Geng Chen
Joy Hsu
Jiayuan Mao
Jiajun Wu
PINN
25
0
0
06 May 2024
Learning to Compose: Improving Object Centric Learning by Injecting Compositionality
Whie Jung
Jaehoon Yoo
Sungjin Ahn
Seunghoon Hong
OCL
CoGe
24
4
0
01 May 2024
FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features
Andre Rochow
Max Schwarz
Sven Behnke
ViT
33
6
0
15 Apr 2024
On permutation-invariant neural networks
Masanari Kimura
Ryotaro Shimizu
Yuki Hirakawa
Ryosuke Goto
Yuki Saito
OOD
AAML
30
7
0
26 Mar 2024
Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning
Rao Fu
Jingyu Liu
Xilun Chen
Yixin Nie
Wenhan Xiong
LM&Ro
LRM
41
47
0
18 Mar 2024
Attentive Illumination Decomposition Model for Multi-Illuminant White Balancing
Dongyoung Kim
Jinwoo Kim
Junsang Yu
Seon Joo Kim
34
5
0
28 Feb 2024
Parallelized Spatiotemporal Binding
Gautam Singh
Yue Wang
Jiawei Yang
B. Ivanovic
Sungjin Ahn
Marco Pavone
Tong Che
36
1
0
26 Feb 2024
Disentangled 3D Scene Generation with Layout Learning
Dave Epstein
Ben Poole
B. Mildenhall
Alexei A. Efros
Aleksander Holynski
CoGe
OCL
3DV
32
20
0
26 Feb 2024
Unsupervised Discovery of Object-Centric Neural Fields
Rundong Luo
Hong-Xing Yu
Jiajun Wu
3DPC
OCL
85
3
0
12 Feb 2024
Binding Dynamics in Rotating Features
Sindy Lowe
Francesco Locatello
Max Welling
OCL
15
1
0
08 Feb 2024
ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis
Bernard Spiegl
Andrea Perin
Stéphane Deny
Alexander Ilin
DiffM
11
2
0
05 Feb 2024
Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends in Multimodal Reasoning
Yiqi Wang
Wentao Chen
Xiaotian Han
Xudong Lin
Haiteng Zhao
Yongfei Liu
Bohan Zhai
Jianbo Yuan
Quanzeng You
Hongxia Yang
LRM
33
66
0
10 Jan 2024
Slot-guided Volumetric Object Radiance Fields
Di Qi
Tong Yang
Xiangyu Zhang
OCL
16
2
0
04 Jan 2024
NViST: In the Wild New View Synthesis from a Single Image with Transformers
Wonbong Jang
Lourdes Agapito
ViT
19
9
0
13 Dec 2023
Inferring Hybrid Neural Fluid Fields from Videos
Hong-Xing Yu
Yang Zheng
Yuan Gao
Yitong Deng
Bo Zhu
Jiajun Wu
3DH
22
15
0
11 Dec 2023
EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion
Zehuan Huang
Hao Wen
Junting Dong
Yaohui Wang
Yangguang Li
...
Yan-Pei Cao
Ding Liang
Yu Qiao
Bo Dai
Lu Sheng
14
10
0
11 Dec 2023
Free3D: Consistent Novel View Synthesis without 3D Representation
Chuanxia Zheng
Andrea Vedaldi
3DV
35
48
0
07 Dec 2023
Action-slot: Visual Action-centric Representations for Multi-label Atomic Activity Recognition in Traffic Scenes
Chi-Hsi Kung
Shu-Wei Lu
Yi-Hsuan Tsai
Yi-Ting Chen
23
6
0
29 Nov 2023
GTA: A Geometry-Aware Attention Mechanism for Multi-View Transformers
Takeru Miyato
Bernhard Jaeger
Max Welling
Andreas Geiger
ViT
25
14
0
16 Oct 2023
Leveraging Image Augmentation for Object Manipulation: Towards Interpretable Controllability in Object-Centric Learning
Jinwoo Kim
Janghyuk Choi
Jaehyun Kang
Changyeon Lee
Ho-Jin Choi
Seon Joo Kim
OCL
21
0
0
13 Oct 2023
Pseudo-Generalized Dynamic View Synthesis from a Video
Xiaoming Zhao
Alex Colburn
Fangchang Ma
Miguel Angel Bautista
J. Susskind
A. Schwing
27
11
0
12 Oct 2023
DyST: Towards Dynamic Neural Scene Representations on Real-World Videos
Maximilian Seitzer
Sjoerd van Steenkiste
Thomas Kipf
Klaus Greff
Mehdi S. M. Sajjadi
VGen
ViT
16
8
0
09 Oct 2023
Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models
Jianglong Ye
Peng Wang
Kejie Li
Yichun Shi
Heng Wang
DiffM
22
72
0
04 Oct 2023
Rethinking Amodal Video Segmentation from Learning Supervised Signals with Object-centric Representation
Ke Fan
Jingshi Lei
Xuelin Qian
Miaopeng Yu
Tianjun Xiao
Tong He
Zheng-Wei Zhang
Yanwei Fu
VOS
8
4
0
23 Sep 2023
Consciousness in Artificial Intelligence: Insights from the Science of Consciousness
Patrick Butlin
R. Long
Eric Elmoznino
Yoshua Bengio
Jonathan C. P. Birch
...
L. Mudrik
Megan A. K. Peters
Eric Schwitzgebel
Jonathan Simon
Rufin VanRullen
LLMAG
16
95
0
17 Aug 2023
Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesis
Yuxin Wang
Wayne Wu
Dan Xu
35
9
0
05 Aug 2023
Linking vision and motion for self-supervised object-centric perception
Kaylene C. Stocking
Zak Murez
Vijay Badrinarayanan
Jamie Shotton
Alex Kendall
Claire Tomlin
Christopher P. Burgess
OCL
22
0
0
14 Jul 2023
Equivariant Single View Pose Prediction Via Induced and Restricted Representations
Owen Howell
David M. Klee
Ondrej Biza
Linfeng Zhao
Robin G. Walters
16
2
0
07 Jul 2023
Tell Me Where to Go: A Composable Framework for Context-Aware Embodied Robot Navigation
Harel Biggie
Ajay Narasimha Mopidevi
Dusty Woods
Christoffer Heckman
LM&Ro
11
11
0
15 Jun 2023
DORSal: Diffusion for Object-centric Representations of Scenes et al.
Allan Jabri
Sjoerd van Steenkiste
Emiel Hoogeboom
Mehdi S. M. Sajjadi
Thomas Kipf
14
17
0
13 Jun 2023
DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent Particles
Tal Daniel
Aviv Tamar
DiffM
14
3
0
09 Jun 2023
Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities
Andrii Zadaianchuk
Maximilian Seitzer
Georg Martius
OCL
12
15
0
07 Jun 2023
Rotating Features for Object Discovery
Sindy Lowe
Phillip Lippe
Francesco Locatello
Max Welling
OCL
17
21
0
01 Jun 2023
Sensitivity of Slot-Based Object-Centric Models to their Number of Slots
Roland S. Zimmermann
Sjoerd van Steenkiste
Mehdi S. M. Sajjadi
Thomas Kipf
Klaus Greff
OCL
23
5
0
30 May 2023
GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation
Jingyang Huo
Qiang Sun
Boyan Jiang
Haitao Lin
Yanwei Fu
19
18
0
26 May 2023
Provably Learning Object-Centric Representations
Jack Brady
Roland S. Zimmermann
Yash Sharma
Bernhard Schölkopf
Julius von Kügelgen
Wieland Brendel
OCL
24
31
0
23 May 2023
SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models
Ziyi Wu
Jingyu Hu
Wuyue Lu
Igor Gilitschenski
Animesh Garg
DiffM
OCL
23
44
0
18 May 2023